Skip to content

4x speedup #622

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
May 6, 2023
Merged

4x speedup #622

merged 2 commits into from
May 6, 2023

Conversation

montanalow
Copy link
Contributor

Using Array rather than Vec prevents an allocation, and speeds up results of cosine_similarity 4x for large scans, putting it within a few percentage points of the vectorized C equivalent code.

@montanalow
Copy link
Contributor Author

@eeeebbbbrrrr This one gets the c equivalent vectorization speedups with 0 copy arrays for most cases, except for aggregates. Those need a bit more work.

@montanalow montanalow merged commit 81ff9f3 into master May 6, 2023
@montanalow montanalow deleted the montana/array branch May 6, 2023 22:56
SilasMarvin pushed a commit that referenced this pull request Oct 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy