Conversation

@vdaita vdaita commented Jun 12, 2024

No description provided.

@vdaita vdaita requested review from JialeTomTian and ganler June 12, 2024 19:11

@ganler ganler left a comment

Did a quick review; will check the details again later.

from typing import List, Tuple

from openai import Client
from sklearn.metrics.pairwise import cosine_similarity

sklearn is a fairly heavy dependency. Ideally we want lighter dependencies (to avoid the risk of dep conflicts in the future), or we could just implement the function ourselves.
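
For reference, a dependency-free replacement could be a small numpy-only function along the lines of the sketch below (this assumes numpy is already in the dependency tree; the function name and array shapes are illustrative, not the actual implementation):

import numpy as np

def cosine_similarity(a, b):
    # Pairwise cosine similarity between the rows of a (n, d) and b (m, d).
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Normalize each row to unit length, guarding against zero vectors.
    a = a / np.clip(np.linalg.norm(a, axis=1, keepdims=True), 1e-12, None)
    b = b / np.clip(np.linalg.norm(b, axis=1, keepdims=True), 1e-12, None)
    return a @ b.T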

trust_remote_code: bool = False,
attn_implementation=None,
is_embedding: bool = False,
embedding_context_chunk_size: int = 30

Is this the number of chunks, or the number of tokens in each chunk? Better to make it more explicit :)
Also, I think specifying the size (number of tokens) of each chunk might be more intuitive.

@vdaita vdaita Jun 12, 2024

Currently it's lines of code per chunk; I will change the variable name.
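
For illustration only, chunking by lines of code could look roughly like the sketch below (chunk_by_lines and lines_per_chunk are hypothetical names, not the actual parameter):

from typing import List

def chunk_by_lines(source: str, lines_per_chunk: int = 30) -> List[str]:
    # Split a file into consecutive chunks of at most `lines_per_chunk` lines each.
    lines = source.splitlines()
    return [
        "\n".join(lines[i:i + lines_per_chunk])
        for i in range(0, len(lines), lines_per_chunk)
    ]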

@vdaita vdaita requested a review from ganler June 23, 2024 18:13