Sentence compare alternate impl #1612

Open
wants to merge 8 commits into
base: master

capaximperii

This uses the word vectors provided by the spaCy English model "en_core_web_lg", which is a one-time 1 GB download.
Some edits are breaking changes, for example the search filters, but the aim is to start a discussion on how to build the KMeans model once by giving the comparison algorithm access to all of the data.
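To make the idea concrete, here is a rough sketch of "cluster once, then compare only within the matching cluster". It is not the code in this PR: the tiny `TOY_VECTORS` table is a made-up stand-in for the spaCy vectors, the k-means loop is a plain-Python simplification, and `ClusteredIndex` is a hypothetical name used only for illustration.

```python
import math
import random

# Hypothetical 3-d vectors standing in for the spaCy "en_core_web_lg" vectors.
TOY_VECTORS = {
    "hello": [0.9, 0.1, 0.0],
    "hi": [0.8, 0.2, 0.0],
    "weather": [0.0, 0.9, 0.1],
    "rain": [0.1, 0.8, 0.2],
    "pizza": [0.0, 0.1, 0.9],
    "food": [0.1, 0.0, 0.8],
}
DIMS = 3


def sentence_vector(text):
    """Average the vectors of the known words; zero vector if none are known."""
    words = [w for w in text.lower().split() if w in TOY_VECTORS]
    if not words:
        return [0.0] * DIMS
    return [sum(TOY_VECTORS[w][i] for w in words) / len(words) for i in range(DIMS)]


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector has zero length."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def nearest(point, centroids):
    """Index of the centroid with the smallest squared Euclidean distance."""
    def sqdist(c):
        return sum((x - y) ** 2 for x, y in zip(point, c))
    return min(range(len(centroids)), key=lambda i: sqdist(centroids[i]))


def kmeans(points, k, iterations=20, seed=0):
    """Basic Lloyd iteration; an empty cluster keeps its previous centroid."""
    centroids = random.Random(seed).sample(points, k)
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        centroids = [
            [sum(col) / len(c) for col in zip(*c)] if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids


class ClusteredIndex:
    """Fit the clusters once over all statements, then search one cluster."""

    def __init__(self, statements, k):
        self.statements = list(statements)
        self.vectors = [sentence_vector(s) for s in self.statements]
        self.centroids = kmeans(self.vectors, k)
        self.labels = [nearest(v, self.centroids) for v in self.vectors]

    def closest(self, text):
        query = sentence_vector(text)
        label = nearest(query, self.centroids)
        candidates = [i for i, l in enumerate(self.labels) if l == label]
        best = max(candidates or range(len(self.statements)),
                   key=lambda i: cosine(query, self.vectors[i]))
        return self.statements[best]


index = ClusteredIndex(["hello hi", "weather rain", "pizza food"], k=3)
match = index.closest("rain weather")
```

The point of the sketch is only that the expensive fitting step happens once in the constructor, which is why the comparison needs access to all statements up front rather than one pair at a time.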

@gunthercox
Owner

Let me know if you would like me to do a code review at any point. I've been holding off since there appear to be test failures and merge conflicts on this branch. I'd be happy to offer any additional insight that I can if it helps.

@capaximperii
Author

It requires some insight for sure. I will fix the integration issues, but first: this is valid only for languages for which spaCy provides vectors, so it goes against the ChatterBot philosophy of being language-independent. Also, the equivalent sentence can be found very quickly, but after the search function returns, the ChatterBot engine still takes an (unacceptably) long time to find the response. I have not had much time to look into why, but it looks like something in the SQLite lookup.
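One way to check the SQLite suspicion would be to profile just the lookup step. The following is a minimal sketch using only the standard library; `find_response`, the table name, and the columns are hypothetical stand-ins, not ChatterBot's actual code or schema.

```python
import cProfile
import io
import pstats
import sqlite3


def find_response(connection, text):
    """Hypothetical stand-in for the lookup that runs after the search returns."""
    row = connection.execute(
        "SELECT text FROM statement WHERE in_response_to = ?", (text,)
    ).fetchone()
    return row[0] if row else None


def profile_lookup(connection, text):
    """Run the lookup under cProfile and return (result, text report)."""
    profiler = cProfile.Profile()
    result = profiler.runcall(find_response, connection, text)
    stream = io.StringIO()
    stats = pstats.Stats(profiler, stream=stream)
    stats.sort_stats("cumulative").print_stats(5)  # top 5 by cumulative time
    return result, stream.getvalue()


# Illustrative in-memory database with an assumed schema.
connection = sqlite3.connect(":memory:")
connection.execute("CREATE TABLE statement (text TEXT, in_response_to TEXT)")
connection.execute("INSERT INTO statement VALUES (?, ?)", ("Hi there!", "hello"))

result, report = profile_lookup(connection, "hello")
```

If the cumulative time in the report is dominated by the `execute` call, that would point at the query itself (for instance a missing index) rather than at the comparison code.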

2 participants