User-friendly library to find similar objects
You can find Full Project Documentation here
- Mission
- Open Source Project
- Features
- Requirements
- Development Status
- Install
- Quickstart
- Contributing
The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making.
This is an open source project released under the MIT license. Feel free to use, fork, clone, and contribute.
Find similar texts
- in different languages
- with or without stopwords
- using a dictionary (or not)
- using keywords (or not)
- nltk, pymorphy3
- See more in Full Documentation
- The package is already available on PyPI
- See more in Full Documentation
pip install find-similar
See more in Full Documentation
from find_similar import find_similar
texts = ['one two', 'two three', 'three four']
text_to_compare = 'one four'
find_similar(text_to_compare, texts, count=10)
[TokenText(text="one two", len(tokens)=2, cos=0.5), TokenText(text="three four", len(tokens)=2, cos=0.5), TokenText(text="two three", len(tokens)=2, cos=0)]
- The result is a list of TokenText instances ordered by cos
- cos is the similarity score between the texts
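To build some intuition for the cos values above, here is a minimal sketch of cosine similarity over word counts. It is an illustration only and not the library's actual implementation: the helper names cosine_similarity and rank_by_cosine are hypothetical, and it assumes plain whitespace tokenization with no stopwords, dictionary, or keyword handling.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity of two texts treated as bag-of-words vectors.

    Hypothetical helper: whitespace tokenization is an assumption,
    not necessarily what find_similar does internally.
    """
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    # Only tokens present in both texts contribute to the dot product.
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[token] * counts_b[token] for token in shared)
    squares_a = sum(c * c for c in counts_a.values())
    squares_b = sum(c * c for c in counts_b.values())
    if squares_a == 0 or squares_b == 0:
        # An empty text has no direction, so treat it as dissimilar.
        return 0.0
    return dot / math.sqrt(squares_a * squares_b)


def rank_by_cosine(query: str, candidates: list[str], count: int = 10):
    """Return up to `count` (text, score) pairs, most similar first."""
    scored = [(text, cosine_similarity(query, text)) for text in candidates]
    # Python's sort is stable, so ties keep their original order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:count]


texts = ['one two', 'two three', 'three four']
for text, score in rank_by_cosine('one four', texts):
    print(f'{text!r}: cos={score}')
```

With the Quickstart data, 'one four' shares exactly one of two words with 'one two' and with 'three four', so each scores 1 / sqrt(2 * 2) = 0.5, while 'two three' shares no words and scores 0.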
See more examples in Full Documentation
Contributions are welcome! For an easy start, please check: