tf-idf example prints the right score but the wrong document

When I added more examples to the documents array in the tf-idf example, the wrong document was shown as the most similar. For me, with scikit-learn version 0.24.1, the cosine similarities don't include the input document, so the index 'i' is actually one less than the corresponding document in the documents array. Therefore the most similar document turns out to be documents[highest_score_index + 1].

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tf-idf example prints the right score but the wrong document #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

tf-idf example prints the right score but the wrong document #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions