Error : "Document ID 0 with page ID 1 already exists in the index" #31

Open
Leflak opened this issue Sep 30, 2024 · 0 comments


Leflak commented Sep 30, 2024

Hi, the error "Document ID 0 with page ID 1 already exists in the index" happens when I create an index with the same files as a previous one, even with overwrite=True.

With help from DeepSeek, I added the following lines:
self.embed_id_to_doc_id = {}
self.indexed_embeddings = []
self.doc_ids_to_file_names = {}
self.doc_id_to_metadata = {}
self.highest_doc_id = -1

... right after line 317 of colpali.py, "shutil.rmtree(index_path)". This seems to actually clear the existing index held in memory, instead of only deleting the folder on disk.
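To illustrate why resetting those attributes might matter, here is a minimal sketch of the suspected behaviour. It is not the library's code: `ToyIndex`, `add_page`, `create_index`, `_reset_state`, and the `reset_memory` flag are hypothetical names made up for this example, and only the five attribute names and the error message come from the report above. The assumption is that the object keeps its document bookkeeping in memory, so deleting the folder alone leaves stale entries behind.

```python
import shutil
import tempfile
from pathlib import Path


class ToyIndex:
    """Hypothetical stand-in for the real model class (not the library's code)."""

    def __init__(self):
        self._reset_state()

    def _reset_state(self):
        # The five attributes named in the issue, set back to their empty values.
        self.embed_id_to_doc_id = {}
        self.indexed_embeddings = []
        self.doc_ids_to_file_names = {}
        self.doc_id_to_metadata = {}
        self.highest_doc_id = -1

    def add_page(self, doc_id, page_id, file_name):
        # Refuse duplicates, mimicking the error message from the issue.
        if (doc_id, page_id) in self.embed_id_to_doc_id.values():
            raise ValueError(
                f"Document ID {doc_id} with page ID {page_id} "
                "already exists in the index"
            )
        embed_id = len(self.indexed_embeddings)
        self.indexed_embeddings.append([0.0])  # placeholder embedding
        self.embed_id_to_doc_id[embed_id] = (doc_id, page_id)
        self.doc_ids_to_file_names[doc_id] = file_name
        self.highest_doc_id = max(self.highest_doc_id, doc_id)

    def create_index(self, index_path, overwrite=False, reset_memory=True):
        index_path = Path(index_path)
        if index_path.exists():
            if not overwrite:
                raise FileExistsError(f"{index_path} already exists")
            shutil.rmtree(index_path)  # removes the folder on disk only
            if reset_memory:
                self._reset_state()  # the workaround: also clear in-memory state
        index_path.mkdir(parents=True)
        self.add_page(0, 1, "a.pdf")  # index one page of a made-up file


root = Path(tempfile.mkdtemp())
idx = ToyIndex()
idx.create_index(root / "demo")
# With the in-memory reset, overwriting the same index works.
idx.create_index(root / "demo", overwrite=True)
# Without it, the stale bookkeeping triggers the duplicate error.
try:
    idx.create_index(root / "demo", overwrite=True, reset_memory=False)
except ValueError as err:
    print(err)
```

If the real class behaves like this toy, that would explain why overwrite=True alone is not enough when the same object is reused in one session.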

Sorry if this is not the proper way to raise this. I am not a developer and do not know how GitHub works.
