You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Receive links from email sent to a unique per user alias.
Improve the way hackernews links are handled: make two KG entries, one for the target link and one for the discussion thread.
Start to connect documents: this can be done by extracting entities and created an edge between docs based on the number of shared entities. One complication is to figure out a normalisation strategy, so large docs are not more "strongly" linked to all other docs.
Improve the way sources are shown in the UI after the ask, mainly ensure this is consistent in terms of formatting.
Index many more sites and in the process fix bugs related to how sites are crawled, embedded and indexed.
Work out the admin features of re-processing existing URLs, to allow for improvements to things like chunking strategies. To facilitate a job to re-process (migrate) indexed URLs - we could add a version to each URL, to show which version of the system processed it. Another idea from this, is to treat these changes as migrations, by following the same strategies as done with SQL, i.e add a new chunking strategy to a folder and the system recognises and runs it.
Add tests! Especially for URL indexing, and ensuring failed URLs do not take down the rest in the batch.
use the summary and not the whole document for embedding of the text node.
Need to properly handle tables and formulae, both on input and output (generation).
Show to the user what parts of the contexts found his answer.
Another use case is "given all my documents, find a document in which x is mentioned".
Implement separate indexing service and use a smart chunking approach.
Improve the LLM prompting, use well known patterns.
use litellm for accessing multiple LLMs with a single API.
Serve the homepage as a static page: once this is figured out we can also start to serve more pages as static pages - for example all document node pages. Another idea is to expose past Q&A as static docs.
Add IP rate limiting, and swap from 429 to another status code to differentiate GCP cold start from too many requests from the same client or tokens exhausted.
Need a proper way to handle long text documents, right now the setup is very inefficient, and I have added hacky size reduction by chopping some words.
Explore use of pex to make start up times quicker.
UX
Implement an onboarding page with all keys actions explained.
Add tabs to popular document types; wikipedia, arxiv, HN, etc
Show a loader until the app is warmed up, or improve cold start.
Bugs
If auth fails on the BE log the user out on the UI, push to login page.
Styling
Update the LP to use some better styles, ref#1, ref#2, ref#3.
Done
Consider adding a feed page showing all recently indexed documents, their state etc.
Roadmap
A simple catch all issue for roadmap items.
Private roadmap with secret features is here.
Todo
Features:
ask
, mainly ensure this is consistent in terms of formatting.Technical
429
to another status code to differentiate GCP cold start from too many requests from the same client or tokens exhausted.UX
Bugs
Styling
Done
ask
page with the document inserted into the context.The text was updated successfully, but these errors were encountered: