high quality guides and tutorials

full stack deep learning - llm bootcamp: https://fullstackdeeplearning.com/llm-bootcamp/spring-2023/

no code

prototyping
- nat.dev
- https://play.vercel.ai/
prompt engineering
- https://github.com/ianarawjo/ChainForge An open-source visual programming environment for LLM experimentation and prompt evaluation.
  - alternatives
    - https://github.com/logspace-ai/langflow a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
    - https://github.com/FlowiseAI/Flowise - visual langchain builder
- vellum.ai has a visual flow editor, plus tools for prompt engineering, semantic search, version control, quantitative testing, and performance monitoring.

highest code level

owning the endpoint
- OpenLM - https://github.com/r2d4/openlm OpenAI-compatible Python client that can call any LLM
SDK wrappers
- https://github.com/minimaxir/simpleaichat
- https://github.com/vercel-labs/ai
prompt tooling
- langchain
  - https://www.pinecone.io/learn/langchain/
- llamaindex
- deepset haystack
- guardrails
- scale spellbook
vector databases

llmops

portkey https://twitter.com/jumbld/status/1648684887988117508?s=46&t=90xQ8sGy63D2OtiaoGJuww
helicone
Ozone - prompt unit testing https://twitter.com/at_sushi_/status/1667004844153131008
https://log10.io/ - pivoting to llm quality monitoring
eval
"LLM observability": Baserun, Athina, LangSmith, Parea, Arize, Langfuse
evals
- scorecard
- https://www.arthur.ai/blog/introducing-arthur-bench
data quality
- cleanlab.ai
- deepchecks <- bigger
- lilac ai
- galileo

routing

https://github.com/BerriAI/litellm
martian
openrouter
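
One thing tools in this category commonly offer is a single call that tries several providers in order and falls back when one fails. Below is a minimal sketch of that idea only, not how litellm, martian, or openrouter are implemented. The `Provider` type, `route_with_fallback`, and the two toy providers are hypothetical names made up for illustration.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical provider type: takes a prompt, returns a completion string.
Provider = Callable[[str], str]


def route_with_fallback(
    prompt: str, providers: Sequence[Tuple[str, Provider]]
) -> Tuple[str, str]:
    """Try each (name, provider) in order; return (name, completion) from the first success."""
    errors: List[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real router would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


def flaky_provider(prompt: str) -> str:
    """Toy provider that always fails, standing in for a timed-out endpoint."""
    raise TimeoutError("simulated timeout")


def echo_provider(prompt: str) -> str:
    """Toy provider that always succeeds."""
    return f"echo: {prompt}"


if __name__ == "__main__":
    chain = [("flaky", flaky_provider), ("echo", echo_provider)]
    print(route_with_fallback("hello", chain))
```

Real routers typically add things this sketch leaves out, such as cost or latency based selection, retries with backoff, and per-provider request translation.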

typing/json structure libraries

Microsoft TypeChat https://news.ycombinator.com/item?id=36803124
jsonformer
lmql
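
A rough sketch of the validate-and-retry pattern that motivates this category: ask the model for JSON, check it against an expected shape, and re-prompt with the error if it does not fit. This is an illustration under assumptions, not the API of TypeChat, jsonformer, or lmql (jsonformer, for instance, constrains generation rather than retrying). `parse_typed_json`, `make_fake_model`, and the flat `schema` dict are hypothetical.

```python
import json
from typing import Callable, Dict, Iterable


def parse_typed_json(
    generate: Callable[[str], str],
    prompt: str,
    schema: Dict[str, type],
    max_attempts: int = 3,
) -> dict:
    """Call `generate` until it returns a JSON object whose keys match `schema`."""
    message = prompt
    last_error = "no attempts made"
    for _ in range(max_attempts):
        raw = generate(message)
        try:
            data = json.loads(raw)  # raises json.JSONDecodeError, a ValueError subclass
            if not isinstance(data, dict):
                raise ValueError("expected a JSON object")
            for key, expected in schema.items():
                if key not in data:
                    raise ValueError(f"missing key {key!r}")
                if not isinstance(data[key], expected):
                    raise ValueError(f"key {key!r} should be {expected.__name__}")
            return data
        except ValueError as exc:
            last_error = str(exc)
            # Feed the validation error back so the next attempt can repair it.
            message = f"{prompt}\n\nPrevious output was invalid ({exc}). Reply with JSON only."
    raise ValueError(f"no valid JSON after {max_attempts} attempts: {last_error}")


def make_fake_model(replies: Iterable[str]) -> Callable[[str], str]:
    """Stand-in for an LLM call: returns canned replies in order."""
    it = iter(replies)
    return lambda _prompt: next(it)


if __name__ == "__main__":
    model = make_fake_model(["not json", '{"name": "Ada", "age": 36}'])
    print(parse_typed_json(model, "Describe a person", {"name": str, "age": int}))
```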

lower code level

huggingface transformers https://huggingface.co/learn/nlp-course/chapter0/1?fw=pt
lightning https://twitter.com/_willfalcon/status/1665826619200614401
vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention (github) - PagedAttention is super efficient for production workloads and they do a great job with dynamic batching, queuing requests, etc.
SkyPilot https://github.com/skypilot-org/skypilot a framework for running LLMs, AI, and batch jobs on any cloud, offering maximum cost savings, highest GPU availability, and managed execution.

SkyPilot abstracts away cloud infra burdens:

- Launch jobs & clusters on any cloud
- Easy scale-out: queue and run many jobs, automatically managed
- Easy access to object stores (S3, GCS, R2)

vector databases

comparisons https://thedataquarry.com/posts/vector-db-1/

https://news.ycombinator.com/item?id=36943318

chroma
pinecone
weaviate
qdrant
marqo vector search
postgres
- supabase vector
- problems with it:
  https://twitter.com/nirantk/status/1674110063286571008?s=46

https://nextword.substack.com/p/vector-database-is-not-a-separate

Cloudflare Vectorize, announced on September 27th, 2023
MongoDB Atlas Vector Search, launched on June 22nd, 2023
Databricks vector search offering, announced on June 28th, 2023
Oracle integrated vector database, announced on September 19th, 2023
IBM to announce vector database preview in Q4 2023
Of course, companies such as Elastic and Microsoft already had vector DB offerings much earlier.
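
For orientation, the core query that the stores in this section answer is nearest-neighbor lookup over embedding vectors. The sketch below is a brute-force version using cosine similarity, with a made-up in-memory `index` dict and hypothetical function names. Production systems generally rely on approximate indexes (HNSW is a common choice) rather than scanning every vector.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(
    query: Sequence[float], index: Dict[str, Sequence[float]], k: int = 2
) -> List[Tuple[str, float]]:
    """Score every stored vector against the query and return the k most similar."""
    scored = [(doc_id, cosine_similarity(query, vec)) for doc_id, vec in index.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


if __name__ == "__main__":
    # Toy 2-dimensional "embeddings"; real ones have hundreds of dimensions.
    index = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
    for doc_id, score in top_k([1.0, 0.1], index):
        print(doc_id, round(score, 3))
```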

ETL

psychic.dev

fully vertically integrated RAG cloud

vectara - 29m raised, from a former Cloudera founder
https://pezzo.ai - "enables you to build, test, monitor and instantly ship AI all in one platform, while constantly optimizing for cost and performance." - used by Meltwater CTO - from shack15
https://www.pulze.ai maybe?

infra

https://mlfoundry.com/
together.ai
model hosting and finetuning
- LLM Engine (https://llm-engine.scale.com) from Scale: an open source, self-hostable framework for open source LLM inference and fine-tuning.
- replicate

coding tools

https://github.com/danielgross/localpilot
https://github.com/continuedev/continue
https://github.com/mudler/LocalAI
https://vxtwitter.com/ex3ndr/status/1726863029919482167

misc

AI relational database https://github.com/georgia-tech-db/eva
finetune industry
- https://predibase.com/
AI devtools
- codegen.ai
safety/security
- openai moderation endpoint (free)
- meta llamaguard/purple llama https://arxiv.org/abs/2312.06674
