Unstract
LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
Why it is included
Unstract is an open-source, no-code platform purpose-built for extracting data from unstructured documents using LLMs, with high accuracy. Easily deploy API and ETL pipelines for your unstructured data.
Best for
Users exploring vetted FOSS alternatives in this space (information processing).
Strengths
- ~6,522 GitHub stars (per upstream list)
- Open source
Limitations
- Verify license, platform support, and security posture for your environment.
Good alternatives
Related tools
AI & Machine Learning
Open Intepreter
A natural language interface for computers
AI & Machine Learning
screenpipe
run agents that work for you in the background based on what you do
AI & Machine Learning
gptme
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
AI & Machine Learning
WrenAI
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language — get accurate SQL, charts, and BI insights. Supports 12+ data sources (PostgreSQL, BigQuery, Snowflake, etc.) and any LLM (OpenAI, Claude, Gemini, Ollama)
AI & Machine Learning
TEN Agent
Open-source framework for conversational voice AI agents
AI & Machine Learning
Huginn
Create agents that monitor and act on your behalf. Your agents are standing by!
