CocoIndex
@cocoindex.bsky.social
📤 31
📥 24
📝 92
#OpenSource
Transform data for AI 🥥🌴 ⭐
https://github.com/cocoindex-io/cocoindex
pinned post!
🎉 CocoIndex just hit 1,000 ⭐️ on GitHub! Huge thanks to everyone who starred, forked, contributed, or shared the love! We’re just getting started. 🚀 🔗
github.com/cocoindex-io...
#OpenSource
#DataInfrastructure
#CocoIndex
#LLM
#Buildinpublic
#AIInfrastructure
#ETL
7 months ago
1
9
2
reposted by
CocoIndex
Linghua Jin
3 days ago
Improve your ai data pipeline throughput x times by adaptive batching
cocoindex.io/blogs/batching
#ai
#opensource
loading . . .
Adaptive Batching - 5x throughput on your data pipelines | CocoIndex
Discover how CocoIndex delivers automatic batch processing for GPU workloads and machine learning pipelines. Framework-level batching optimizes performance for text embeddings and other AI operations ...
https://cocoindex.io/blogs/batching
0
2
1
reposted by
CocoIndex
Linghua Jin
11 days ago
CocoIndex made to #1 Github trending global in Rust - data transformation engine. Grateful to the Rust open source community and all the amazing Rustaceans for the support 💛
#rustlang
#github
#opensource
1
6
2
reposted by
CocoIndex
Linghua Jin
about 2 months ago
vibe coding, not vibe data
www.youtube.com/watch?v=crV7...
#ai
#youtube
#llm
#data
#opensource
loading . . .
Fast Iterate Your Indexing Strategy 🚀
YouTube video by CocoIndex
https://www.youtube.com/watch?v=crV7odEVYTE
0
4
1
reposted by
CocoIndex
Linghua Jin
3 months ago
CocoIndex -
github.com/cocoindex-io...
- Build fresh knowledge for AI made to the top 3 in trending github in
#rustlang
! 🌟 star if you like it! Let's go Rust 🦀. Data infra project should be on
#Rustlang
.
#opensource
#buildinpublic
0
7
3
A mental framework to really understand how Rust’s ownership, borrowing, and memory safety work. With this - moves, borrows, Send, Sync, and runtime checks become intuitive and predictable tools in your programming toolbox.
cocoindex.io/blogs/rust-o...
#rustlang
#opensource
#programming
loading . . .
Thinking in Rust: Ownership, Access, and Memory Safety | CocoIndex
A mental framework for Rust's memory safety concepts. Think systematically about ownership, references, Send, Sync, and Rc, Arc, RefCell, Mutex, etc.
https://cocoindex.io/blogs/rust-ownership-access
3 months ago
0
4
1
reposted by
CocoIndex
Linghua Jin
3 months ago
Multi-dimensional vectors? That’s how AI understands everything — text, images, audio, all at once. We broke it down here.
cocoindex.io/blogs/multi-...
🌟Star the repo if you like it!
github.com/cocoindex-io...
#opensource
#Vision
#LLM
#AI
#buildinpublic
loading . . .
Multi-Dimensional Vector Support in CocoIndex | CocoIndex
CocoIndex natively handles typed multi-dimensional vectors — from simple arrays to multi-vector embeddings, unlocks multimodal AI pipelines at scale.
https://cocoindex.io/blogs/multi-vector
1
7
2
reposted by
CocoIndex
Linghua Jin
4 months ago
#vibecoding
#opensource
#ClaudeCode
#Claude
#GPT5
#GPT
#OPENAI
#GEMINI
0
5
2
CocoIndex is officially supporting custom targets -
cocoindex.io/docs/custom_...
. We believe this work will add more flexibility for using coco / bring your own lego for targets as well beyond the flow ops. Thanks our community for the great suggestions!
loading . . .
Custom Targets | CocoIndex
Learn how to create custom targets in CocoIndex to export data to any destination including databases, cloud storage, file systems, and APIs. Build target specs and connectors with setup and data meth...
https://cocoindex.io/docs/custom_ops/custom_targets
4 months ago
0
2
0
🚀 Build Your Own Google Photo Search with Face Indexing at Scale. We just dropped a new tutorial on building a scalable face recognition pipeline using CocoIndex and Qdrant.
cocoindex.io/blogs/face-d...
#opensource
#faceRecognition
#LLM
#AI
#startup
#Qdrant
#CocoIndex
#ColdPlay
loading . . .
Indexing Faces for Scalable Visual Search - Build your own Google Photo Search | CocoIndex
Build a scalable face detection and recognition pipeline using CocoIndex. This tutorial covers extracting and embedding faces from images, structuring data for visual search, and exporting to a vect...
https://cocoindex.io/blogs/face-detection/
4 months ago
0
2
1
A mental framework -- as a simple and natural interpretation -- on Rust's memory safety models
cocoindex.io/blogs/rust-o...
#rustlang
#buildinpublic
loading . . .
Thinking in Rust: Ownership, Access, and Memory Safety | CocoIndex
A mental framework for Rust's memory safety concepts. Think systematically about ownership, references, Send, Sync, and Rc, Arc, RefCell, Mutex, etc.
https://cocoindex.io/blogs/rust-ownership-access/
4 months ago
0
6
2
reposted by
CocoIndex
Linghua Jin
4 months ago
A holistic, top-down perspective on Rust’s ownership, permission, and memory safety model
cocoindex.io/blogs/rust-o...
#Rustlang
#Programming
#MemorySafty
#Data
loading . . .
Thinking in Rust: Ownership, Access, and Memory Safety | CocoIndex
A mental framework for Rust's memory safety concepts. Think systematically about ownership, references, Send, Sync, and Rc, Arc, RefCell, Mutex, etc.
https://cocoindex.io/blogs/rust-ownership-access
0
5
1
#windsurf
#cursor
#codingagents
#llm
#coding
#tech
add a skeleton here at some point
4 months ago
0
3
1
reposted by
CocoIndex
HackerNoon
5 months ago
How to index academic research papers by extracting metadata (e.g., title, authors, abstract) for AI agents and AI workflows using LLMs and CocoIndex.
#ai
loading . . .
Turn Your PDF Library into a Searchable Research Database with 100 Lines of Code
https://hackernoon.com/turn-your-pdf-library-into-a-searchable-research-database-with-100-lines-of-code
0
3
2
reposted by
CocoIndex
Linghua Jin
5 months ago
What you get: ✅ LLM-extracted title, authors, abstract ✅ Embeddings for smart search ✅ Build 3 tables in one shot: metadata, author→paper, embeddings ✅ Real-time updates via Postgres + PGVector (one-line switch to Qdrant) Appreciate a star on the repo if it is helpful
github.com/cocoindex-io...
loading . . .
GitHub - cocoindex-io/cocoindex: Data transformation framework for AI. Ultra performant, with incremental processing.
Data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
0
3
1
reposted by
CocoIndex
Linghua Jin
5 months ago
📚 Build LLM-ready, metadata-rich indexes from academic papers PDF in minutes. Get started: 🔗https://cocoindex.io/blogs/academic-papers-indexing Repo: 🌟
github.com/cocoindex-io...
#LLM
#RAG
#SemanticSearch
#opensource
#DataInfrastructure
#KnowledgeGraphs
#VectorSearch
#DevTools
#OpenSource
1
6
2
reposted by
CocoIndex
Linghua Jin
5 months ago
github.com/cocoindex-io...
Transform data for AI got to
@github.com
Rust Trending this week 🚀!
@github-trending.bsky.social
#rustlang
#opensource
loading . . .
GitHub - cocoindex-io/cocoindex: Data transformation framework for AI. Ultra performant, with incremental processing.
Data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
0
4
1
Code ETL like LEGO
#opensource
#rustlang
github.com/cocoindex-io...
5 months ago
0
4
0
Wanna AI to understand your codebase? cocoindex has native support Checkout
github.com/cocoindex-io...
Appreciate a Github star! Step by step tutorial
cocoindex.io/blogs/index-...
to setup codebase indexing for coding agent in 10 min.
5 months ago
0
1
0
#OpenSource
- ideas move faster, together. Build openly, learn from others, and create impact beyond your own code.
5 months ago
0
4
1
way to go -
github.com/cocoindex-io...
2k stars
#OpenSource
#Rustlang
5 months ago
1
4
0
reposted by
CocoIndex
Linghua Jin
5 months ago
CocoIndex - super simple etl to prepare data for ai agents, with dynamic index - cross 2k Github stars today.
github.com/cocoindex-io...
When sources get updates, it automatically syncs to targets with minimal computation needed. Open source & on-prem ready. ⭐️ Star if it is helpful :)
1
3
1
reposted by
CocoIndex
Linghua Jin
5 months ago
Homepage v2 -
cocoindex.io
Ultra performant ETL for AI, within 100 lines of python code. Appreciate a star 🌟 on
github.com/cocoindex-io...
1
2
1
reposted by
CocoIndex
Linghua Jin
5 months ago
🚀 Introducing CocoInsight - Make your AI data pipeline exceptionally easy to understand — step-by-step. 🌟Works with CocoIndex:
github.com/cocoindex-io...
🎉Now live:
youtube.com/watch?v=MMrp...
#AI
#DataEngineering
#LLM
#RAG
#OpenSource
#VibeCoding
#AIInfrastructure
#MLOps
#LLMInfra
#CocoIndex
loading . . .
0
5
2
reposted by
CocoIndex
Linghua Jin
5 months ago
🚀 CocoIndex now officially supports LiteLLM. You can now plug any LiteLLM-supported model directly into your Coco AI data pipelines — making it even easier to run, experiment, and scale with your favorite LLMs.
cocoindex.io/docs/ai/llm#...
🌟Github:
github.com/cocoindex-io...
#LLM
#data
#AI
#etl
loading . . .
GitHub - cocoindex-io/cocoindex: Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Real-time data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
0
3
1
reposted by
CocoIndex
Linghua Jin
6 months ago
Real-time updates, syntax-aware chunking. Production-ready. Ultra-performant. Fully
#OpenSource
.
#LLM
#RAG
#AIForCode
#CodeSearch
#DevTools
#RealtimeAI
#OpenSource
#AIInfra
#Rust
#Python
#TreeSitter
#VectorDB
#Embeddings
#AIEngineering
#GenerativeAI
#Codex
#CodingAgents
#Claude
#Cursor
0
5
1
reposted by
CocoIndex
Linghua Jin
6 months ago
🚀 Build Real-Time
#Codebase
Indexing for LLMs with Tree-sitter for coding agents. ~100 lines of Python get started: 🔗
cocoindex.io/blogs/index-...
repo: 🌟
github.com/cocoindex-io...
1
4
1
Still duct-taping Python scripts to move data around? 🩹🐍 It’s 2025. Time to upgrade. Meet CocoIndex — real-time data transformation built in Rust, for devs who like things fast, fresh, and composable. 🦀⚡💅 👉
github.com/cocoindex-io...
#RustLang
#DataInfra
#OpenSourceTools
#LLMops
#BuildInPublic
loading . . .
GitHub - cocoindex-io/cocoindex: Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Real-time data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
6 months ago
0
2
0
Tired of clunky ETL pipelines and stale data? Meet CocoIndex—the ultra-performant, open-source framework built in Rust for real-time data transformation.
github.com/cocoindex-io...
#Data
#technology
#Opensource
#Buildinpublic
loading . . .
GitHub - cocoindex-io/cocoindex: Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Real-time data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
6 months ago
1
6
0
reposted by
CocoIndex
Linghua Jin
6 months ago
Most user-facing AI agents need data freshness. When users update documents, it's unexpected to see stale information in search results. If the search result is fed into an AI agent, it may mean unexpected response to users. It's more dangerous - users may take unexpected response without noticing.
0
3
1
reposted by
CocoIndex
HackerNoon
6 months ago
Learn how to build a real-time, incremental ETL pipeline using Amazon S3, SQS, and CocoIndex for efficient, low-latency data transformation and vector embedding
#ai
loading . . .
Real-Time S3 Processing Arrives on CocoIndex via AWS SQS Integration
https://hackernoon.com/real-time-s3-processing-arrives-on-cocoindex-via-aws-sqs-integration
1
6
2
Prototype: "Does it work?" Production: "Will it keep working?" CocoIndex is built for production from day 0 — no hacks, no hand-waving.
github.com/cocoindex-io...
loading . . .
GitHub - cocoindex-io/cocoindex: Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Real-time data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
6 months ago
0
4
0
Incremental, real-time, lineage-aware—CocoIndex is built for the new era of intelligent pipelines. Here’s the vision. 👇
cocoindex.io/blogs/cocoin...
loading . . .
Story of CocoIndex, at 1k stars 🎉 | CocoIndex
CocoIndex is the world's first open-source engine that supports both custom transformation logic and incremental processing specialized for data indexing. We just crossed 1k stars, thank you so much!
https://cocoindex.io/blogs/cocoindex-1k
6 months ago
0
1
0
checkout
github.com/cocoindex-io...
add a skeleton here at some point
6 months ago
0
3
0
reposted by
CocoIndex
Linghua Jin
6 months ago
🚀 Just launched: CocoIndex now supports real-time incremental processing from
#AmazonS3
with
#AmazonSQS
. If your AI agent needs fresh data in production systems, take a look (with end to end example). 📖 Dive in:
cocoindex.io/blogs/s3-inc...
🌟Repo:
github.com/cocoindex-io...
#OpenSource
#Data
loading . . .
Real-time data transformation pipeline with Amazon S3 bucket, SQS and CocoIndex | CocoIndex
Build real-time data transformation pipeline with S3 and CocoIndex.
https://cocoindex.io/blogs/s3-incremental-etl
1
4
1
#Opensource
is more than code—it's a conversation. A shared belief that building together beats building alone. 🌱👩💻
#opensource
#devcommunity
6 months ago
0
2
0
CocoIndex
github.com/cocoindex-io/cocoindex
is an ultra performant data transformation framework, with its core engine written in
#rustlang
. The problem it tries to solve is to make it easy to prepare fresh data for AI - creating embedding, building knowledge graphs, or other data transformations.
6 months ago
0
2
0
#OpenSource
is wild. strangers become collaborators. ideas evolve in the open. every PR is a small act of trust. Grateful for everyone building in public—your work moves the world forward.
#BuildInPublic
6 months ago
1
6
0
reposted by
CocoIndex
Linghua Jin
6 months ago
Text embeddings 101 - Build your own vector search in minutes. We walk through a minimal example — from file ingestion to embedding and querying with natural language, with step by step data insights 🧪 Tutorial:
cocoindex.io/blogs/text-e...
🔗 Repo:
github.com/cocoindex/co...
#AI
#LLM
#opensource
loading . . .
How to build index with text embeddings | CocoIndex
Indexing text with CocoIndex and text embeddings, and query it with natural language.
https://cocoindex.io/blogs/text-embeddings-101
0
4
1
reposted by
CocoIndex
Linghua Jin
6 months ago
Guess who just became our first
#OpenSource
AI contributor? Jules from
#Google
! Jules makes changes in a branch and sends a PR for review at
#GitHub
like a teammate— smooth as magic. We’re blown away ✨! 👇 Jule's PR in comment
#AI
#GenAI
#DeveloperTools
#MachineLearning
#LLM
#Coding
2
5
2
reposted by
CocoIndex
Linghua Jin
6 months ago
CocoIndex -
#opensource
Real-time data transformation framework for AI. Ultra performant, with incremental processing.
github.com/cocoindex-io...
We just reached 1.5k stars. Thanks the community for the love and support ❤️. Keep building!
#datainfra
#buildinpublic
#etl
#data
#python
#rustlang
1
13
1
reposted by
CocoIndex
HackerNoon
6 months ago
In this blog, we will build live image search and query it with natural language.
#ai
loading . . .
How to Build Live Image Search With Vision Model and Query With Natural Language
https://hackernoon.com/how-to-build-live-image-search-with-vision-model-and-query-with-natural-language
0
1
1
reposted by
CocoIndex
Linghua Jin
6 months ago
Search cute images! Use natural language to search: "an elephant", "cute animal" → get results instantly 🐿️✨
1
2
1
reposted by
CocoIndex
Linghua Jin
6 months ago
Build Real-Time Image Search 🔍 with vision model
#CLIP
by
#OpenAI
and
#VectorDatabase
@qdrant.bsky.social
🔗 repo:
github.com/cocoindex-io...
🔗 tutorial:
cocoindex.io/blogs/image-...
#ImageSearch
#MultimodalAI
#CLIPModel
#RealTimeAI
#VectorSearch
#AIInfra
#LLMApplications
#SemanticSearch
loading . . .
GitHub - cocoindex-io/cocoindex: Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Real-time data transformation framework for AI. Ultra performant, with incremental processing. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
1
3
1
reposted by
CocoIndex
Linghua Jin
6 months ago
Build Real-Time Product Recommendation Engine 👍 with LLM
#OpenAI
and Graph Database
@neo4j.com
🔗 repo:
github.com/cocoindex-io...
🔗 tutorial:
cocoindex.io/blogs/produc...
#LLM
#GraphDatabase
#Neo4j
#AIRecommendations
#GenerativeAI
#KnowledgeGraph
#LLMApplications
#GraphAI
0
8
2
Some of the most important tools we rely on today came from people sharing their work freely, without asking for anything in return.
#opensource
6 months ago
0
1
0
#Opensource
is one of the most powerful forces in tech.
6 months ago
0
1
0
let's code knowledge for AI
youtu.be/2KVkpUGRtnk?...
loading . . .
Code with me - Build Real-Time Knowledge Graph For Documents with LLM
YouTube video by CocoIndex
https://youtu.be/2KVkpUGRtnk?si=JupnG0dyWIyGMtrq
6 months ago
0
2
0
reposted by
CocoIndex
Linghua Jin
7 months ago
🧠 Why knowledge graphs? Knowledge graphs are foundational for context-aware AI agents. They give structure to otherwise unstructured information—letting agents reason, recall, and navigate complex domains.
1
3
1
reposted by
CocoIndex
Linghua Jin
7 months ago
github.com/cocoindex-io...
loading . . .
GitHub - cocoindex-io/cocoindex: ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego.
ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego. - cocoindex-io/cocoindex
https://github.com/cocoindex-io/cocoindex
1
3
1
Load more
feeds!
log in