Understanding how OpenRails processes and retrieves your documents
When you create a data lake and upload documents, OpenRails automatically handles the heavy lifting — parsing, chunking, embedding, and indexing your content so it can be searched and referenced by your chatbots and agents. You don't need to configure individual collections manually.
Each file is processed based on its format — text is extracted from PDFs, Word documents, and presentations. Images are OCR'd. Audio and video are transcribed.
The extracted text is split into smaller segments that are suitable for search and retrieval. The platform handles chunking automatically based on the content type.
Each chunk is converted into a searchable representation and stored in the knowledge base. This is what enables your chatbots and agents to find relevant information when answering questions.
OpenRails uses two complementary search methods to find the most relevant content:
Finds content that matches the meaning of the question, even when the exact words don't appear in the document.
Understands relationships between people, organizations, concepts, and events — answering questions that require connecting information across documents.
Both methods run automatically when a user asks a question. Results are combined and filtered by the user's security tier before being passed to the AI model.