Upload files to OpenRails with support for batch processing, chunked uploads, and multiple formats
OpenRails supports uploading a wide range of document types including PDF, DOCX, PPTX, images, video, and audio files. Uploaded documents are automatically processed through the ingestion pipeline where they are parsed, chunked, embedded, and indexed for RAG retrieval.
| Category | Formats | Processing |
|---|---|---|
| Documents | PDF, DOCX, PPTX, TXT, CSV, JSON, HTML, Markdown | Text extraction and chunking |
| Images | PNG, JPG, JPEG, TIFF, BMP, GIF | OCR for text extraction |
| Video | MP4, AVI, MOV, MKV, WEBM | Speech-to-text transcription |
| Audio | MP3, WAV, M4A, OGG, FLAC, WEBM | Speech-to-text transcription |
From the sidebar, go to Data Lakes and select the data lake where you want to upload documents.
Click the Upload Files button or drag and drop files directly onto the upload area.
Choose one or more files from your file system. You can upload multiple files per request.
For large uploads, configure batch processing:
The upload progress bar shows the status of each file. Large files use chunked uploads automatically, splitting the file into smaller segments for reliable transfer.
After upload completes, files enter the ingestion pipeline. Check the Documents tab in your data lake to see ingestion status for each file.
Upload limits are configurable per deployment. Large files are automatically chunked for reliable transfer. Use multiple batches for larger collections to optimize processing throughput.