What It Does
- reads files from one or more buckets
- converts content to Markdown for indexing
- runs ingestion on configurable schedules
Required Environment Variables
Set these in `.env.rag`:
- S3_ACCOUNT1_ENDPOINT: S3 API endpoint URL
- S3_ACCOUNT1_ACCESS_KEY: access key ID
- S3_ACCOUNT1_SECRET_KEY: secret key
- S3_ACCOUNT1_REGION: bucket region
- S3_ACCOUNT1_USE_SSL: `True` or `False`
- S3_ACCOUNT1_BUCKETS: bucket name(s), comma-separated for multiple
- S3_ACCOUNT1_SCHEDULES: ingestion interval(s) in seconds (default is `3600`)
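
To make the variable list concrete, here is a minimal Python sketch of how these settings could be read and normalized. It is not the project's actual loader: the function name `load_s3_account`, the returned dictionary layout, and the comma-separated parsing of `S3_ACCOUNT1_SCHEDULES` are assumptions made for illustration.

```python
import os

# Variables the list above describes without a default (treated as required here).
REQUIRED = ("ENDPOINT", "ACCESS_KEY", "SECRET_KEY", "REGION", "BUCKETS")


def load_s3_account(prefix="S3_ACCOUNT1", env=None):
    """Collect one account's S3 settings from environment-style variables.

    Hypothetical helper: `prefix` and the returned keys are illustrative.
    """
    env = os.environ if env is None else env

    # Fail early and name every missing variable at once.
    missing = [f"{prefix}_{key}" for key in REQUIRED if not env.get(f"{prefix}_{key}")]
    if missing:
        raise ValueError("missing required variables: " + ", ".join(missing))

    def split_csv(value):
        # "docs, reports" -> ["docs", "reports"]
        return [part.strip() for part in value.split(",") if part.strip()]

    return {
        "endpoint": env[f"{prefix}_ENDPOINT"],
        "access_key": env[f"{prefix}_ACCESS_KEY"],
        "secret_key": env[f"{prefix}_SECRET_KEY"],
        "region": env[f"{prefix}_REGION"],
        # Anything other than "true" (case-insensitive) is read as False.
        "use_ssl": env.get(f"{prefix}_USE_SSL", "True").strip().lower() == "true",
        "buckets": split_csv(env[f"{prefix}_BUCKETS"]),
        # Assumption: several intervals may be given as a comma-separated list.
        "schedules": [int(s) for s in split_csv(env.get(f"{prefix}_SCHEDULES", "3600"))],
    }
```

Passing a plain dictionary as `env` makes the helper easy to try without touching the real environment.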
config.yaml Example
Configuration Reference
| Field | Required | Default | Description |
|---|---|---|---|
| endpoint | yes | — | S3 API endpoint URL |
| access_key | yes | — | Access key ID |
| secret_key | yes | — | Secret access key |
| region | yes | — | Bucket region |
| use_ssl | no | True | Whether to use HTTPS |
| buckets | yes | — | Bucket name(s), comma-separated |
| schedules | no | 3600 | Ingestion interval in seconds |
| request_delay | no | 0 | Seconds to wait between API requests. Increase to avoid rate-limiting (e.g. 0.1) |
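
The effect of `request_delay` can be pictured with a small Python sketch. This is an illustration of pausing between consecutive requests, not the connector's own code; the generator name `throttled` and its `sleep` parameter are hypothetical.

```python
import time


def throttled(items, request_delay=0.0, sleep=time.sleep):
    """Yield items one by one, pausing `request_delay` seconds between them.

    Hypothetical helper: `sleep` is injectable so the pause can be observed
    or replaced in tests.
    """
    for index, item in enumerate(items):
        # No pause before the first item, and none at all when the delay is 0.
        if index > 0 and request_delay > 0:
            sleep(request_delay)
        yield item


# Usage: with request_delay=0.1, two pauses separate three object keys.
for key in throttled(["a.pdf", "b.pdf", "c.pdf"], request_delay=0.1):
    pass  # an API request for `key` would go here
```

With the default of 0 the loop runs without pausing, which matches the table's description of the setting as an opt-in safeguard against rate-limiting.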
Multiple S3 Accounts
Add more `sources` entries (account2, account3, etc.), each with its own set of environment variables.
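
As a rough illustration of per-account variables, the Python sketch below lists which account prefixes are present in the environment. It assumes that additional accounts follow the same naming pattern as the first one (`S3_ACCOUNT2_ENDPOINT`, `S3_ACCOUNT3_ENDPOINT`, and so on), which the text above implies but does not spell out; the function name `find_account_prefixes` is hypothetical.

```python
import os
import re

# Assumed naming pattern: S3_ACCOUNT<number>_ENDPOINT marks a configured account.
ACCOUNT_PATTERN = re.compile(r"^(S3_ACCOUNT(\d+))_ENDPOINT$")


def find_account_prefixes(env=None):
    """Return account prefixes found in `env`, ordered by account number."""
    env = os.environ if env is None else env
    found = []
    for name in env:
        match = ACCOUNT_PATTERN.match(name)
        if match:
            # Keep the numeric part so that account 10 sorts after account 2.
            found.append((int(match.group(2)), match.group(1)))
    return [prefix for _, prefix in sorted(found)]
```

Each returned prefix can then be used to look up that account's remaining variables.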