Latenode LogoLatenode
RAG Database

AI Data Storage

RAG is currently in beta. Pricing, behavior, and limitations may change.

brave_gM7qog41yj.png

Purpose

AI Data Storage (RAG) is a component of the Latenode platform designed for storing and indexing text files, images, and other knowledge sources.

This tool is primarily intended to be used in conjunction with the AI Agent — it provides documents in the form of chunks, which the agent can then use to generate responses.

Use cases include:

  • Uploading and storing structured or unstructured content
  • Generating embedding vectors for fast semantic search
  • Running natural language search queries
  • Connecting to the RAG Search node inside a scenario

How to Access

You can access this feature via Data Storage → AI Data Storage (RAG) in the left-hand side menu.

brave_l3S0LYq3WK.png


Creating Storage

Click Create Storage to open the setup modal:

brave_awRQKmbKQs.png

Fill in the required fields: Storage Name, Chunk Size, Chunk Overlap

brave_nHbU4QmKzu.png


What are Chunk Size and Overlap?

  • Chunk Size — the number of tokens in a single chunk. Smaller chunks provide higher accuracy but increase the total number of chunks.
  • Chunk Overlap — the percentage of token overlap between neighboring chunks. Helps maintain context across them.

Managing Storage

Created storages are displayed in a table:

brave_R2vB1tRsYc.png

FieldDescription
NameStorage name
Chunk SizeNumber of tokens per chunk
Chunk OverlapOverlap between chunks in %
CreatedCreation date
UpdatedLast updated date

Uploading Files

Open a storage to access the upload interface. Drag-and-drop is supported.

brave_B5v1L58izT.png

After uploading:

  • Each file is processed and indexed (status: Processing)
  • Files are listed with size, upload date, and status
  • Editing or downloading files is currently not supported

brave_5f8Vn9IysJ.png


Multimodal RAG Features

For working with images and non-textual data, RAG Storage uses an advanced approach:

  • Automatic Image Description: When uploading images (JPEG, PNG), the system automatically generates their textual description (summary) using a multimodal LLM and indexes this description along with the text content.
  • Text Indexing: Text extracted via OCR from images (or PDF files) is also split into chunks and indexed.

This allows the AI Agent to effectively find answers to questions based on both textual and visual content.


Features & Limits

FeatureStatus
OCRSupported (English and Russian)
Image UploadSupported (if image contains text)
File EditingNot supported
File DownloadNot yet available
Automatic IndexingYes
Supported FormatsPDF, TXT, JSON, MD, PNG, JPG, and more
Upload via scenarioNot yet supported

Technical Details

ParameterValue
Max file size20 MB (50 MB planned)
Embedding modelCloudflare + LlamaIndex
Vector limit5 000 000 vectors per account
Billing0.0066 PNP tokens per page, charged only during file upload

Billing

  • PNP tokens are deducted upon file upload
  • Billing is based on pages/chunks
  • Vectorization cost: 0.0066 PNP tokens per page
  • "1 page" corresponds to approximately 1000 words or 5000 characters of text
  • For unstructured data (e.g. TXT, MD), the same linear pricing model applies — cost is proportional to total text length

Examples:

  • 10 pages (PDF/DOCX/PPTX) → 0.066 PNP tokens ($0.066)

  • TXT ≈ 10 000 words (≈ 50 000 chars) → 0.066 PNP tokens ($0.066)

  • MD ≈ 20 000 words (≈ 100 000 chars) → 0.132 PNP tokens ($0.132)

  • 100 pages → 0.66 PNP tokens ($0.66)

  • Queries via RAG Search are not additionally billed