Storage Guide
A hands-on guide to working with RUAL storage: creating your first document, building search queries, modelling data without relations, and using full-text search effectively.
This guide builds on the Storage documentation and walks you through practical examples of working with RUAL's built-in storage system. If you haven't read the storage docs yet, we recommend starting there to understand the fundamentals of metadata, document updates, and expiry.
Creating Your First Document
To create a document, you need two things: a storage reference and a set of mutations that define the fields. Here is the block chain for creating a document:
- Add a
storageblock and select the storage you want to write to. - Add a
mutations_set_bp_field_multipleblock to define the fields and their values. - Add a
function_create_documentblock, connect the storage and mutations to its in-pins, and connect the flow pin from your flow.
The function_create_document block returns the created document including the generated _meta data with its unique guid.
Field Name Rules
Field names are always stored in lowercase. If you use UserName, it will be stored as username. Fields cannot include dots (.) — if they do, RUAL automatically converts them into a nested object structure.
Updating Documents
RUAL uses a transaction-based update system through mutations. Each update is processed sequentially, ensuring that concurrent updates don't overwrite each other. This makes it safe to use counters and array operations.
To update a document, use the function_update_document block with the document's guid and the mutations you want to apply.
Common mutation operations include:
| Block | Operation |
|---|---|
mutations_set_bp_field_multiple |
Set one or more fields to specific values. |
mutations_set_custom_field |
Set a field using a dynamic key and value. |
mutations_increment_by_field |
Increment a numeric field by a given amount. |
mutations_decrement_by_field |
Decrement a numeric field by a given amount. |
mutations_add_to_array_by_field |
Add a unique value to an array field. |
mutations_remove_from_array_by_field |
Remove a value from an array field. |
mutations_remove_by_field_multiple |
Remove one or more fields from the document entirely. |
mutations_add |
Combine multiple mutation sets into one. |
Getting a Document by GUID
The fastest way to retrieve a document is by its _meta.guid using the function_get_document block. This method is faster than searching and is easier for the system to cache. Whenever possible, prefer get document over search queries for individual document retrieval.
To retrieve multiple documents at once, use the function_get_documents block with an array of GUIDs.
Building Search Queries
Search queries are built by connecting query blocks together and feeding them into a function_search block. A typical search flow looks like this:
- Create the query conditions using blocks like
query_bool_term_fields(exact match on a field). - Wrap them in a boolean query using
query_bool_filter,query_bool_must, orquery_bool_should. - Optionally add sorting with
query_sort_fieldand a limit withquery_limitor anumberblock connected to the limit pin. - Combine all query parts using
query_and(multiple queries). - Connect the final query output to the query in-pin of
function_search. - Select the storage on the
function_searchblock.
Search Query Example
Below is an example of a search that finds documents in a messages storage where user_guid matches a specific value, sorted by creation date descending, limited to 10 results:
| Block | Purpose | Connects To |
|---|---|---|
value_default |
The user GUID to search for | query_bool_term_fields (user_guid field) |
query_bool_term_fields |
Match documents where user_guid equals the value |
query_bool_filter |
query_bool_filter |
Wrap the term query in a filter context | query_and |
query_sort_field |
Sort by _meta.created descending |
query_and |
query_and |
Combine filter and sort into one query | function_search (query pin) |
number_default (10) |
Limit results to 10 | function_search (limit pin) |
The function_search block returns an array of matching documents. For queries that should return at most one result, use function_search_single_result instead.
Query Types Overview
| Query Block | What It Does |
|---|---|
query_bool_must |
All provided queries must match (AND logic). |
query_bool_should |
One or more queries should match (OR logic). |
query_bool_filter |
Filters results using a boolean query (does not affect scoring). |
query_bool_must_not |
All provided queries must not match (exclusion). |
query_bool_term_fields |
Exact match on a selected field. |
query_bool_range_field |
Range comparison (greater than, less than) on a numeric field. |
query_bool_range_date_field |
Range comparison on a date field. |
query_bool_exists_bpfield |
Check if a field exists in the document. |
query_bool_wildcard_string_field |
Wildcard matching on a field value. |
query_bool_prefix_string_field |
Check if a field value starts with a given prefix. |
query_bool_term_fuzzy |
Find documents with terms similar to the search term (fuzzy matching). |
Search Tips
"Joe" and search for "joe", the document will not be found. It is good practice to store searchable values in lowercase to ensure accurate search results.
- Use
get documentwhen possible: Retrieving a document by_meta.guidis the fastest method and is easy to cache. Avoid creating custom unique identifiers and searching for them. - Use
query_bool_filterfor most queries: Filter context is more efficient thanmustbecause it does not calculate relevance scores. - Limit your results: Always set a reasonable limit on search results. Smaller result sets are faster to return and process.
- Select specific fields: Use the
query_source_by_fieldblock to return only the fields you need, reducing data transfer. - Stream large result sets: For queries that return many documents, use
function_search_streamto process results as they arrive rather than loading everything into memory.
Real-Time Search Results
By default, search queries do not wait for the most recent inserts to be indexed. If you create a document and immediately search for it, it might not appear in the results yet. In most cases this slight delay is acceptable.
If your use case requires the absolute latest data, you can use the disable cache query block. This forces the system to wait until all pending inserts are indexed before executing the search. However, in high-insert environments, this can introduce significant delays.
Data Modelling Without Relations
RUAL storage is schema-less and does not support relationships between storage entities. This means you cannot do traditional SQL-style joins. Instead, you should denormalize your data: include all the information you need within each document.
Example: Orders and Customers
In a traditional relational database, you would have a separate customers table and reference the customer ID from the orders table. In RUAL, you should include the relevant customer information directly in the order document:
This way, when you search for orders, you immediately have the customer name and email available without needing a second query. If the customer changes their name, you can update all their order documents using a storage event on the customers storage.
Denormalization Strategies
- Embed frequently accessed data: Include the data you display most often directly in the document. For example, store
customer_nameon every order instead of justcustomer_guid. - Keep a reference GUID: Always store the
guidof related documents so you can useget documentwhen you need the full, up-to-date data. - Use storage events for sync: When a source document changes (e.g., customer name), use a storage event to update all documents that embed that data.
- Store counters separately: Instead of using aggregation queries to count documents, maintain a counter field on a parent document and increment it when child documents are created. Alternatively, store aggregated stats in Redis cache.
Full-Text Search
RUAL's storage includes a built-in full-text search engine. To take advantage of it:
- Use
query_bool_simple_query_string_fieldfor user-facing search inputs. This block handles common search syntax that users expect. - Use
query_bool_wildcard_string_fieldfor pattern matching with wildcards. - Use
query_bool_prefix_string_fieldfor autocomplete-style "starts with" searches. - Use
query_bool_term_fuzzyfor fuzzy matching that tolerates typos.
Remember that search queries are case-sensitive. Store searchable text in lowercase and convert user input to lowercase before searching to ensure consistent results.
Document Lifecycle
Understanding the full lifecycle of a document helps you design robust data flows:
| Operation | Block | Notes |
|---|---|---|
| Create | function_create_document |
Returns the created document with _meta. Triggers on_created and on_saved storage events. |
| Read | function_get_document |
Fastest retrieval method by guid. Preferred over search for single documents. |
| Search | function_search |
Full-text search with query blocks. Results may have a slight indexing delay. |
| Update | function_update_document |
Transaction-based. Creates a revision if enabled. Triggers on_updated and on_saved events. |
| Remove (soft) | function_remove_document |
Marks as removed but data stays. Excluded from search by default. Can be restored. |
| Restore | function_restore_document |
Restores a removed document back to active state. |
| Delete (permanent) | function_delete_document |
Permanently deletes. Cannot be recovered unless revisions are enabled. |
remove (soft delete) in most cases instead of permanent delete. This allows recovery if needed. You can set up a repeating event to permanently purge removed documents after a retention period.
Caching with Redis
For high-traffic applications, caching search results in Redis is critical to maintain performance. Instead of hitting storage for every page visit, check Redis first and only query storage when the cache is empty or expired.
Common caching patterns include:
- Cache frequently accessed search results with a TTL (time to live) using
redis cacheblocks. - Invalidate or update the cache when documents change, using storage events.
- Store aggregated statistics in Redis and update them periodically with repeating events.