
Commit 2ce220a

docs & hardening: nkode-review fixes, rollback CLI, backup/resume docs
nkode-review findings addressed:
- Add --resume as deprecated alias for --backup-dir with warning
- Add num_workers >= 1 validation in split_keys() and CLI --workers
- Replace assert statements with ValueError for multi-worker guards
- Update apply() docstring to accurately describe multi-worker dump ordering

New features:
- Add 'rvl migrate rollback' CLI command to restore vectors from backups

Documentation:
- Expand executor/planner/async_executor docstrings with full parameter docs
- Add 'Backup, Resume & Rollback' section to migration guide
- Add Performance Tuning section with throughput tables and worker guidance
- Add HNSW vs FLAT index capacity technical note
- Add CLI migration examples to cli.ipynb
- Update common flags (replace --resume with --backup-dir, --workers, etc.)

Test scripts:
- Add test_migration_e2e.py (500K doc benchmark)
- Add test_crash_resume_e2e.py (crash-safe resume verification)
- Add verify_data_correctness.py (float32->float16 value correctness)
1 parent e0c8e45 commit 2ce220a

11 files changed

Lines changed: 1343 additions & 63 deletions


docs/user_guide/cli.ipynb

Lines changed: 50 additions & 32 deletions
````diff
@@ -364,6 +364,35 @@
     "!rvl stats -i vectorizers"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Migrate\n",
+    "\n",
+    "The ``rvl migrate`` command provides a full workflow for changing index schemas without losing data. Common use cases include vector quantization (float32 → float16), algorithm changes (HNSW → FLAT), and adding/removing fields.\n",
+    "\n",
+    "```bash\n",
+    "# List available indexes\n",
+    "rvl migrate list --url redis://localhost:6379\n",
+    "\n",
+    "# Build a migration plan interactively\n",
+    "rvl migrate wizard --index myindex --url redis://localhost:6379\n",
+    "\n",
+    "# Or generate from a schema patch file\n",
+    "rvl migrate plan --index myindex --schema-patch patch.yaml --url redis://localhost:6379\n",
+    "\n",
+    "# Apply with backup and multi-worker quantization\n",
+    "rvl migrate apply --plan migration_plan.yaml --url redis://localhost:6379 \\\n",
+    "    --backup-dir /tmp/backups --workers 4 --batch-size 500\n",
+    "\n",
+    "# Validate the result\n",
+    "rvl migrate validate --plan migration_plan.yaml --url redis://localhost:6379\n",
+    "```\n",
+    "\n",
+    "See the [Migration Guide](how_to_guides/migrate-indexes.md) for detailed usage, performance tuning, and examples."
+   ]
+  },
   {
    "cell_type": "markdown",
    "metadata": {},
````
````diff
@@ -383,15 +412,6 @@
   },
   {
    "cell_type": "markdown",
-   "metadata": {},
-   "source": [
-    "### Choosing your Redis instance\n",
-    "By default rvl first checks if you have `REDIS_URL` environment variable defined and tries to connect to that. If not, it then falls back to `localhost:6379`, unless you pass the `--host` or `--port` arguments"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 11,
    "metadata": {
     "execution": {
      "iopub.execute_input": "2026-02-16T15:58:08.651332Z",
@@ -400,33 +420,23 @@
      "shell.execute_reply": "2026-02-16T15:58:10.874011Z"
     }
    },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "Indices:\n",
-      "1. vectorizers\n"
-     ]
-    }
-   ],
    "source": [
-    "# specify your Redis instance to connect to\n",
-    "!rvl index listall --host localhost --port 6379"
+    "### Choosing your Redis instance\n",
+    "By default rvl first checks if you have `REDIS_URL` environment variable defined and tries to connect to that. If not, it then falls back to `localhost:6379`, unless you pass the `--host` or `--port` arguments"
    ]
   },
   {
-   "cell_type": "markdown",
+   "cell_type": "code",
    "metadata": {},
    "source": [
-    "### Using SSL encryption\n",
-    "If your Redis instance is configured to use SSL encryption then set the `--ssl` flag.\n",
-    "You can similarly specify the username and password to construct the full Redis URL"
-   ]
+    "# specify your Redis instance to connect to\n",
+    "!rvl index listall --host localhost --port 6379"
+   ],
+   "outputs": [],
+   "execution_count": null
   },
   {
-   "cell_type": "code",
-   "execution_count": 12,
+   "cell_type": "markdown",
    "metadata": {
     "execution": {
      "iopub.execute_input": "2026-02-16T15:58:10.876537Z",
@@ -435,10 +445,10 @@
      "shell.execute_reply": "2026-02-16T15:58:13.099303Z"
     }
    },
-   "outputs": [],
    "source": [
-    "# connect to rediss://jane_doe:password123@localhost:6379\n",
-    "!rvl index listall --user jane_doe -a password123 --ssl"
+    "### Using SSL encryption\n",
+    "If your Redis instance is configured to use SSL encryption then set the `--ssl` flag.\n",
+    "You can similarly specify the username and password to construct the full Redis URL"
    ]
   },
   {
@@ -462,8 +472,16 @@
     }
    ],
    "source": [
-    "!rvl index destroy -i vectorizers"
+    "# connect to rediss://jane_doe:password123@localhost:6379\n",
+    "!rvl index listall --user jane_doe -a password123 --ssl"
    ]
+  },
+  {
+   "metadata": {},
+   "cell_type": "code",
+   "outputs": [],
+   "execution_count": null,
+   "source": "!rvl index destroy -i vectorizers"
   }
  ],
 "metadata": {
````

docs/user_guide/how_to_guides/migrate-indexes.md

Lines changed: 189 additions & 2 deletions
````diff
@@ -589,10 +589,15 @@ rvl migrate validate \
 - `--index` : Index name to migrate
 - `--plan` / `--plan-out` : Path to migration plan
 - `--async` : Use async executor for large migrations (apply only)
-- `--resume` : Path to checkpoint file for crash-safe quantization resume (apply only)
 - `--report-out` : Path for validation report
 - `--benchmark-out` : Path for performance metrics
 
+**Apply flags (quantization & reliability):**
+- `--backup-dir <dir>` : Directory for vector backup files. Enables crash-safe resume and manual rollback. Required when using `--workers` > 1.
+- `--batch-size <N>` : Keys per pipeline batch (default 500). Values 200–1000 are typical.
+- `--workers <N>` : Parallel quantization workers (default 1). Each worker opens its own Redis connection. See [Performance](#performance-tuning) for guidance.
+- `--keep-backup` : Retain backup files after a successful migration (default: auto-cleanup).
+
 **Batch-specific flags:**
 - `--pattern` : Glob pattern to match index names (e.g., `*_idx`)
 - `--indexes` : Explicit list of index names
````
````diff
@@ -631,6 +636,111 @@ If `apply` fails mid-migration:
 
 The underlying documents are never deleted by `drop_recreate`.
 
+## Backup, Resume & Rollback
+
+### How Backups Work
+
+When you pass `--backup-dir` (or `backup_dir` in the Python API), the
+migration executor saves **original vector bytes** to disk before mutating
+them. This enables two key capabilities:
+
+1. **Crash-safe resume** — if the process dies mid-migration, re-running the
+   same command with the same `--backup-dir` automatically detects partial
+   progress and resumes from the last completed batch.
+2. **Manual rollback** — the backup files contain the original (pre-quantization)
+   vector values, which can be restored to undo a migration.
+
+Backup files are written to the specified directory with this layout:
+
+```
+<backup-dir>/
+  migration_backup_<index_name>.header   # JSON: phase, progress counters, field metadata
+  migration_backup_<index_name>.data     # Binary: length-prefixed batches of original vectors
+```
+
+**Disk usage:** approximately `num_docs × dims × bytes_per_element`.
+For example, 1M docs with 768-dim float32 vectors ≈ 2.9 GB.
+
+By default, backup files are **automatically deleted** after a successful
+migration. Pass `--keep-backup` to retain them for post-migration auditing
+or potential rollback.
````
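The disk-usage formula in the diff above is easy to sanity-check in code. A minimal sketch follows; the helper name and dtype table are ours for illustration, not part of redisvl:

```python
# Hypothetical helper (not part of redisvl): estimate backup disk usage
# from the documented formula num_docs * dims * bytes_per_element.
DTYPE_BYTES = {"float64": 8, "float32": 4, "float16": 2}

def estimate_backup_gib(num_docs: int, dims: int, dtype: str = "float32") -> float:
    """Approximate backup size in GiB for a datatype migration."""
    total_bytes = num_docs * dims * DTYPE_BYTES[dtype]
    return total_bytes / 2**30

# 1M docs x 768-dim float32 vectors comes out to roughly 2.9 GiB,
# matching the example in the guide.
print(f"{estimate_backup_gib(1_000_000, 768):.1f} GiB")
```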
````diff
+### Crash-Safe Resume
+
+If a migration is interrupted (crash, network error, Ctrl+C), simply re-run
+the exact same command:
+
+```bash
+# Original command that was interrupted
+rvl migrate apply --plan plan.yaml --url redis://localhost:6379 \
+    --backup-dir /tmp/backups --workers 4
+
+# Just re-run it — progress is resumed automatically
+rvl migrate apply --plan plan.yaml --url redis://localhost:6379 \
+    --backup-dir /tmp/backups --workers 4
+```
+
+The executor detects the existing backup header, reads how many batches were
+completed, and resumes from the next unfinished batch. No data is duplicated
+or lost.
+
+```{note}
+**Single-worker vs multi-worker resume:** In single-worker mode, the full
+backup is written *before* the index is dropped, so a crash at any point
+leaves a complete backup on disk. In multi-worker mode, dump and quantize
+are fused (each worker reads, backs up, and converts its shard in one pass
+*after* the index drop). A crash during this fused phase may leave partial
+backup shards. Re-running detects and resumes from partial state.
+```
+
+### Rollback
+
+If you need to undo a quantization migration and restore original vectors,
+use the `rollback` command:
+
+```bash
+rvl migrate rollback --backup-dir /tmp/backups --url redis://localhost:6379
+```
+
+This reads every batch from the backup files and pipeline-HSETs the original
+(pre-quantization) vector bytes back into Redis. After rollback completes:
+
+- Your vector data is restored to its original datatype
+- You will need to **manually recreate the original index schema** if the
+  index was changed during migration (the rollback command restores data
+  only, not the index definition)
+
+```bash
+# After rollback, recreate the original index if needed:
+rvl index create --schema original_schema.yaml --url redis://localhost:6379
+```
+
+```{important}
+Rollback requires that backup files were preserved. Either pass
+`--keep-backup` during migration, or ensure the backup directory was not
+cleaned up. Without backup files, rollback is not possible.
+```
+
+### Python API for Rollback
+
+```python
+from redisvl.migration.backup import VectorBackup
+import redis
+
+r = redis.from_url("redis://localhost:6379")
+backup = VectorBackup.load("/tmp/backups/migration_backup_myindex")
+
+for keys, originals in backup.iter_batches():
+    pipe = r.pipeline(transaction=False)
+    for key in keys:
+        if key in originals:
+            for field_name, original_bytes in originals[key].items():
+                pipe.hset(key, field_name, original_bytes)
+    pipe.execute()
+
+print("Rollback complete")
+```
+
 ## Python API
 
 For programmatic migrations, use the migration classes directly:
````
````diff
@@ -652,6 +762,20 @@ report = executor.apply(plan, redis_url="redis://localhost:6379")
 print(f"Migration result: {report.result}")
 ```
 
+With backup and multi-worker quantization:
+
+```python
+report = executor.apply(
+    plan,
+    redis_url="redis://localhost:6379",
+    backup_dir="/tmp/migration_backups",  # enables crash-safe resume
+    batch_size=500,                       # keys per pipeline batch
+    num_workers=4,                        # parallel quantization workers
+    keep_backup=True,                     # retain backups for rollback
+)
+print(f"Quantized in {report.timings.quantize_duration_seconds}s")
+```
+
 ### Async API
 
 ```python
@@ -667,7 +791,12 @@ async def migrate():
     )
 
     executor = AsyncMigrationExecutor()
-    report = await executor.apply(plan, redis_url="redis://localhost:6379")
+    report = await executor.apply(
+        plan,
+        redis_url="redis://localhost:6379",
+        backup_dir="/tmp/migration_backups",
+        num_workers=4,
+    )
     print(f"Migration result: {report.result}")
 
 asyncio.run(migrate())
````
````diff
@@ -927,6 +1056,64 @@ print(f"Successful: {report.summary.successful}/{report.summary.total_indexes}")
 
 5. **Keep checkpoint files**: The `batch_state.yaml` is essential for resume. Don't delete it until the batch completes successfully.
 
+## Performance Tuning
+
+### Quantization Throughput
+
+Vector quantization (e.g. float32 → float16) is the most time-consuming
+phase of a datatype migration. Observed throughput on a local Redis instance:
+
+| Workers | Dims | Throughput | Notes |
+|---------|------|------------|-------|
+| 1 | 256 | ~70K docs/sec | Single worker is fastest for low dims |
+| 4 | 256 | ~62K docs/sec | Worker overhead exceeds parallelism benefit |
+| 1 | 1536 | ~15K docs/sec | Higher dims = more conversion work |
+| 4 | 1536 | ~15K docs/sec | I/O-bound; Redis is the bottleneck |
+
+**Guidance:**
+- For **low-dimensional vectors** (≤ 256 dims), use `--workers 1` (the default). Per-vector conversion is so cheap that process-spawning and extra-connection overhead outweigh the parallelism benefit.
+- For **high-dimensional vectors** (≥ 768 dims), `--workers 2-4` may help if the Redis server has available CPU headroom. Diminishing returns above 4–8 workers on a single Redis instance because Redis command processing is single-threaded.
+- The main bottleneck for large migrations is typically **index rebuild time** (the `FT.CREATE` background indexing after vectors are written), not quantization itself.
````
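The per-vector conversion work the guide describes can be sketched with NumPy. This is an illustrative stand-in, not redisvl's actual implementation; the function name is ours:

```python
import numpy as np

# Illustrative sketch of what a float32 -> float16 quantization pass does to
# each stored vector: reinterpret the raw bytes, convert, re-encode.
def quantize_vector_bytes(raw: bytes) -> bytes:
    """Re-encode raw float32 vector bytes as float16 (half the size)."""
    vec = np.frombuffer(raw, dtype=np.float32)
    return vec.astype(np.float16).tobytes()

original = np.array([0.25, -1.5, 3.0], dtype=np.float32).tobytes()
converted = quantize_vector_bytes(original)
print(len(original), len(converted))  # byte size halves: 12 -> 6
```

Values exactly representable in half precision (like those above) survive the round trip unchanged; in general the conversion is lossy, which is why `verify_data_correctness.py` in this commit checks value correctness.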
````diff
+### Batch Size
+
+The `--batch-size` flag controls how many keys are read/written per Redis
+pipeline round-trip. The default of 500 is a good balance. Larger batches
+(1000+) reduce round-trips but increase per-batch memory and latency.
````
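The batching pattern behind `--batch-size` is simple to illustrate: split the key set into fixed-size chunks and issue one pipeline round-trip per chunk. A minimal sketch, with a hypothetical helper (redisvl's executor does the equivalent internally; the `ValueError` guard mirrors the validation style this commit adds for `num_workers`):

```python
# Hypothetical helper showing what --batch-size controls: one chunk of keys
# per Redis pipeline round-trip.
def split_into_batches(keys: list, batch_size: int = 500) -> list:
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    return [keys[i:i + batch_size] for i in range(0, len(keys), batch_size)]

# 1200 keys at the default batch size of 500 -> three round-trips.
batches = split_into_batches([f"doc:{i}" for i in range(1200)], batch_size=500)
print([len(b) for b in batches])  # [500, 500, 200]
```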
````diff
+### Backup Disk Space
+
+When `--backup-dir` is provided, original vectors are saved to disk before
+mutation. Approximate size: `num_docs × dims × bytes_per_element`.
+
+| Docs | Dims | Source dtype | Backup size |
+|------|------|--------------|-------------|
+| 100K | 768  | float32      | ~292 MB |
+| 1M   | 768  | float32      | ~2.9 GB |
+| 1M   | 1536 | float32      | ~5.7 GB |
+
+### HNSW vs FLAT Index Capacity
+
+```{note}
+When migrating from **HNSW** to **FLAT**, the target index may report a
+*higher* document count than the source. This is not a bug — it reflects
+a fundamental difference in how the two algorithms store vectors.
+
+HNSW maintains a navigable small-world graph with per-node neighbor lists.
+This graph overhead limits how many vectors can fit in available memory.
+FLAT stores vectors as a simple array with no graph overhead.
+
+If the source HNSW index was operating near its memory capacity, some
+documents may have been registered in Redis Search's document table but
+not fully indexed into the HNSW graph. After migration to FLAT, those
+same documents become fully searchable because FLAT requires less memory
+per vector.
+
+The migration validator compares the total key count
+(`num_docs + hash_indexing_failures`) between source and target, so this
+scenario is handled correctly in the general case.
+```
+
 ## Learn more
 
 - {doc}`/concepts/index-migrations`: How migrations work and which changes are supported
````
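The count comparison the HNSW/FLAT note describes can be sketched as a pure function over `FT.INFO`-style fields. The helper is illustrative, not redisvl's actual validator code:

```python
# Illustrative sketch of the validator's capacity check: source and target
# are considered consistent when their *total* key counts match, i.e.
# num_docs + hash_indexing_failures (field names as reported by FT.INFO).
def total_key_count(index_info: dict) -> int:
    return int(index_info["num_docs"]) + int(index_info["hash_indexing_failures"])

# HNSW source near memory capacity: 95K vectors indexed, 5K indexing failures.
source = {"num_docs": 95_000, "hash_indexing_failures": 5_000}
# FLAT target indexes everything: 100K docs, no failures.
target = {"num_docs": 100_000, "hash_indexing_failures": 0}

# Raw num_docs differs, but the totals agree, so validation passes.
print(total_key_count(source) == total_key_count(target))  # True
```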
