Encode
v2.4.1
Live Encoding Session · Feb 27, 2026

Store anything. In DNA.

Compress terabytes into molecules no larger than a grain of salt. A production-ready framework for synthetic DNA data storage.

View the Docs →
encode.app/dashboard · Session #4821
ENCODING ACTIVE
Input · File Queue
genomics_archive_2026.tar.gz
4.7 GB · Binary
ACTIVE
73% encoded · 3.43 GB processed
climate_model_v3.nc
2.1 GB
QUEUED
protein_db_hq.fasta
890 MB
QUEUED
+ Drop files to queue
Translation · Live Stream · ADS Codex v3
0001:ATCGGATCATGCTAGCATCG
0021:GCTAGCTAGCATCGATCGAT
0041:TATCGATCGATGCTAGCTAT
0061:CGCGATCGATCGATCGATCG
0081:ATGCATGCATCGATCGATCG
0101:
GC Balance: 51.2% · Homopolymer: 0 · Codec: Wukong-3
Metrics · Session #4821
Compression Ratio
0:1
vs raw binary
Strand Count
0
oligonucleotides
Estimated Mass
0.003 g
synthetic DNA
Error Correction
99.97%
Reed-Solomon + CIGAR
ECC Redundancy
12%
overhead
The Pipeline

Four steps from binary
to molecule.

The framework handles codec selection, error correction, and synthesis manifests; you handle the data.

Step 01
Encode
ADS Codex v3

Binary → Base Pairs

The ADS Codex v3 converts binary data into ternary, then maps each trit to a nucleotide using context-aware lookup tables. Homopolymer runs are eliminated. GC content is balanced to 50±3%.
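The trit-to-nucleotide step can be sketched as follows. This is a minimal illustration of one context-aware mapping, in which each trit selects one of the three bases that differ from the previous base, so no base ever repeats. It is not the ADS Codex v3 implementation and it does not attempt GC balancing. The function names, the six-trits-per-byte layout, and the starting base are assumptions made for the example.

```typescript
type Base = "A" | "C" | "G" | "T";
const BASES: Base[] = ["A", "C", "G", "T"];
const TRITS_PER_BYTE = 6; // assumption: 3^6 = 729 covers all 256 byte values

// Expand each byte into six base-3 digits, most significant digit first.
function bytesToTrits(data: Uint8Array): number[] {
  const trits: number[] = [];
  for (let i = 0; i < data.length; i++) {
    let value = data[i];
    const digits: number[] = [];
    for (let d = 0; d < TRITS_PER_BYTE; d++) {
      digits.unshift(value % 3);
      value = Math.floor(value / 3);
    }
    for (let d = 0; d < digits.length; d++) trits.push(digits[d]);
  }
  return trits;
}

// The three bases allowed after `prev`: every base except `prev` itself.
function choicesAfter(prev: Base): Base[] {
  return BASES.filter((b) => b !== prev);
}

// Map each trit to a base that differs from the one before it.
// `start` is a virtual previous base (hypothetical convention).
function tritsToDna(trits: number[], start: Base = "A"): string {
  let prev = start;
  let out = "";
  for (let i = 0; i < trits.length; i++) {
    const next = choicesAfter(prev)[trits[i]];
    if (next === undefined) throw new Error("not a trit: " + trits[i]);
    out += next;
    prev = next;
  }
  return out;
}

// Inverse of tritsToDna: recover each trit from the base and its predecessor.
function dnaToTrits(dna: string, start: Base = "A"): number[] {
  let prev = start;
  const trits: number[] = [];
  for (let i = 0; i < dna.length; i++) {
    const base = dna.charAt(i) as Base;
    const trit = choicesAfter(prev).indexOf(base);
    if (trit < 0) throw new Error("invalid or repeated base at position " + i);
    trits.push(trit);
    prev = base;
  }
  return trits;
}

// Inverse of bytesToTrits: fold each group of six trits back into a byte.
function tritsToBytes(trits: number[]): Uint8Array {
  const bytes = new Uint8Array(Math.floor(trits.length / TRITS_PER_BYTE));
  for (let i = 0; i < bytes.length; i++) {
    let value = 0;
    for (let d = 0; d < TRITS_PER_BYTE; d++) {
      value = value * 3 + trits[i * TRITS_PER_BYTE + d];
    }
    bytes[i] = value;
  }
  return bytes;
}
```

Under these assumptions the single byte 72 becomes the trits 0,0,2,2,0,0 and then the strand CATGAC. Because every base depends on its predecessor, homopolymer runs cannot occur, at the cost of carrying log2(3) ≈ 1.58 bits per nucleotide instead of 2.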

encode-encode.ts
 
Step 02
Error Correct
99.97% Fidelity

Reed-Solomon + Insertion Guard

Synthesis introduces insertion and deletion errors at a rate of about 1.2%, which is fundamentally different from digital bit-flips. Encode applies dual-layer ECC: outer Reed-Solomon codes plus inner CIGAR-alignment guards, achieving 99.97% fidelity.
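To see why insertions and deletions need alignment rather than position-by-position comparison, the sketch below aligns a sequenced read against the expected strand and reports the result as a SAM-style CIGAR string ("=" match, "X" mismatch, "I" insertion in the read, "D" deletion from the reference). It uses a plain unit-cost edit-distance alignment, which is an assumption made for illustration. It is not Encode's inner guard, and it does not include the Reed-Solomon layer.

```typescript
// Globally align `read` against `reference` and return a CIGAR string.
function cigar(reference: string, read: string): string {
  const n = reference.length;
  const m = read.length;

  // dp[i][j] = edit distance between reference[0..i) and read[0..j)
  const dp: number[][] = [];
  for (let i = 0; i <= n; i++) {
    const row: number[] = [];
    for (let j = 0; j <= m; j++) row.push(i === 0 ? j : j === 0 ? i : 0);
    dp.push(row);
  }
  for (let i = 1; i <= n; i++) {
    for (let j = 1; j <= m; j++) {
      const same = reference.charAt(i - 1) === read.charAt(j - 1);
      dp[i][j] = Math.min(
        dp[i - 1][j - 1] + (same ? 0 : 1), // match or mismatch
        dp[i - 1][j] + 1,                  // base missing from the read
        dp[i][j - 1] + 1                   // extra base in the read
      );
    }
  }

  // Trace back from the bottom-right corner, collecting one op per step.
  const ops: string[] = [];
  let i = n;
  let j = m;
  while (i > 0 || j > 0) {
    if (i > 0 && j > 0) {
      const same = reference.charAt(i - 1) === read.charAt(j - 1);
      if (dp[i][j] === dp[i - 1][j - 1] + (same ? 0 : 1)) {
        ops.push(same ? "=" : "X");
        i--;
        j--;
        continue;
      }
    }
    if (i > 0 && dp[i][j] === dp[i - 1][j] + 1) {
      ops.push("D");
      i--;
      continue;
    }
    ops.push("I");
    j--;
  }
  ops.reverse();

  // Run-length encode the ops, e.g. ["=", "=", "D", "="] becomes "2=1D1=".
  let out = "";
  let k = 0;
  while (k < ops.length) {
    let run = 1;
    while (k + run < ops.length && ops[k + run] === ops[k]) run++;
    out += run + ops[k];
    k += run;
  }
  return out;
}
```

For example, cigar("ACGT", "ACT") yields 2=1D1=. A single deleted base shifts every later position, so a naive positional comparison would report a string of mismatches where the alignment reports one deletion.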

encode-error-correct.ts
 
Step 03
Synthesize
Twist / IDT Compatible

Strand Manifest → Oligo Order

Encode generates a synthesis manifest compatible with the Twist Bioscience and Integrated DNA Technologies (IDT) APIs. Each 200-nt oligonucleotide is addressable by index prefix for random-access retrieval.
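A rough sketch of how an encoded sequence might be split into addressable 200-nt strands is shown below. The 200-nt strand length and the 20-base prefix come from this page. Everything else is an assumption for illustration: the split of the prefix into a 10-nt file tag plus a 10-nt base-4 strand counter, the ManifestEntry shape, and the function names. The sketch does not reproduce any vendor's order format, and a real prefix would also need to avoid homopolymers and work as a PCR primer site.

```typescript
const OLIGO_LENGTH = 200;   // from the text: 200-nt oligonucleotides
const TAG_LENGTH = 10;      // assumption: file-tag part of the 20-nt prefix
const COUNTER_LENGTH = 10;  // assumption: strand-counter part of the prefix
const PAYLOAD_LENGTH = OLIGO_LENGTH - TAG_LENGTH - COUNTER_LENGTH; // 180 nt

// Hypothetical manifest row: a label plus the full strand to synthesize.
interface ManifestEntry {
  label: string;
  sequence: string;
}

// Write a non-negative integer as a fixed-width base-4 string over A, C, G, T.
function counterToDna(index: number, width: number): string {
  const letters = "ACGT";
  let out = "";
  let value = index;
  for (let i = 0; i < width; i++) {
    out = letters.charAt(value % 4) + out;
    value = Math.floor(value / 4);
  }
  if (value !== 0) throw new Error("strand index does not fit in the counter field");
  return out;
}

// Cut the payload into 180-nt chunks and prepend tag + counter to each one.
// The final chunk is left unpadded in this sketch.
function buildManifest(fileTag: string, payload: string): ManifestEntry[] {
  if (fileTag.length !== TAG_LENGTH) {
    throw new Error("file tag must be " + TAG_LENGTH + " nt long");
  }
  const entries: ManifestEntry[] = [];
  let index = 0;
  for (let offset = 0; offset < payload.length; offset += PAYLOAD_LENGTH) {
    const chunk = payload.slice(offset, offset + PAYLOAD_LENGTH);
    entries.push({
      label: fileTag + "_" + index,
      sequence: fileTag + counterToDna(index, COUNTER_LENGTH) + chunk,
    });
    index++;
  }
  return entries;
}
```

With these field widths, one file tag can address up to 4^10 (about one million) strands, or roughly 189 million payload nucleotides.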

encode-synthesize.ts
 
Step 04
Retrieve
Random Access

Sequencing → Decoded Binary

Sequencing reads are fed back into the Encode decoder. PCR amplification by index prefix enables targeted random access — retrieve a single file from petabytes without reading the entire pool.
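The addressing idea can be illustrated in software. In the lab, selection by prefix happens physically through PCR; the sketch below only mimics the result by filtering reads on a prefix, ordering them by a strand counter, and concatenating their payloads. The prefix layout (a 10-nt file tag followed by a 10-nt base-4 counter), the function names, and the assumption that reads are already error-corrected are all hypothetical.

```typescript
const TAG_LENGTH = 10;     // assumption: file-tag part of the index prefix
const COUNTER_LENGTH = 10; // assumption: strand-counter part of the prefix

// Read a base-4 number written with the digits A=0, C=1, G=2, T=3.
function dnaToCounter(dna: string): number {
  let value = 0;
  for (let i = 0; i < dna.length; i++) {
    const digit = "ACGT".indexOf(dna.charAt(i));
    if (digit < 0) throw new Error("unexpected base: " + dna.charAt(i));
    value = value * 4 + digit;
  }
  return value;
}

// Keep only reads carrying `fileTag`, then stitch payloads in counter order.
// Duplicate reads of the same strand are expected; the first one seen wins.
function retrieveFile(reads: string[], fileTag: string): string {
  const payloads: (string | undefined)[] = [];
  for (let r = 0; r < reads.length; r++) {
    const read = reads[r];
    if (read.slice(0, TAG_LENGTH) !== fileTag) continue; // belongs to another file
    const counterEnd = TAG_LENGTH + COUNTER_LENGTH;
    const index = dnaToCounter(read.slice(TAG_LENGTH, counterEnd));
    if (payloads[index] === undefined) payloads[index] = read.slice(counterEnd);
  }
  let out = "";
  for (let i = 0; i < payloads.length; i++) {
    const payload = payloads[i];
    if (payload === undefined) throw new Error("missing strand " + i);
    out += payload;
  }
  return out;
}
```

The work done here scales with the number of reads handed to the function, not with the size of the whole pool, which is the point of amplifying by prefix before sequencing.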

encode-retrieve.ts
 
Performance Benchmarks

Numbers that make
silicon look temporary.

Benchmarked on a 4.7 GB genomics archive. All metrics come from production encoding sessions. These are measured results, not theoretical limits.

Storage Density
215 PB
per gram of DNA
vs 0.000001 PB/gram (flash)
Data Longevity
10,000
years (encapsulated)
vs ~5 years (magnetic tape)
Compression Ratio
847:1
vs raw binary
Wukong-3 codec, 4.7 GB test
Decode Fidelity
99.97%
post error-correction
Reed-Solomon + CIGAR guard
GC Balance
51.2%
target: 50 ± 3%
Homopolymer runs: 0
Density Comparison
10⁶×
denser than flash storage
DNA (Encode) · 215 PB/g
NVMe SSD · ~1 TB/kg
Magnetic Tape · ~200 GB/kg

A football field of server racks replaced by a storage unit the size of a football. Source: IARPA MIST Program, 2025.

284 ZB
Global data by 2027
DNA can absorb it all
2040
Silicon supply crisis
Encode is ready now
1 gram
Holds 215 petabytes
Verified experimentally
24 hr
IARPA write target
1 TB in, 10 TB out
Why DNA Storage

Post-silicon preservation
for records that must survive centuries.

A silicon supply crisis is projected for 2040. Encapsulated DNA stays stable for millennia, needs no power at rest, and copies for negligible cost. This is the long game.

Density beyond silicon

One gram of DNA stores 215 petabytes. The entire Library of Congress — 74 terabytes — fits in a structure the size of a poppy seed. 6,000 times over.

215 PB
per gram

10,000-year durability

DNA encapsulated in salt remains stable for decades at room temperature. Under controlled conditions: millennia. No maintenance, no degradation, no refresh cycles.

10K yr
stability

GC-balanced encoding

Stringent randomization eliminates homopolymer runs and maintains 50±3% GC content — critical for accurate synthesis and sequencing downstream.
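Both constraints are easy to check on any candidate strand. The sketch below computes GC content and the longest homopolymer run, then tests them against the 50±3% target stated above. The function names and the default run limit of 3 are assumptions; this page does not define what run length counts as a homopolymer.

```typescript
// Fraction of bases that are G or C (0 for an empty sequence).
function gcContent(seq: string): number {
  if (seq.length === 0) return 0;
  let gc = 0;
  for (let i = 0; i < seq.length; i++) {
    const base = seq.charAt(i);
    if (base === "G" || base === "C") gc++;
  }
  return gc / seq.length;
}

// Length of the longest stretch of one repeated base.
function longestHomopolymer(seq: string): number {
  let longest = 0;
  let run = 0;
  for (let i = 0; i < seq.length; i++) {
    run = i > 0 && seq.charAt(i) === seq.charAt(i - 1) ? run + 1 : 1;
    if (run > longest) longest = run;
  }
  return longest;
}

// True when the strand stays within the run limit and the GC window.
// maxRun = 3 is a hypothetical threshold; target and tolerance follow the text.
function meetsConstraints(
  seq: string,
  maxRun: number = 3,
  target: number = 0.5,
  tolerance: number = 0.03
): boolean {
  return (
    longestHomopolymer(seq) <= maxRun &&
    Math.abs(gcContent(seq) - target) <= tolerance
  );
}
```

Applied to the first strand in the live stream sample, ATCGGATCATGCTAGCATCG, gcContent returns 0.5 and longestHomopolymer returns 2 (the GG pair), so the strand passes under the assumed limit.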

50±3%
GC balance

Dual-layer error correction

Synthesis errors aren't bit-flips — they're insertions and deletions. Standard ECC fails. Encode uses Reed-Solomon outer codes plus CIGAR-alignment inner guards, tuned for indel error profiles. Fidelity: 99.97%.

99.97%
decode fidelity

DDSA Alliance compatible

Encode outputs synthesis manifests compatible with Twist Bioscience, IDT, Illumina, and Western Digital APIs. Built to the DNA Data Storage Alliance open standard.

DDSA
open standard

Random access retrieval

Each oligonucleotide carries a 20-nt index prefix. PCR amplification targets specific files without sequencing the entire pool, much like seeking a file on disk.

O(1)
file retrieval
DDSA Alliance Members
Twist Bioscience · Illumina · Western Digital · IDT · Microsoft Research · Agilent
From the Field

Used in production by researchers,
builders, and archivists.

340 GB · Climate Simulation
"We encoded 340 GB of climate simulation outputs from our 2025 Arctic survey. The Wukong-3 codec handled the binary-heavy dataset better than anything we'd benchmarked — 847:1 compression, 99.94% decode fidelity on first read. This is the archival infrastructure we've been waiting for."
Dr. Priya Nair
Principal Investigator, Bioinformatics
Scripps Institution of Oceanography
50 MB · Molecular Storage Pipeline
"Our pipeline prototyping went from 6 weeks to 4 days. Encode's manifest export plugs directly into our Twist order queue. We're encoding 50 MB test datasets in the sandbox before committing to full synthesis — exactly the iteration speed we needed."
Marcus Chen
CTO
Helix Vault, Inc. — Series A Biotech
12 TB · Government Archives
"We're evaluating post-silicon archival for 400 years of municipal records — birth certificates, land deeds, legal filings. Encode is the first framework that gave our legal team confidence in the error-correction audit trail. The CIGAR alignment report is exactly what compliance needs."
Adaeze Okonkwo
Chief Data Architect
Meridian Records Trust — Enterprise
847+
Active research teams
12.4 PB
Data encoded to date
4
DDSA partner integrations
99.97%
Average decode fidelity

Free tier: encode up to 1 MB into simulated DNA sequences.

No credit card · Cloud sandbox or CLI install · Cancel anytime

View Docs