Convert SAMI to Markdown.
Drop a .smi or .sami caption file and read every SYNC block as a timestamped line of Markdown. It runs entirely in your browser, so your file never leaves your device.
Drag & drop your files
Optimize for AI & RAG
Extra cleanup for LLM ingestion: strip HTML, fix smart quotes, tidy Unicode and spacing.
Add YAML front matter
Prepend a metadata block (title, source, date, word & token counts) for knowledge bases and RAG.
Add table of contents
Build a linked index from the headings. Handy for long documents.
Export RAG chunks (.json)
Split the result into retrieval-ready chunks. Download per file from the result panel.
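As a rough illustration of what these options do, here is a minimal Python sketch. It is not the converter's own code: the function names (`clean_for_llm`, `front_matter`, `chunk_lines`), the front matter field names, and the word-based chunk size are all assumptions made for the example. The token count is left out because the tokenizer the page uses is not stated.

```python
import json
import re
import unicodedata
from datetime import date

# Curly quotes mapped to their plain ASCII equivalents.
SMART_QUOTES = str.maketrans(
    {"\u2018": "'", "\u2019": "'", "\u201c": '"', "\u201d": '"'}
)


def clean_for_llm(text: str) -> str:
    """Strip HTML tags, straighten smart quotes, normalize Unicode, tidy spacing."""
    text = re.sub(r"<[^>]+>", "", text)
    text = text.translate(SMART_QUOTES)
    text = unicodedata.normalize("NFC", text)
    # Collapse runs of spaces, tabs and non-breaking spaces; keep line breaks.
    text = re.sub(r"[ \t\u00a0]+", " ", text)
    return "\n".join(line.strip() for line in text.splitlines())


def front_matter(title: str, source: str, body: str) -> str:
    """Build a YAML metadata block (hypothetical field names)."""
    words = len(body.split())
    return (
        "---\n"
        f"title: {json.dumps(title)}\n"  # JSON strings are valid YAML scalars
        f"source: {json.dumps(source)}\n"
        f"date: {date.today().isoformat()}\n"
        f"words: {words}\n"
        "---\n"
    )


def chunk_lines(body: str, max_words: int = 50) -> list[dict]:
    """Group whole lines into chunks of at most max_words words (assumed policy)."""
    chunks, current, count = [], [], 0
    for line in body.splitlines():
        n = len(line.split())
        if current and count + n > max_words:
            chunks.append({"id": len(chunks), "text": "\n".join(current)})
            current, count = [], 0
        current.append(line)
        count += n
    if current:
        chunks.append({"id": len(chunks), "text": "\n".join(current)})
    return chunks
```

Splitting on whole lines keeps each timestamp with its caption, which is one reasonable choice for caption text; the real chunker may split differently.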
Most converters quietly upload your documents to a server. This one can't: the conversion runs in your browser, with no upload step.
Written for players, not for reading.
A SAMI file wraps every caption in HTML-like SYNC and P tags, with Start times counted in milliseconds. Converting lifts each caption out as a plain line with a readable [mm:ss] stamp.
<SAMI><BODY>
<SYNC Start=1000><P>Hello there.</P>
<SYNC Start=5000><P>General Kenobi.</P>
</BODY></SAMI>
[00:01] Hello there.
[00:05] General Kenobi.
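The example above can be reproduced with a short Python sketch. This is an illustration under stated assumptions, not the converter's actual implementation: `sami_to_markdown` and `format_stamp` are hypothetical names, the regular expressions only cover simple files like the sample, and the stamp stays in minutes and seconds even past one hour.

```python
import html
import re

# One SYNC block: the Start value, then everything up to the next SYNC,
# the closing BODY tag, or the end of the input.
SYNC_RE = re.compile(
    r"<SYNC\s+Start\s*=\s*\"?(\d+)\"?[^>]*>(.*?)(?=<SYNC\b|</BODY>|\Z)",
    re.IGNORECASE | re.DOTALL,
)
TAG_RE = re.compile(r"<[^>]+>")


def format_stamp(ms: int) -> str:
    """Turn a millisecond offset into an [mm:ss] stamp (sub-second part dropped)."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"[{minutes:02d}:{seconds:02d}]"


def sami_to_markdown(sami: str) -> str:
    """Emit one timestamped line per SYNC block that contains caption text."""
    lines = []
    for start, body in SYNC_RE.findall(sami):
        text = TAG_RE.sub(" ", body)   # drop P, BR, FONT and other markup tags
        text = html.unescape(text)     # decode entities such as &amp; and &nbsp;
        text = " ".join(text.split())  # collapse whitespace, including line breaks
        if text:                       # assumption: skip blank (caption-clearing) blocks
            lines.append(f"{format_stamp(int(start))} {text}")
    return "\n".join(lines)


sample = """<SAMI><BODY>
<SYNC Start=1000><P>Hello there.</P>
<SYNC Start=5000><P>General Kenobi.</P>
</BODY></SAMI>"""

print(sami_to_markdown(sample))
```

Run on the sample, this prints the same two lines shown above. Real-world SAMI files often carry style blocks, class attributes, and unclosed tags, so a production parser would need to be more tolerant than this.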
Captions, made easy to reuse.
Caption files in, clean timestamped Markdown out, with no server and no account anywhere.
It never leaves your browser
Your .smi is read and converted on your own device. Nothing is uploaded to any server, ever.
AI & RAG ready
Optional cleanup, YAML front matter, a table of contents and RAG chunk export.
Works offline
Once the page has loaded you can switch off your connection and it keeps converting.
SYNC blocks to lines
Each SYNC caption keeps its text and start time as a line; the markup tags go.
Unicode safe
Accents, symbols and non-Latin scripts come through intact as UTF-8.
Free, and unlimited
No sign-up, no quotas, no watermarks. Convert one file or a thousand; it all runs the same way, on your own device.
What the line keeps.
Honest about what comes through, and what doesn't. These are the same notes the Formats list shows for SAMI, so the page never drifts from what the converter really does.
Kept (2)
- Each caption's text
- Its start time
Dropped (3)
- Markup tags
- Styling
- End times
SAMI questions, answered.
Everything worth knowing before you drop in a caption file.
Other converters.
Have other files to convert? These work the same way: privately, in your browser.
GenBank to Markdown
.gb · .gbk
Annotated DNA sequences.
PDB to Markdown
.pdb
Protein 3D structures.
Newick to Markdown
.nwk · .tree
Phylogenetic trees.
GFF / GTF / BED to Markdown
.gff · .gtf · .bed
Genome feature annotations.
SAM to Markdown
.sam
Sequence alignment data.
MOL / SDF to Markdown
.mol · .sdf
Chemical structure files.
SMILES to Markdown
.smiles
Molecular structure strings.
Crystallography to Markdown
.cif
Crystal structure data.