Detection coverage
Measured, not merely claimed.
Every number below was measured by injecting real corruption into real sample files and counting what the validator caught: 100 trials per format per mode, a fixed seed, and the raw data for every trial kept in the repository. We publish the zeros too: some formats simply have no integrity mechanism, and we would rather tell you that than show you something else.
- Sniper: a single random bit flipped at a random byte position. This is the hardest kind of corruption to catch.
- Shotgun: 4,096 consecutive bytes overwritten with random data at a random position, modeling a failed disk sector.
Detection rates rise with every release: deep decoders are added format by format, and this table is regenerated from the raw measurement data each time.
The mechanism notes are shown in English: they are detailed technical descriptions carried over from the measurement report.
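The two corruption modes and the 100-trial count can be sketched as follows. This is a minimal illustration, not the project's harness: the function names, the `is_valid` callback, and the seed value are assumptions made for the example.

```python
import random
from typing import Callable, Optional

SHOTGUN_LEN = 4096  # bytes overwritten in Shotgun mode


def sniper(data: bytes, rng: random.Random) -> Optional[bytes]:
    """Flip one random bit at one random byte position."""
    out = bytearray(data)
    pos = rng.randrange(len(out))
    out[pos] ^= 1 << rng.randrange(8)
    return bytes(out)


def shotgun(data: bytes, rng: random.Random) -> Optional[bytes]:
    """Overwrite 4,096 consecutive bytes at a random offset.

    Returns None when the sample is smaller than the overwrite window,
    which is how this sketch models the "Shotgun N/A" rows.
    """
    if len(data) < SHOTGUN_LEN:
        return None
    out = bytearray(data)
    start = rng.randrange(len(out) - SHOTGUN_LEN + 1)
    out[start:start + SHOTGUN_LEN] = bytes(
        rng.getrandbits(8) for _ in range(SHOTGUN_LEN)
    )
    return bytes(out)


def detection_rate(
    data: bytes,
    corrupt: Callable[[bytes, random.Random], Optional[bytes]],
    is_valid: Callable[[bytes], bool],
    trials: int = 100,
    seed: int = 1234,  # hypothetical seed; any fixed value makes runs repeatable
) -> Optional[float]:
    """Percentage of corrupted copies that the validator rejects."""
    rng = random.Random(seed)
    caught = 0
    for _ in range(trials):
        damaged = corrupt(data, rng)
        if damaged is None:
            return None  # mode not applicable to this sample
        if not is_valid(damaged):
            caught += 1
    return 100.0 * caught / trials
```

A validator that checks a CRC32 over the whole payload scores 100% in both modes under this harness, while one that accepts anything scores 0%, which is the range the tables below span.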
Images and photos
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| PNG image (PNG) | 100% | 100% | Linux · macOS · Windows | CRC32 per chunk |
| JPEG XL image (JXL) | 87% | 100% | Linux · macOS · Windows | Container + frame checksums |
| WebP image (WebP) | 83% | 84% | Linux · macOS · Windows | libwebp full decode |
| macOS Icon (ICNS) | 97% | 100% | Linux · macOS · Windows | TLV chunk stream; near-total coverage |
| Flash SWF (SWF) | 100% | 100% | Linux · macOS · Windows | CWS = zlib wrapper; any flip fails zlib inflate or its Adler-32 check |
| SVG vector graphics (SVG) | 45% | 99% | Linux · macOS · Windows | XML parse: almost any corruption breaks the XML grammar. |
| Windows icon (ICO) | 63% | 70% | Linux · macOS · Windows | Directory entries + embedded PNG/BMP image validation. Multi-resolution ICO has high structural-byte density. |
| QOI image (QOI) | 0% | 0% | Linux · macOS · Windows | QOI spec has no per-opcode checksum; bit flips in pixel data decode to different-but-valid pixels. Only magic (4 B) and end marker (8 B) are checkable. Fundamental format limit. |
| JPEG 2000 image (JPEG2K) | 6% | 97% | Windows: structural validation only. Deep JPEG decoding is Linux/macOS-only at launch; Windows support is in progress. | Codestream marker structure |
| GIF image (GIF) | 9% | 100% | Linux · macOS · Windows | LZW decode (shotgun desyncs state); larger animated fixture lifts header-tamper detection |
| JPEG image (JPEG) | 4% | 100% | Windows: structural validation only. Deep JPEG decoding is Linux/macOS-only at launch; Windows support is in progress. | jpegz wrapperDecode (062393f); most single-byte tamper lands in entropy-coded data, which JPEG tolerates by design |
| OpenEXR HDR image (EXR) | 1% | 100% | Linux · macOS · Windows | ZIP-compressed EXR: zlib decompress path runs on every scanline block. Shotgun detection perfect; sniper lower than the 26 KB NONE-compressed sample (6%) because structural bytes are a smaller fraction on the larger file. Both paths validated. |
| Adobe Photoshop (PSD) | 2% | 50% | Linux · macOS · Windows | RLE-compressed PSD: `validatePsdDeep` decodes every scanline. The old RAW-compressed `sample.psd` (0%/7%) is retained; sweep picks the larger RLE one. Measures the strong path that the RAW sample couldn't exercise. |
| HEIC/HEIF image (HEIC) | 0% | 4% | Linux · macOS · Windows | H.265 CABAC per tile: **arithmetic coding absorbs single-bit errors by design** |
| AVIF image (AVIF) | 0% | 1% | Linux · macOS · Windows | AV1 OBU + adaptive arithmetic coding; same limitation |
| BMP image (BMP) | 0% | 0% | Linux · macOS · Windows | BMP spec has no data checksums; `bmp_decoder.validateBmp` walks every pixel row proving accessibility but cannot detect bit-flips in pixel bytes. 0/400 at ±0.5% CI. Fundamental format limit. |
| DPX image (DPX) | 0% | 0% | Linux · macOS · Windows | Raw pixel; SMPTE 268M spec has no checksum |
| Portable Anymap image (PAM/PPM) | 0% | 0% | Linux · macOS · Windows | Raw pixel; Netpbm spec has no checksum |
| TGA image (TGA) | 25% | 100% | Linux · macOS · Windows | Header + image-spec validation catches malformed-byte tamper; tiny 11 KB fixture pushes structural-byte density up |
| TIFF image (TIFF) | 0% | 0% | Linux · macOS · Windows | IFD structural only; no per-strip checksum |
| JBIG2 bi-level image (JBIG2) | 0% | N/A | Linux · macOS · Windows | Bi-level image stream walk; sniper 0% on truncated sample. Shotgun N/A (sample < 4 KB). |
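The PNG row's "CRC32 per chunk" mechanism can be illustrated with a short chunk walk. This is a sketch of the check defined by the PNG specification (the CRC covers the chunk type and data), not the validator's own code; the function name is made up for the example.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_chunks_ok(data: bytes) -> bool:
    """Return True if every chunk's stored CRC32 matches its type + data."""
    if not data.startswith(PNG_SIGNATURE):
        return False
    pos = len(PNG_SIGNATURE)
    seen_iend = False
    while pos < len(data):
        if pos + 8 > len(data):
            return False  # truncated length/type fields
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        end = pos + 8 + length + 4
        if end > len(data):
            return False  # chunk runs past the end of the file
        body = data[pos + 8:pos + 8 + length]
        (stored,) = struct.unpack(">I", data[end - 4:end])
        if zlib.crc32(ctype + body) != stored:
            return False  # corruption inside this chunk
        pos = end
        if ctype == b"IEND":
            seen_iend = True
            break
    # Require a terminating IEND chunk and no trailing bytes after it.
    return seen_iend and pos == len(data)
```

Because nearly every byte of a PNG sits inside some chunk, a single flipped bit almost always lands under a CRC, which is consistent with the 100%/100% row above.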
Camera RAW
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| Adobe DNG RAW (DNG) | 3% | 3% | Windows: structural validation only. Deep JPEG decoding is Linux/macOS-only at launch; Windows support is in progress. | TIFF-based via tiffz strip+tile decode loop + jpegz preview decode; 77 MB sensor dump dilutes detection density per byte |
| Canon RAW (CR2) | 1% | 6% | Windows: structural validation only. Deep JPEG decoding is Linux/macOS-only at launch; Windows support is in progress. | TIFF-based; IFD-walked preview JPEG decoded via libjpeg-turbo (sRAW2 lossless strip filtered out so heuristic keeps the real preview) |
| Nikon RAW (NEF) | 0% | 0% | Linux · macOS · Windows | TIFF-based; deep via zigimg |
| Sony RAW (ARW) | 0% | 0% | Linux · macOS · Windows | TIFF-based; deep via zigimg |
| Fuji RAW (RAF) | 0% | 1% | Windows: structural validation only. Deep JPEG decoding is Linux/macOS-only at launch; Windows support is in progress. | Fuji; validator decodes the JPEG preview at 0x54/0x58, but preview is ~0.5% of a 208 MB sensor dump, so shotgun almost never lands in it. ⚠ See Action Items. |
| NRW | 0% | 0% | Linux · macOS · Windows | Nikon; dispatched through LibRaw which unpacks sensor data but the format has no per-row checksum. `libraw_unpack_thumb` is not currently wired up; adding it would catch corruption inside the embedded preview JPEG and could lift this to ~15-30% shotgun. Follow-up item. |
| Olympus RAW (ORF) | 0% | 0% | Linux · macOS · Windows | Olympus; validator WARNs on "uncompressed IFD claims but Huffman-compressed data". Structural-only. |
| Pentax RAW (PEF) | 0% | 0% | Linux · macOS · Windows | Pentax; TIFF-wrapped. Structural-only. |
| Panasonic RAW (RW2) | 0% | 0% | Linux · macOS · Windows | Panasonic; TIFF-wrapped. Structural-only. |
| Canon CR3 RAW (CR3, Canon) | 0% | 0% | Linux · macOS · Windows | ISOBMFF-based Canon RAW 3; structural box walk. 15 MB sample at 0%/0% confirms no integrity mechanism beyond structure. |
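Several rows above explain low rates by dilution: the checked region is a small slice of a large sensor dump. A rough estimate of that effect is sketched below, assuming one contiguous checked region away from the file edges and a uniformly random corruption offset. The function name and the decimal reading of "208 MB" are assumptions made for illustration.

```python
def hit_probability(file_size: int, region_size: int, window: int = 1) -> float:
    """Chance that a `window`-byte corruption at a uniformly random offset
    overlaps one contiguous checked region of `region_size` bytes.

    Assumes the region lies far enough from both file edges that no
    overlapping window position is clipped.
    """
    starts = file_size - window + 1          # possible corruption offsets
    overlapping = region_size + window - 1   # offsets that touch the region
    return min(1.0, overlapping / starts)


# Fuji RAF figures quoted in the table: preview is ~0.5% of a 208 MB file.
FILE_SIZE = 208_000_000            # assumption: "MB" read as 10**6 bytes
PREVIEW = FILE_SIZE * 5 // 1000    # ~0.5% of the file

sniper_p = hit_probability(FILE_SIZE, PREVIEW, window=1)
shotgun_p = hit_probability(FILE_SIZE, PREVIEW, window=4096)
```

Under these assumptions both modes land in the preview about 0.5% of the time, so with 100 trials a measured 0% or 1% is the expected outcome rather than a validator defect.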
Video
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| Matroska video (MKV) | 100% | 100% | Linux · macOS · Windows | CRC32 per EBML cluster: **gold standard** |
| AV1 video (AV1) | 5% | 100% | Linux · macOS · Windows | OBU structure + tile decode |
| MPEG transport stream (MPEG-TS) | 4% | 100% | Linux · macOS · Windows | PAT/PMT CRC + continuity counters |
| Standard MIDI file (MIDI) | 15% | 100% | Linux · macOS · Windows | Track framing + delta/event validation |
| Apple ProRes video (ProRes/MOV) | 5% | 78% | Linux · macOS · Windows | ProRes intra-frame DCT decode per frame |
| MP4 video (MP4) | 0% | 66% | Linux · macOS · Windows | H.264 CABAC + AAC decode (sample = `avc1` + AAC) |
| QuickTime video (MOV) | 1% | 75% | Linux · macOS · Windows | H.264 CABAC decode; new sample generated from public-domain jellyfish footage via ffmpeg. Old MPEG-4 Part 2 sample kept in `sample.mov` but sweep now picks the larger H.264 one. |
| WebM video (WebM, VP9+Opus) | 86% | 78% | Linux · macOS · Windows | libvpx 1.14.1 full VP9 decode per frame: every frame entropy + DCT decoded via `vpx_codec_decode`. Opus audio CRC provides additional coverage. |
| WebM video (WebM, VP8) | 88% | 90% | Linux · macOS · Windows | libvpx 1.14.1 full VP8 decode. **Critical:** uses `VP8D_GET_FRAME_CORRUPTED` control query after each frame; without it, VP8's built-in error concealment silently patches bit flips. Both sniper and shotgun lift by ~90 points because of this query. |
| AVI video (AVI) | 0% | 93% | Windows: structural validation only. Deep JPEG decoding is Linux/macOS-only at launch; Windows support is in progress. | MJPEG per-frame decode via libjpeg-turbo; new sample generated via ffmpeg. Old MPEG-4 Part 2 sample kept in `generated_testsrc.avi` but sweep picks the larger MJPEG one. |
| DV video (DV) | 0% | 0% | Linux · macOS · Windows | DV spec has no checksum; relies on tape physical ECC |
| MPEG elementary stream (MPEG-ES) | 0% | 0% | Linux · macOS · Windows | Start codes only |
| MPEG program stream (MPEG-1/2) | 0% | 0% | Linux · macOS · Windows | Start codes only |
| MPEG-4 Part 2 | 0% | 0% | Linux · macOS · Windows | VOP header parsing tolerates VOP failures |
| RealMedia (RM) | 0% | 2% | Linux · macOS · Windows | RealMedia spec has no checksums; structural chunk walk only |
| Flash Video (FLV) | 40% | N/A | Linux · macOS · Windows | Flash Video tag walk; no per-tag CRC. Tiny 33 B sample: sniper hits magic + header bytes. Shotgun N/A (sample < 4 KB). |
| MPEG program stream (MPEG-PS) | 0% | N/A | Linux · macOS · Windows | MPEG Program Stream; PES header walk; no CRC. Shotgun N/A (sample < 4 KB). |
| Ogg Theora video (Theora, .ogv) | 100% | 100% | Linux · macOS · Windows | Theora-in-Ogg; libtheora-encoded testsrc (CC0). Ogg page CRC32 catches every probed bit flip in both modes. |
| IVF video container (VP8, raw IVF) | 0% | 0% | Linux · macOS · Windows | IVF container with VP8 frames; structural-only without libvpx wired into the IVF dispatch path. Hand-authored CC0 sample. Detection is fundamentally low until VP8-in-IVF gets the same `VP8D_GET_FRAME_CORRUPTED` query as VP8-in-WebM. |
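The MPEG-TS row relies partly on continuity counters, which explains its profile: a 4 KB overwrite spans about 22 packets and reliably breaks the sequence, while a single flipped bit usually lands in payload. A simplified sketch of such a check follows. It ignores duplicate packets and the discontinuity indicator, both of which the standard permits, and the function name is an assumption.

```python
TS_PACKET = 188      # fixed transport-stream packet size in bytes
SYNC_BYTE = 0x47
NULL_PID = 0x1FFF


def continuity_errors(stream: bytes) -> int:
    """Count sync-byte and continuity-counter violations in a TS stream."""
    errors = 0
    last_cc = {}  # PID -> last continuity counter seen
    for off in range(0, len(stream) - TS_PACKET + 1, TS_PACKET):
        pkt = stream[off:off + TS_PACKET]
        if pkt[0] != SYNC_BYTE:
            errors += 1
            continue
        pid = ((pkt[1] & 0x1F) << 8) | pkt[2]
        has_payload = bool(pkt[3] & 0x10)
        cc = pkt[3] & 0x0F
        if pid == NULL_PID:
            continue  # the counter is undefined for null packets
        if pid in last_cc:
            # The 4-bit counter advances only on packets that carry payload.
            expected = (last_cc[pid] + 1) & 0x0F if has_payload else last_cc[pid]
            if cc != expected:
                errors += 1
        last_cc[pid] = cc
    return errors
```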
Audio
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| Dolby Digital AC-3 audio (AC3) | 100% | 100% | Linux · macOS · Windows | CRC-16 per syncframe. Previous sample `TomorrowNeverDies-...ac3` was malformed (started with bytes 0x84 4F 59 11, not AC3 sync 0B 77); the validator correctly rejected it, so the old 100%/100% number was a false positive from trial inheritance. Removed 2026-04-23; now measured against a genuinely valid sample. |
| Ogg audio (OGG) | 100% | 100% | Linux · macOS · Windows | CRC32 per OGG page |
| Dolby Digital Plus audio (E-AC3, large) | 100% | 100% | Linux · macOS · Windows | CRC-16 per syncframe (full file, after 2026-03-06 fix) |
| Dolby Digital Plus audio (E-AC3, small) | 81% | 85% | Linux · macOS · Windows | ⚠ Lower coverage on smaller file: reflects frame-size / coverage density, not a bug |
| FLAC audio (FLAC) | 80% | 88% | Linux · macOS · Windows | MD5 audio hash + CRC-8/CRC-16 per frame |
| Apple Lossless audio (ALAC) | 1% | 100% | Linux · macOS · Windows | Lossless decode; a 4 KB overwrite kills a frame |
| Opus | 1% | 35% | Linux · macOS · Windows | OGG page CRC + libopus decode |
| M4A audio (AAC, M4A) | 4% | 31% | Linux · macOS · Windows | MP4 box + AAC syntax decode |
| AAC-LC audio (AAC, ADTS) | 6% | 20% | Linux · macOS · Windows | ADTS framing + syntax |
| MP3 audio (MP3) | 1% | 1% | Linux · macOS · Windows | Frame sync only; MP3 spec has no data CRC |
| WAV audio (WAV) | 0% | 2% | Linux · macOS · Windows | RIFF structural; no data checksum |
| AIFF audio (AIFF) | 0% | 1% | Linux · macOS · Windows | IFF structural; no data checksum |
| CAF audio (CAF) | 0% | 1% | Linux · macOS · Windows | Chunk walk; no data checksum |
| AU/SND audio (AU) | 0% | 0% | Linux · macOS · Windows | Header + raw PCM |
| ProTracker module (Tracker, MOD) | 0% | 0% | Linux · macOS · Windows | No integrity mechanism in format |
| Compact Pro Archive (CPT) | 100% | 100% | Linux · macOS · Windows | CRC per resource fork entry (Compact Pro archive, not audio) |
| AMR audio (AMR) | 14% | N/A | Linux · macOS · Windows | Adaptive Multi-Rate audio; frame-table based, no per-frame CRC. Sniper at 14% reflects sync-byte coverage on a 38 B sample. Shotgun N/A. |
| Monkey's Audio (APE) | 99% | 100% | Linux · macOS · Windows | MAC header + descriptor + per-frame CRC32. Full deep-decode validation now wired via vendored upstream Monkey's Audio SDK 12.73 (`deps/libape/`, BSD-3 since 2023). The validator runs structural rigor first (descriptor, header, seek-table monotonicity, audio-region bounds, field sanity, version range) then decodes every frame and surfaces per-frame CRC32-over-decoded-PCM mismatches via the C-shim `validate_ape_decode_check`. Truncation is caught either by the structural seek-table walk (modern v3980+ has audio_data_length) or by the decoder's sample-count-mismatch path. Synthetic 16 KB corpus is now a real APE encoded by the SDK from white-noise WAV (`mac -c2000`), so it actually decodes; sniper hits 99% (only the descriptor MD5 bytes, which are byte-level metadata rather than bitstream, escape detection in some random flips), shotgun hits 100%. Tested with the in-tree Zig tests + tests/cli/ape_validation (11 PASS). |
| CD+Graphics Karaoke (CD+G) | 0% | 2% | Linux · macOS · Windows | 24-byte fixed-size sectors; structural only, no checksum. 0%/2% confirms fundamental format limit. |
| DSDIFF audio (DFF, DSDIFF) | 52% | N/A | Linux · macOS · Windows | DSD audio container; chunk walk only. Tiny 32 B header-only sample. Shotgun N/A. |
| DSD stream file (DSF, DSD) | 60% | N/A | Linux · macOS · Windows | Sony DSD; structural walk. Shotgun N/A. |
| DTS Digital Surround audio (DTS) | 0% | 51% | Linux · macOS · Windows | Frame sync + size walk; no per-frame CRC. Shotgun lift from 4 KB overwrite desyncing the frame stream. |
| True Audio (TTA) | 97% | N/A | Linux · macOS · Windows | Per-frame CRC32 catches almost every probed bit flip. Shotgun N/A (sample < 4 KB). |
| WavPack audio (WavPack) | 100% | 100% | Linux · macOS · Windows | libwavpack 5.9.0 deep decode: every block decoded to PCM, per-block CRC over decoded samples + checksum sub-block + sample-count drift. Block-checksum sub-block (`ID_BLOCK_CHECKSUM`) catches header/bitstream tampering at open-time; post-decode CRC catches arithmetic drift; truncation surfaces as decoded < expected sample count. |
Documents and office
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| Excel spreadsheet, 97-2003 (XLS) | 22% | 96% | Linux · macOS · Windows | BIFF8 records + SST + formulas + cells |
| Word document, 97-2003 (DOC, large) | 2% | 2% | Linux · macOS · Windows | FIB + 31 fc/lcb pair bounds + CLX piece table. Detection density drops on large docs: body text is most of the file. |
| Word document, 97-2003 (DOC, small) | N/A | 52% | Linux · macOS · Windows | Same validator, smaller file; shotgun has ~21% chance of hitting FIB/Table/CLX. |
| PDF document (PDF) | n/a† | n/a† | Linux · macOS · Windows | †Headline numbers are misleading for PDF: detection rate is dominated by which compression filters the document uses for its embedded streams (Flate, DCT/JPEG, JPX/JPEG2000, JBIG2, CCITT), not by the validator. See "PDF detection by stream-filter dominance" subsection below for the breakout table. The exit-code bug from 2026-03-06 was fixed in commit `c304f36` (2026-04-23, Action Item #1). |
| PowerPoint, 97-2003 (OLE2, PPT) | 0% | 0% | Linux · macOS · Windows | FAT/directory structural only |
| Adobe InDesign document (InDesign) | 1% | 73% | Linux · macOS · Windows | Page structure. |
| Word document, OOXML (DOCX) | 87% | 100% | Linux · macOS · Windows | OOXML = ZIP with per-entry CRC32. Sample replaced 2026-04-23 from Apache 2.0 Tika test corpus. |
| Excel spreadsheet, OOXML (XLSX) | 82% | 100% | Linux · macOS · Windows | OOXML = ZIP with per-entry CRC32. Sample from Apache Tika. |
| PowerPoint, OOXML (PPTX) | 93% | 100% | Linux · macOS · Windows | OOXML = ZIP with per-entry CRC32. Sample from Apache Tika. |
| OpenDocument text (ODT) | 96% | 100% | Linux · macOS · Windows | ODF = ZIP with per-entry CRC32. Sample from Apache Tika. Note: currently auto-detected as EPUB (also ZIP); this is a separate detection-priority bug. |
| OpenDocument spreadsheet (ODS) | 88% | 100% | Linux · macOS · Windows | Same as ODT. |
| OpenDocument presentation (ODP) | 97% | 100% | Linux · macOS · Windows | Same as ODT. |
| Rich Text Format (RTF) | 0% | 92% | Linux · macOS · Windows | Structural only; RTF has no checksums. Shotgun high because 4 KB overwrite reliably breaks brace matching or control-word syntax. |
| EML email message (EML) | 12% | 84% | Linux · macOS · Windows | UTF-8 + NUL-byte integrity check on the raw file before structural parsing. NUL is never legal in a mail message; flipping the high bit of an ASCII byte produces a byte of 0x80 or above that does not form a valid UTF-8 sequence. That accounts for roughly one in eight single-bit ASCII flips (12% measured) and most 4 KB shotgun overwrites (84% measured). |
| MBOX mail archive (MBOX) | 13% | 100% | Linux · macOS · Windows | Same UTF-8 + NUL-byte check as EML, applied to the whole concatenated file. Shotgun catches every 4 KB random overwrite (random bytes almost always include NUL or invalid UTF-8 sequences). Sniper catches roughly one in eight random bit flips (13% measured), mostly the high-bit flips. |
| Apple Pages (Pages) | 100% | 100% | Linux · macOS · Windows | iWork bundle = ZIP with per-entry CRC32. Hand-authored from scratch (no Apple software, no permissive corpus exists); `scripts/build-pages-sample` regenerates a deterministic 8-IWA inner `Index.zip` with high-entropy openssl-AES-CTR payloads plus real plist metadata. CRC32 per entry catches every bit flip and every 4 KB shotgun overwrite. |
| Apple Keynote (Keynote) | 99% | 100% | Linux · macOS · Windows | iWork bundle = ZIP with per-entry CRC32. Hand-authored mirroring the Pages sample; `scripts/build-keynote-sample` regenerates a 9-IWA inner `Index.zip` plus an uncompressed `buildVersionHistory.plist` carrying `com.apple.iWork.Keynote` so the format detector keys on it. CRC32 per entry catches every 4 KB shotgun overwrite and ~99% of single-bit flips. |
| Apple Numbers (Numbers) | 100% | 100% | Linux · macOS · Windows | Same iWork-bundle template as Keynote with a Tables/-shaped IWA layout. The `com.apple.iWork.Numbers` marker is stored uncompressed at the head of the outer ZIP. CRC32 per entry catches every probed bit flip and every 4 KB shotgun overwrite. |
| Adobe Illustrator (AI) | 6% | N/A | Linux · macOS · Windows | PostScript-derived header + PDF body. Tiny 372 B sample is mostly PDF magic; later bytes are uncovered text. Shotgun N/A. |
| BAI2 Balance Report (BAI2, Bank Admin Inst.) | 20% | N/A | Linux · macOS · Windows | Fixed-format banking text; structural validation of record-type prefixes. No checksum. Shotgun N/A. |
| CSV data (CSV) | 0% | N/A | Linux · macOS · Windows | Plain text; structural validator only checks UTF-8 + delimiter consistency. 0% as expected. Shotgun N/A. |
| ClarisWorks/AppleWorks (ClarisWorks) | 6% | N/A | Linux · macOS · Windows | Legacy AppleWorks. Structural walk. Shotgun N/A. |
| UN/EDIFACT (EDIFACT) | 37% | N/A | Linux · macOS · Windows | Fixed-format trade messages; validator cross-checks UNH/UNT counts. Shotgun N/A. |
| Encapsulated PostScript (EPS) | 4% | N/A | Linux · macOS · Windows | PostScript header + structural walk. Shotgun N/A. |
| EPUB eBook (EPUB) | 69% | N/A | Linux · macOS · Windows | ZIP container with mimetype check; per-entry CRC32. Shotgun N/A. |
| HTML document (HTML) | 2% | N/A | Linux · macOS · Windows | Tag-tree validator; structural only. Shotgun N/A. |
| iCalendar (RFC 5545) | 21% | N/A | Linux · macOS · Windows | Structural; BEGIN/END pairing + property syntax. Shotgun N/A. |
| Adobe InDesign Markup (IDML) | 55% | N/A | Linux · macOS · Windows | ZIP+XML markup; per-entry CRC32. Shotgun N/A. |
| INI configuration (INI) | 17% | N/A | Linux · macOS · Windows | Plain-text key-value; structural only. Shotgun N/A. |
| JSON data (JSON) | 47% | N/A | Linux · macOS · Windows | JSON parser; structural only. Tiny sample's curly/brace density yields 47%. Shotgun N/A. |
| JSON5 | 30% | N/A | Linux · macOS · Windows | JSON5 parser; structural only. Shotgun N/A. |
| SWIFT MT940 Bank Statement (MT940) | 11% | N/A | Linux · macOS · Windows | Banking text; structural only. Shotgun N/A. |
| MacWrite document (MacWrite) | 0% | N/A | Linux · macOS · Windows | Legacy word processor; structural only. Tiny sample. Shotgun N/A. |
| NACHA/ACH Electronic Payments (NACHA) | 15% | N/A | Linux · macOS · Windows | Banking fixed-format text; record-type validation. Shotgun N/A. |
| Open Financial Exchange (OFX) | 2% | N/A | Linux · macOS · Windows | Banking SGML/XML; structural only. Shotgun N/A. |
| Plain text, UTF-8 (Plain Text) | 0% | N/A | Linux · macOS · Windows | UTF-8 + control-char check. 0% as expected. Shotgun N/A. |
| Pro Tools session (PTX) | 0% | 0% | Linux · macOS · Windows | Avid session structural walk. 0%/0% on 40 KB confirms structural-only. |
| Quicken Interchange Format (QIF) | 3% | N/A | Linux · macOS · Windows | Plain-text; structural only. Shotgun N/A. |
| TOML configuration (TOML) | 37% | N/A | Linux · macOS · Windows | Plain-text; structural only. Shotgun N/A. |
| Tax Exchange Format (TXF) | 3% | N/A | Linux · macOS · Windows | Plain-text; structural only. Shotgun N/A. |
| vCard (RFC 6350) | 32% | N/A | Linux · macOS · Windows | Structural BEGIN/END pairing + property syntax. Shotgun N/A. |
| WordPerfect document (WordPerfect) | 2% | N/A | Linux · macOS · Windows | Header walk; structural only. Shotgun N/A. |
| X12 EDI | 21% | N/A | Linux · macOS · Windows | Fixed-format trade messages; ISA/IEA + GS/GE counts cross-validated. Shotgun N/A. |
| XML document (XML) | 64% | N/A | Linux · macOS · Windows | XML parse; structural only. 64% sniper from tag/quote density on tiny sample. Shotgun N/A. |
| YAML data (YAML) | 0% | N/A | Linux · macOS · Windows | Plain-text; structural only. Shotgun N/A. |
| Markdown text (Markdown) | 0% | N/A | Linux · macOS · Windows | Plain-text; structural only. Shotgun N/A. |
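The UTF-8 + NUL-byte check described in the EML and MBOX rows is easy to reproduce, and doing so shows why its sniper rate sits near one in eight. The sketch below is an illustration with made-up function names, not the validator's code: it applies the check and then flips every bit of a sample exactly once.

```python
def text_integrity_ok(raw: bytes) -> bool:
    """True if the bytes contain no NUL and decode as strict UTF-8."""
    if b"\x00" in raw:
        return False
    try:
        raw.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return False
    return True


def sniper_catch_fraction(raw: bytes) -> float:
    """Flip every bit of `raw` once and return the fraction rejected."""
    caught = 0
    for pos in range(len(raw)):
        for bit in range(8):
            damaged = bytearray(raw)
            damaged[pos] ^= 1 << bit
            if not text_integrity_ok(bytes(damaged)):
                caught += 1
    return caught / (8 * len(raw))
```

On pure ASCII input only the high-bit flip of each byte is guaranteed to break UTF-8, so the fraction is 1/8, plus a little extra from bytes such as space (0x20) that turn into NUL when their single set bit is cleared.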
Fonts
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| TrueType font (TTF) | 100% | 100% | Linux · macOS · Windows | Per-table checksum + whole-file checkSumAdjustment (strict mode) |
| OpenType font (OTF) | 100% | 100% | Linux · macOS · Windows | Per-table checksum + whole-file checkSumAdjustment |
| WOFF font (WOFF) | 100% | 100% | Linux · macOS · Windows | Per-table zlib-decompress + origChecksum verification (font_validator.zig:370). Prior "0%/0%" was a stale sweep; the code was already doing the right thing. |
| WOFF2 font (WOFF2) | 49% | 100% | Linux · macOS · Windows | Per-table Brotli-decompress + origChecksum verification. Prior "0%/0%" was a stale sweep. Sniper is lower than WOFF because WOFF2's Brotli framing is more compact (fewer structural bytes), but shotgun still perfect. |
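The per-table checksum used by TTF and OTF is a plain sum of big-endian 32-bit words, so it can be sketched in a few lines. The code below follows the OpenType rule that the `head` table is summed with its checkSumAdjustment field treated as zero. It does not verify the whole-file checkSumAdjustment, and the function names are assumptions.

```python
import struct


def table_checksum(table: bytes) -> int:
    """Sum the table as big-endian uint32 words, zero-padded, modulo 2**32."""
    padded = table + b"\x00" * (-len(table) % 4)
    words = struct.unpack(f">{len(padded) // 4}I", padded)
    return sum(words) & 0xFFFFFFFF


def sfnt_tables_ok(font: bytes) -> bool:
    """Check every table-directory checksum against the table's bytes."""
    if len(font) < 12:
        return False
    (num_tables,) = struct.unpack_from(">H", font, 4)
    if len(font) < 12 + 16 * num_tables:
        return False
    for i in range(num_tables):
        tag, checksum, offset, length = struct.unpack_from(
            ">4sIII", font, 12 + 16 * i
        )
        if offset + length > len(font):
            return False  # table runs past the end of the file
        data = font[offset:offset + length]
        if tag == b"head":
            if length < 12:
                return False
            # checkSumAdjustment (bytes 8..11 of 'head') counts as zero.
            data = data[:8] + b"\x00\x00\x00\x00" + data[12:]
        if table_checksum(data) != checksum:
            return False
    return True
```

A single flipped bit always changes an additive sum, which is why table bytes are fully covered. Bytes outside any table, such as the header's search fields, need the whole-file adjustment to be caught.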
Scientific
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| FITS astronomy data (FITS, with CHECKSUM) | 100% | 100% | Linux · macOS · Windows | CHECKSUM/DATASUM per HDU |
| FITS astronomy data (FITS, no CHECKSUM) | 0% | 2% | Linux · macOS · Windows | Keyword validation only |
| DICOM medical image (DICOM) | 5% | 20% | Linux · macOS · Windows | Tag structure + value validation |
| HDF5 scientific data (HDF5) | 4% | 13% | Linux · macOS · Windows | Jenkins lookup3 checksum (small file) |
| PDB protein structure (PDB, Protein) | 16% | 39% | Linux · macOS · Windows | ATOM/HETATM record cross-validation |
| CIF crystallographic data (CIF, Crystallographic Info) | 0% | N/A | Linux · macOS · Windows | Plain-text scientific format; structural only. Shotgun N/A. |
| FASTA sequence (FASTA) | 22% | N/A | Linux · macOS · Windows | Plain-text bioinformatics; structural only. Shotgun N/A. |
| FASTQ sequence reads (FASTQ) | 28% | N/A | Linux · macOS · Windows | Plain-text bioinformatics; per-record sequence/quality length cross-check. Shotgun N/A. |
| MATLAB data (MAT-File) | 94% | N/A | Linux · macOS · Windows | Element header + flag walk; magic + endian + tag length validation. Shotgun N/A. |
| NetCDF scientific data (NetCDF) | 39% | N/A | Linux · macOS · Windows | NetCDF classic header walk; HDF5-derived NetCDF-4 reuses HDF5's lookup3 checksums. Tiny sample. Shotgun N/A. |
| NIfTI neuroimaging (NIfTI-1) | 1% | N/A | Linux · macOS · Windows | Header magic + dims; no checksum. Shotgun N/A. |
| Apache Parquet data (Parquet) | 2% | N/A | Linux · macOS · Windows | Footer + page CRC32 (not currently verified by validator beyond header). Shotgun N/A. |
| ESRI Shapefile (Shapefile) | 89% | N/A | Linux · macOS · Windows | GIS .shp; record-by-record header check + magic. Shotgun N/A. |
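The FASTQ row's "per-record sequence/quality length cross-check" is a structural test, and a sketch makes its limits visible. The version below assumes the common four-line record layout with no line wrapping; the function name is made up for the example.

```python
def fastq_records_ok(text: str) -> bool:
    """Check four-line FASTQ records: '@' header, sequence, '+', quality."""
    lines = text.splitlines()
    if not lines or len(lines) % 4 != 0:
        return False
    for i in range(0, len(lines), 4):
        header, seq, plus, qual = lines[i:i + 4]
        if not header.startswith("@") or not plus.startswith("+"):
            return False
        if len(seq) != len(qual):
            return False  # sequence and quality strings must pair up
    return True
```

A flip that turns one base into another valid character keeps both lengths equal and passes unnoticed, which is why a structural check like this catches only a minority of single-bit flips.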
Databases
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| QuickBooks Company File (QBW) | 100% | 100% | Linux · macOS · Windows | CRC32 per 4096-byte page (v12+) |
| SQLite database (SQLite) | 54% | 100% | Linux · macOS · Windows | Page headers + btree structure |
| Microsoft Access database (ACCDB) | 1% | 73% | Linux · macOS · Windows | Jet engine page structure (small file) |
| Microsoft Access database, 97-2003 (MDB) | 1% | 73% | Linux · macOS · Windows | Jet engine page structure (small file) |
Archives
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| TAR archive (TAR) | 15% | 73% | Linux · macOS · Windows | Header checksum per 512-byte block |
| 7-Zip archive (7z) | 99% | 100% | Linux · macOS · Windows | 7z next-header CRC32 + per-stream CRC32. Hand-authored xorshift corpus (CC0). Both sniper and shotgun catch nearly every flip. |
| Unix ar archive (AR, Unix archive) | 38% | N/A | Linux · macOS · Windows | `!<arch>\n` magic + 60-byte member headers; no per-entry checksum. Shotgun N/A. |
| BLIP Archive (BLAR, Blake3 Archive) | 100% | N/A | Linux · macOS · Windows | Peter's archive format with Blake3 per-entry hashing. Every probed bit flip detected. Shotgun N/A. |
| Brotli compressed (Brotli) | 47% | 100% | Linux · macOS · Windows | Raw Brotli stream; full streaming decompression via libbrotli. RFC 7932 has no whole-file checksum, but the entropy coder rejects most structurally invalid prefix codes / distance overflows / window-bits errors. ~47% of single-bit flips on dense English-text Huffman streams cascade into decoder failure; the rest decode to wrong-but-valid bytes (silent). 4 KB shotgun overwrites are essentially always rejected. Earlier 0%/0% row used a pathological xorshift random-noise corpus where compressed output is ~8 bits/byte and bit flips are statistically valid Huffman codes; replaced with a deterministic CC0 English-text corpus (scripts/build-brotli-corpus). |
| Bzip2 compressed (Bzip2) | 100% | 100% | Linux · macOS · Windows | CRC32 per block + combined CRC. Hand-authored xorshift corpus (CC0). |
| Microsoft Cabinet Archive (CAB, Microsoft) | 100% | 100% | Linux · macOS · Windows | Per-folder + per-file CSUM (Adler-like) cross-validated. CC0 sample built via gcab. |
| Gzip compressed (Gzip) | 100% | 100% | Linux · macOS · Windows | CRC32 + ISIZE in trailer. Hand-authored xorshift corpus (CC0). |
| BinHex 4.0 Archive (BinHex, .hqx) | 100% | 100% | Linux · macOS · Windows | BinHex 4.0 header + per-fork CRC16. Hand-authored CC0 sample (encoder reverse-engineered from validator). |
| BLIP Mini-Archive (MBLAR, Multi-Blake3) | 100% | N/A | Linux · macOS · Windows | Peter's manifest-bundle archive; Blake3 per file. Shotgun N/A. |
| PAR2 parity archive (PAR2) | 100% | 100% | Linux · macOS · Windows | MD5 of every packet + recovery slice integrity. Built via par2cmdline (BSD-licensed). |
| RAR Archive (RAR) | 100% | 100% | Linux · macOS · Windows | Per-entry CRC32 + RAR5 BLAKE2sp option. CC0 corpus (rar -m5). |
| StuffIt Archive (StuffIt) | 94% | N/A | Linux · macOS · Windows | Header + entry walk; sniper 94% on 140 B from header dominance. Shotgun N/A. |
| XZ compressed (XZ) | 100% | 100% | Linux · macOS · Windows | CRC32/CRC64/SHA-256 per stream + index integrity. Hand-authored xorshift corpus (CC0). |
| ZIP archive (ZIP) | 100% | 100% | Linux · macOS · Windows | Per-entry CRC32 + EOCD record. Hand-authored xorshift corpus (CC0). |
| Zstandard compressed (Zstd) | 100% | 100% | Linux · macOS · Windows | Frame-level XXH64 + frame footer. Hand-authored xorshift corpus (CC0). |
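The Gzip row's "CRC32 + ISIZE in trailer" check can be sketched with the standard `zlib` module. The example below handles a single-member gzip file only and uses a made-up function name. `zlib` already verifies the trailer when asked to parse the gzip wrapper, so the explicit comparison at the end is there to show what is being checked.

```python
import struct
import zlib


def gzip_trailer_ok(blob: bytes) -> bool:
    """Decompress a single-member gzip file and verify CRC32 and ISIZE."""
    if len(blob) < 18 or blob[:2] != b"\x1f\x8b":
        return False
    try:
        # wbits=31 tells zlib to expect a gzip header and trailer.
        d = zlib.decompressobj(wbits=31)
        payload = d.decompress(blob) + d.flush()
    except zlib.error:
        return False  # broken deflate stream or trailer mismatch
    if not d.eof or d.unused_data:
        return False  # truncated stream, or extra bytes after the member
    # Trailer: CRC32 of the uncompressed data, then its length mod 2**32.
    stored_crc, stored_isize = struct.unpack("<II", blob[-8:])
    return (zlib.crc32(payload) == stored_crc
            and (len(payload) & 0xFFFFFFFF) == stored_isize)
```

Since the CRC covers every decompressed byte, corruption anywhere in the stream either breaks the deflate decode or changes the checksum, in line with the 100%/100% row.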
Game ROMs
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| SNES ROM (SNES) | 100% | 99% | Linux · macOS · Windows | Internal ROM checksum + complement |
| Game Boy ROM (GB) | 0% | 1% | Linux · macOS · Windows | Header checksum only (tiny coverage) |
| Game Boy Advance ROM (GBA) | 0% | 0% | Linux · macOS · Windows | Header checksum only |
| Sega Genesis ROM (Genesis) | 0% | 1% | Linux · macOS · Windows | Header checksum only |
| NES ROM (NES) | 0% | 0% | Linux · macOS · Windows | iNES header only |
| Nintendo 64 ROM (N64) | 0% | 0% | Linux · macOS · Windows | No integrity mechanism |
| MAME CHD image (CHD, MAME) | 10% | N/A | Linux · macOS · Windows | MAME's compressed disc; SHA-1 per hunk + global SHA-1. Tiny synthetic 124 B sample. Shotgun N/A. |
| Nintendo DS ROM (NDS, Nintendo DS) | 41% | N/A | Linux · macOS · Windows | Header CRC16 (logo + secure area). 41% sniper on 1 KB. Shotgun N/A. |
| DOOM WAD archive (WAD, Doom/Wii) | 100% | N/A | Linux · macOS · Windows | Lump table; structural only. Tiny synthetic sample: header IS most of file. Shotgun N/A. |
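The Game Boy row ("Header checksum only, tiny coverage") is a good example of a real checksum that protects almost nothing. The sketch below implements the well-known cartridge header checksum over bytes 0x134 to 0x14C, stored at 0x14D. The function names and the 32 KiB ROM size in the usage note are assumptions for illustration.

```python
HEADER_START = 0x134   # first byte covered by the header checksum
CHECKSUM_AT = 0x14D    # where the one-byte checksum is stored


def gb_header_checksum_ok(rom: bytes) -> bool:
    """Verify the Game Boy cartridge header checksum."""
    if len(rom) < 0x150:
        return False
    x = 0
    for b in rom[HEADER_START:CHECKSUM_AT]:
        x = (x - b - 1) & 0xFF
    return x == rom[CHECKSUM_AT]


def covered_fraction(rom_size: int) -> float:
    """Share of the ROM protected: 25 header bytes plus the checksum byte."""
    return (CHECKSUM_AT - HEADER_START + 1) / rom_size
```

For a 32 KiB ROM, `covered_fraction(32768)` is under 0.1%, so a random bit flip almost never lands in the checked bytes, which is consistent with the 0%/1% row.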
Disk images / file systems / executables / other
| Format | Sniper | Shotgun | Platforms | How it's caught (or why it can't be) |
|---|---|---|---|---|
| د ایپل ډیسک انځورDMG | 0% | 10% | Linux · macOS · Windows | Plist + koly trailer |
| د ISO 9660 ډیسک انځورISO | 0% | 0% | Linux · macOS · Windows | PVD structural only |
| د COFF اعتراض فایلCOFF | 0% | 1% | Linux · macOS · Windows | Section header structure |
| Mach-O یونیورسل بائنریMach-O Fat | 0% | 0% | Linux · macOS · Windows | Architecture header only |
| بلورب متقابل افسانهBlorb | 0% | 0% | Linux · macOS · Windows | IFF structural only |
| macOS DS_StoreDS_Store | 0% | 25% | Linux · macOS · Windows | BTree page structure |
| ASF میډیاASF | 1% | 0% | Linux · macOS · Windows | GUID/object structural |
| Quicken Data FileQDF | 1% | 0% | Linux · macOS · Windows | OLE2/ZIP structural |
| د 3MF 3D تولید3MF (3D Manufacturing) | 75% | — | Linux · macOS · Windows | ZIP-based; per-entry CRC32 + XML manifest. Shotgun N/A (sample < 4 KB). |
| Adobe After Effects پروژهAEP (After Effects Project) | 27% | — | Linux · macOS · Windows | RIFX container; structural-only walk. Tiny sample (44 B). Shotgun N/A. |
| Ableton Live SetALS (Ableton Live Set) | 90% | — | Linux · macOS · Windows | gzip-wrapped XML. Tiny sample — gzip CRC32 + zlib structure catches most bit flips. Shotgun N/A. |
| Apple Media Library DatabaseApple Media DB | 46% | — | Linux · macOS · Windows | tvdb/photo SQLite-derived store. Structural walk. Shotgun N/A. |
| د GarageBand پروژهGarageBand (.band) | 0% | — | Linux · macOS · Windows | Bundle (directory) format — `projectData` inside is a plist routed to plist validator. Sweep-only on the plist file. Shotgun N/A. |
| ایرلانګ/ایلکسیر بیم بایټ کوډBEAM (Erlang) | 36% | — | Linux · macOS · Windows | FOR1/IFF chunk container; chunk lengths cross-validated. No CRC. Shotgun N/A. |
| د بلینډر 3D پروژهBlender (.blend) | 47% | — | Linux · macOS · Windows | DNA-block-based binary. Structural walk; no checksum. Tiny header-only sample. Shotgun N/A. |
| د BSP نقشه دوتنهBSP (Quake) | 39% | — | Linux · macOS · Windows | Lump-table walk; no CRC. Structural only. Shotgun N/A. |
| د Bitwig سټوډیو پروژهBitwig Project | 0% | — | Linux · macOS · Windows | ZIP-derived but tiny sample (128 B). Sniper 0% — sample is below ZIP minimum. Shotgun N/A. |
| د کرومیم سرچینې PAKChromium PAK | 0% | — | Linux · macOS · Windows | Resource bundle; index walk only. Tiny synthetic sample. Shotgun N/A. |
| د کیوبا پروژهCubase Project | 49% | — | Linux · macOS · Windows | Steinberg binary. Structural only. Shotgun N/A. |
| DER Certificate/KeyDER (ASN.1) | 7% | — | Linux · macOS · Windows | TLV-walked. Structural; no checksum. Shotgun N/A. |
| DaVinci د حل پروژهDRP (DR Painter) | 60% | — | Linux · macOS · Windows | Generic binary — high sniper from header dominance. Shotgun N/A. |
| د AutoCAD انځورګريDWG (AutoCAD) | 1% | — | Linux · macOS · Windows | Section structure walk. Tiny sample. Shotgun N/A. |
| AutoCAD DXFDXF (AutoCAD) | 5% | — | Linux · macOS · Windows | Plain-text CAD; structural only. Shotgun N/A. |
| EEx/ERB کينډۍErlang Mix .eex | 0% | — | Linux · macOS · Windows | Plain-text template; structural only. Shotgun N/A. |
| ELF Executable (ELF) | 20% | — | Linux · macOS · Windows | Section header walk; no whole-file checksum. Tiny synthetic 64 B sample. Shotgun N/A. |
| Erlang Term (BERT) | 0% | — | Linux · macOS · Windows | External Term Format walk. Shotgun N/A. |
| Final Cut Pro XML (FCPXML) | 58% | — | Linux · macOS · Windows | XML-based; structural walk. Shotgun N/A. |
| FL Studio Project | 6% | — | Linux · macOS · Windows | Project file structural walk. Tiny sample. Shotgun N/A. |
| GLB Binary glTF (GLB) | 32% | — | Linux · macOS · Windows | Chunk-based; structural walk. JSON chunk + BIN chunk lengths cross-validated. Shotgun N/A. |
| glTF 3D Scene (JSON) | 20% | — | Linux · macOS · Windows | JSON manifest; structural only. Shotgun N/A. |
| IFF Container (EA IFF) | 46% | — | Linux · macOS · Windows | Chunk walk; no CRC. Shotgun N/A. |
| Java Class File (.class) | 21% | — | Linux · macOS · Windows | ClassFile constant pool walk; magic + version check. Shotgun N/A. |
| KML Geographic Data (KML) | 58% | — | Linux · macOS · Windows | GIS XML; structural only. Shotgun N/A. |
| KMZ Compressed KML (KMZ) | 95% | — | Linux · macOS · Windows | KMZ = zipped KML; per-entry CRC32 catches almost any bit flip on the small sample. Shotgun N/A. |
| Logic Pro X Project | 71% | — | Linux · macOS · Windows | Bundle format; sample is the `ProjectData` plist alone. Shotgun N/A. |
| Larian Studios PAK (LSPK) | 3% | — | Linux · macOS · Windows | Pak file; structural only. Shotgun N/A. |
| Mach-O Binary | 0% | — | Linux · macOS · Windows | Single-arch sample; load command walk; no checksum. Shotgun N/A. |
| Wavefront OBJ 3D Model (OBJ) | 44% | — | Linux · macOS · Windows | Plain-text 3D; vertex/face syntax check. Shotgun N/A. |
| Quake PAK Archive (PAK) | 60% | — | Linux · macOS · Windows | Header offset/length cross-check. Tiny synthetic sample (12 B). Shotgun N/A. |
| Windows PE Executable (PE) | 3% | — | Linux · macOS · Windows | MZ + PE headers; optional checksum (rarely populated). Tiny sample. Shotgun N/A. |
| PEM Certificate/Key (RFC 7468) | 51% | — | Linux · macOS · Windows | Base64 envelope; structural only. Shotgun N/A. |
| PGP Signed Message | 81% | — | Linux · macOS · Windows | Header/footer detect + Base64 walk. Shotgun N/A. |
| Apple Property List (Plist) | 52% | — | Linux · macOS · Windows | Both XML and binary plist; structural only. Shotgun N/A. |
| PLY Polygon File (PLY) | 51% | — | Linux · macOS · Windows | Header + element count; no per-element checksum. Shotgun N/A. |
| Adobe Premiere Pro Project | 55% | — | Linux · macOS · Windows | Gzip-wrapped XML. Tiny sample. Shotgun N/A. |
| Reason Project (Propellerhead) | 33% | — | Linux · macOS · Windows | Bundle binary; structural walk. Shotgun N/A. |
| REAPER Project (RPP) | 50% | — | Linux · macOS · Windows | Plain-text project; structural only. Shotgun N/A. |
| Sketch Design File (.sketch) | 56% | — | Linux · macOS · Windows | ZIP-based; per-entry CRC32. Shotgun N/A. |
| SSH Signature | 64% | — | Linux · macOS · Windows | RFC 4880-like wire-format walk. Shotgun N/A. |
| STEP CAD Model (.step) | 23% | — | Linux · macOS · Windows | ISO 10303-21 plain-text CAD. Shotgun N/A. |
| STL 3D Model (STL) | 73% | — | Linux · macOS · Windows | Both ASCII and binary STL; sniper 73% on small ASCII sample. Shotgun N/A. |
| Roxio Toast Disc Image (Toast) | 0% | 10% | Linux · macOS · Windows | Apple Toast disc image; structural walk. |
| Type 1 Font | 0% | — | Linux · macOS · Windows | PostScript-derived font; eexec encrypted body walk. Shotgun N/A. |
| VMware Virtual Disk (VMDK) | 0% | 0% | Linux · macOS · Windows | VMware disk descriptor + extent walk. 65 KB sample at 0%/0%; structural-only. |
| Valve PAK Archive (VPK) | 100% | — | Linux · macOS · Windows | Tiny synthetic 28 B sample; structural walk catches every flip (the header is the whole file). Shotgun N/A. |
| WARC Web Archive (WARC) | 48% | — | Linux · macOS · Windows | Record header + content-length walk. Shotgun N/A. |
| WebAssembly Module | 35% | — | Linux · macOS · Windows | Section LEB128 length walk; magic + version. Tiny sample. Shotgun N/A. |
| Windows Imaging Format (WIM) | 8% | — | Linux · macOS · Windows | XPRESS/LZX section walk; partial integrity. Shotgun N/A. |
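
Several rows above describe a chunk walk with cross-validated lengths and no CRC (BEAM, IFF, GLB). The sketch below shows that idea for a plain EA IFF `FORM` container. It is an illustration under stated assumptions, not the validator's actual code: `walk_iff`, `flip_bit`, and `make_sample` are hypothetical names, and BEAM's `FOR1` variant (which aligns chunks differently) is not handled. It suggests why such formats land well below 100% on Sniper: a flip in a length field breaks the walk, while a flip in a chunk ID or payload byte goes unnoticed.

```python
import struct


def walk_iff(data: bytes) -> bool:
    """Return True if the FORM container's lengths are self-consistent."""
    if len(data) < 12 or data[:4] != b"FORM":
        return False
    (form_len,) = struct.unpack(">I", data[4:8])
    # The FORM length counts everything after the 8-byte FORM header.
    if form_len != len(data) - 8:
        return False
    pos = 12  # skip "FORM", the length, and the 4-byte form type
    while pos < len(data):
        if pos + 8 > len(data):
            return False  # truncated chunk header
        (chunk_len,) = struct.unpack(">I", data[pos + 4:pos + 8])
        pos += 8 + chunk_len + (chunk_len & 1)  # chunks are padded to even size
        if pos > len(data):
            return False  # chunk claims more bytes than the file holds
    return True


def flip_bit(data: bytes, byte_index: int, bit: int) -> bytes:
    """Return a copy of data with one bit inverted (a single 'sniper' hit)."""
    out = bytearray(data)
    out[byte_index] ^= 1 << bit
    return bytes(out)


def make_sample() -> bytes:
    """Build a tiny synthetic FORM with one 4-byte chunk (hypothetical IDs)."""
    body = b"TEST" + b"DATA" + struct.pack(">I", 4) + b"\x01\x02\x03\x04"
    return b"FORM" + struct.pack(">I", len(body)) + body


if __name__ == "__main__":
    sample = make_sample()
    total = len(sample) * 8
    caught = sum(
        not walk_iff(flip_bit(sample, i, b))
        for i in range(len(sample))
        for b in range(8)
    )
    print(f"{caught}/{total} single-bit flips caught by the structural walk")
```

On a sample this small, header and length bytes make up a large share of the file, which is the "header dominance" effect several notes in the table mention; on a realistic file dominated by payload the caught fraction would be much lower.
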
Recent additions (closing coverage gaps)
| Format | Sniper | Shotgun | Platforms | How it is caught (or why it cannot be caught) |
|---|---|---|---|---|
| Studio One Project (.song) | 100% | 100% | Linux · macOS · Windows | ZIP-based; per-entry CRC32 + metainfo.xml integrity. Hand-authored CC0 sample. |
| StuffIt X Archive (.sitx) | 0% | 0% | Linux · macOS · Windows | Magic + structural header walk only; no per-entry checksums in current validator. Hand-authored. |
| Microsoft Installer (.msi) | 0% | 43% | Linux · macOS · Windows | OLE2 compound file (no integrity beyond CFBF FAT structure). Built via wixl. Shotgun catches FAT/dir mismatch. |
| Windows ESD Image (.esd) | 1% | 0% | Linux · macOS · Windows | WIM variant with LZMS compression; structural header walk (208-byte WIM header). Hand-authored. |
| LLVM Precompiled Header (.pch) | 0% | 0% | Linux · macOS · Windows | Magic ("CPCH") + LLVM bitcode signature only. Bitcode contents are version-specific; structural only. |
| LLVM Serialized Diagnostics (.dia) | 0% | 0% | Linux · macOS · Windows | Magic ("DIAG") + LLVM bitcode signature only. Same limit as .pch. |
| QuickBooks Backup (.qbb) | 8% | 24% | Linux · macOS · Windows | OLE2-based; dispatches through document_validators (no per-stream checksum). Sample shared with ole2/sample.doc. |
| PCAP Network Capture (PCAP) | 4% | 100% | Linux · macOS · Windows | Hand-authored (no sudo for tcpdump in nix sandbox). Walks every packet record's incl_len/orig_len; shotgun lands in valid trailer bytes that fail length checks. Fixed a 64 MiB stack-overflow bug in `validatePcap` while landing the sample. |
| PCAPNG Network Capture (PCAPNG) | 0% | 0% | Linux · macOS · Windows | Section Header Block + IDB + EPBs structural walk; pcapng-validator checks magic and BOM only (no block-level CRC verification yet; pcapng has optional CRC32 per block). |
| dBASE Database (.dbf) | 0% | 100% | Linux · macOS · Windows | Header version + date + record-length cross-validation; hand-authored CC0 dBASE III. Sniper rate is a fundamental limit (no per-record checksum); shotgun lands in tail records past header-declared range. |
| G-code (3D Printer/CNC) | 27% | 98% | Linux · macOS · Windows | Text format; line-grammar walk catches 27% sniper (the caught flips break a coordinate or G/M code prefix). Shotgun 98%: a large overwrite breaks too many lines to ignore. Hand-authored CC0. |
| MessagePack Data (.msgpack) | 0% | 0% | Linux · macOS · Windows | Type-tagged binary; validator walks tag stream but spec has no checksum. Most flips land in payload bytes that decode to different-but-valid values. Fundamental limit of the format specification. Hand-authored CC0. |
| RPM Package (.rpm) | 3% | 30% | Linux · macOS · Windows | RPM v3 lead + signature header + main header. Validator computes SHA-1 over main header (when sig tag 269 present). Shotgun 30% reflects header dominance vs payload mass. Built via rpmbuild (CC0 spec). Fixed a 16 MiB stack-overflow bug in `validateRpm` while landing the sample. |
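
The PCAP row describes a walk over every packet record's `incl_len`/`orig_len`. A minimal Python sketch of that kind of walk follows. It is an assumption-laden illustration, not the project's `validatePcap`: `walk_pcap` and `make_pcap` are hypothetical names, and only little-endian classic pcap (magic `0xA1B2C3D4`, version 2.4) is accepted. It shows why a large overwrite is almost always caught (garbage length fields fail the checks) while a single flipped payload or timestamp bit usually is not.

```python
import struct

PCAP_MAGIC = 0xA1B2C3D4  # classic pcap, microsecond timestamps


def walk_pcap(data: bytes) -> bool:
    """Return True if every record header is consistent with the file size."""
    if len(data) < 24:
        return False
    magic, major, minor, _zone, _sigfigs, snaplen, _linktype = struct.unpack(
        "<IHHiIII", data[:24]
    )
    # Assumption: little-endian captures only; a byte-swapped magic is rejected.
    if magic != PCAP_MAGIC or (major, minor) != (2, 4):
        return False
    pos = 24
    while pos < len(data):
        if pos + 16 > len(data):
            return False  # truncated record header
        _sec, _usec, incl_len, orig_len = struct.unpack(
            "<IIII", data[pos:pos + 16]
        )
        # Captured length can exceed neither the original length nor snaplen.
        if incl_len > orig_len or incl_len > snaplen:
            return False
        pos += 16 + incl_len
        if pos > len(data):
            return False  # record claims more bytes than the file holds
    return True


def make_pcap(payloads, snaplen: int = 65535) -> bytes:
    """Build a small synthetic capture; link type 1 is Ethernet."""
    out = struct.pack("<IHHiIII", PCAP_MAGIC, 2, 4, 0, 0, snaplen, 1)
    for i, payload in enumerate(payloads):
        out += struct.pack("<IIII", i, 0, len(payload), len(payload)) + payload
    return out


if __name__ == "__main__":
    good = make_pcap([b"abcd", b"efghij"])
    print("clean file valid:", walk_pcap(good))
    shot = good[:24] + b"\xff" * (len(good) - 24)  # overwrite all records
    print("overwritten file valid:", walk_pcap(shot))
```

The same asymmetry appears in the dBASE and RPM rows: length and header fields are checkable, but without a per-record checksum the payload bytes themselves are not.
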