Skip to content

Browse File Formats by Category

Explore 871+ documented file formats grouped into 16 categories. Each format includes its primary hex signature, byte offset, MIME type, and ready-to-use validation code.

Images

109

Raster and vector image formats — from JPEG, PNG, and GIF to camera RAW, SVG, and PSD — with their magic bytes, MIME types, and byte offsets.

Audio

45

Audio and sound formats including MP3, WAV, FLAC, AAC, Ogg, and MIDI, and how to identify each by its binary header.

Video

35

Container and codec formats for video — MP4, MOV, AVI, MKV, WebM and more — and the magic bytes that distinguish them.

Documents

7

Page-description and document formats such as PDF, PostScript, DjVu, and XPS, with reliable signature-based detection.

eBooks

3

Electronic book formats including EPUB, MOBI, AZW, and FB2, and the headers used to recognise them.

Office

37

Word processor, spreadsheet, and presentation formats — Microsoft Office, OpenDocument, Lotus SmartSuite — and their file signatures.

Archives

21

Compression and archive formats like ZIP, 7z, RAR, tar, and gzip, and how to validate them by magic number.

Executables

27

Executable, library, and bytecode formats — Windows PE, ELF, Mach-O, Java class, WebAssembly — and their identifying headers.

Fonts

14

Digital font formats including TrueType, OpenType, WOFF, and bitmap fonts, with signature-based identification.

Databases

10

Database and table-store file formats such as SQLite, Access, dBASE, and Firebird, and how to detect them.

Certificates & Keys

11

Cryptographic certificate, key, and keystore formats — PEM, DER, PKCS#12, PGP — and their distinguishing magic bytes.

CAD & 3D

8

Computer-aided design and 3D model formats including STL, OBJ, STEP, FBX, and glTF, with reliable detection.

Disk Images

15

Disk, filesystem, and forensic image formats — ISO, DMG, VHD, VMDK, and EnCase E01 — and their headers.

Web & Config

13

Markup, stylesheet, and configuration formats such as HTML, XML, CSS, JSON, YAML, and TOML.

Source Code

20

Programming and scripting source formats across C, Python, shell, and many other languages, plus how they are detected.

Email & PIM

11

Email message and mailbox formats — Outlook, Thunderbird, mbox, and address books — and their identifying signatures.