Microsoft Office Open XML Format (.docx)
.docx file signature | application/octet-stream
Microsoft Office Open XML Format (OOXML) DocumentNOTE:There is no subheader for MS OOXML files as there is with DOC, PPT, and XLS files. To better understand the format of these files,rename any OOXML file to have a .ZIP extension and then unZIP the file;look at the resultant file named[Content_Types].xmlto see the contenttypes. In particular, look for the<Override PartName=tag, where youwill findword,ppt, orxl, respectively.Trailer:Look for50 4B 05 06(PK..) followed by 18 additional bytesat the
Magic Bytes
Offset 0
50 4B 03 04 50 4B 05 06 50 4B 07 08
Sources: Wikipedia
All Known Signatures
2 signature variants are documented for .docx files across multiple sources.
| Hex Signature | Offset | Sources |
|---|---|---|
| 50 4B 03 04 50 4B 05 06 50 4B 07 08 | 0 | Wikipedia |
| 50 4B 03 04 14 00 06 00 | 0 | Gary Kessler |
Extension
.docx
MIME Type
application/octet-stream
Byte Offset
0
Risk Level
Safe
Validation Code
How to validate .docx files in Python
def is_docx(file_path: str) -> bool:
"""Check if file is a valid DOCX by magic bytes."""
signature = bytes([0x50, 0x4B, 0x03, 0x04, 0x50, 0x4B, 0x05, 0x06, 0x50, 0x4B, 0x07, 0x08])
with open(file_path, "rb") as f:
return f.read(12) == signature
How to validate .docx files in Node.js
function isDOCX(buffer: Buffer): boolean {
const signature = Buffer.from([0x50, 0x4B, 0x03, 0x04, 0x50, 0x4B, 0x05, 0x06, 0x50, 0x4B, 0x07, 0x08]);
return buffer.subarray(0, 12).equals(signature);
}
How to validate .docx files in Go
func IsDOCX(data []byte) bool {
signature := []byte{0x50, 0x4B, 0x03, 0x04, 0x50, 0x4B, 0x05, 0x06, 0x50, 0x4B, 0x07, 0x08}
if len(data) < 12 {
return false
}
return bytes.Equal(data[:12], signature)
}
API Endpoint
/api/v1/docx
curl https://filesignature.org/api/v1/docx
See the full API documentation for all endpoints and parameters.
Frequently Asked Questions
What is a .docx file?
A .docx file is a Microsoft Office Open XML Format file. Microsoft Office Open XML Format (OOXML) DocumentNOTE:There is no subheader for MS OOXML files as there is with DOC, PPT, and XLS files. To better understand the format of these files,rename any OOXML file to have a .ZIP extension and then unZIP the file;look at the resultant file named[Content_Types].xmlto see the contenttypes. In particular, look for the<Override PartName=tag, where youwill findword,ppt, orxl, respectively.Trailer:Look for50 4B 05 06(PK..) followed by 18 additional bytesat the
What are the magic bytes for .docx files?
The magic bytes for Microsoft Office Open XML Format files are 50 4B 03 04 50 4B 05 06 50 4B 07 08 at byte offset 0. These bytes uniquely identify the file format regardless of the file extension.
How do I validate a .docx file?
To validate a .docx file, read the first bytes of the file and compare them against the known magic bytes (50 4B 03 04 50 4B 05 06 50 4B 07 08) at offset 0. This is more reliable than checking the file extension alone, as extensions can be renamed.
What is the MIME type for .docx files?
There is no officially registered MIME type for .docx files. Systems typically use application/octet-stream as a generic fallback when handling this format.
Is it safe to open .docx files?
Microsoft Office Open XML Format (.docx) files are generally safe to open. They are classified as low risk because they primarily contain data rather than executable code. However, always ensure files come from a trusted source.