Forensic field glossary

What this page is for

When ExistBefore extracts metadata from a file, the result is a list of technical fields: dateTimeOriginal, producer, exifByteOrder, and so on. This glossary explains, in plain language, what each field reveals to a forensic analyst — and which combinations are integrity signals worth investigating.

Catalog version: existbefore-rationale-v2. Each entry has a stable anchor (#field-name) so the in-app tooltips, the PDF certificate footer, and external tooling can deep-link to it. Catalog parity with the canonical metadata catalog (existbefore-metadiff-v1) is verified at build time.

The absence of a finding is not proof of authenticity. These fields document evidence; they do not issue verdicts.

File basics

File size sizeBytes
Total file size in bytes as observed by the browser. Cross-references with declared content (page count, dimensions, duration) to flag truncated or padded files; a size that disagrees with prior revisions of the same logical document signals re-encoding or content drift.
MIME type mimeType
Internet media type declared for the file (e.g. image/jpeg, application/pdf). Used together with the raw header (first 64 bytes) to detect MIME-spoofing attacks (see F19), where a hostile file claims one type while its bytes match another.

Capture & device

Capture timestamp dateTimeOriginal
The exact moment the camera firmware recorded the photo. Cameras write this once, at capture, and editors typically preserve it. Mismatch with file system timestamps or with XMP/IPTC dates is a strong forensic signal.
EXIF digitization timestamp dateTimeDigitized
Moment the analog source (film, scanner) was converted to digital, or — for native-digital captures — the moment the camera firmware finalized the image. For digital cameras it usually equals dateTimeOriginal; large divergence is a strong signal of post-capture manipulation.
EXIF file timestamp dateTime
Last modification timestamp written into the EXIF header (often by an editor on save). Distinct from dateTimeOriginal, which records the capture moment. A file timestamp newer than the capture timestamp is normal for edited files; the inverse is logically impossible (see F03).
Camera vendor make
Vendor declared by the camera firmware. Combined with model and EXIF byte order, it fingerprints the device. A vendor that disagrees with the EXIF byte order it normally writes (see F21) suggests the EXIF block was re-encoded or the field was forged.
Camera model model
Model declared by the camera firmware. Cross-references with vendor and known capture defaults; useful in chain-of-custody analysis when comparing several photos that should come from the same device.
Last writing application software
Application that wrote the EXIF block. Camera firmware names (e.g. "Apple iOS 17.5") indicate an unedited capture; image editor names (Photoshop, Lightroom, GIMP, Affinity) indicate a derivative work. Disagreement with XMP creator tool signals a multi-tool chain of custody.
XMP creator tool xmpCreatorTool
Application that wrote the XMP block. Usually the editor that last saved the file. Independent of the EXIF software field; comparing the two reveals whether a single tool or a chain of tools touched the file (see F20).
EXIF byte order exifByteOrder
EXIF blocks are tagged either little-endian (Intel, II) or big-endian (Motorola, MM). Each camera vendor has a stable native order. Disagreement between the byte order and the claimed vendor (see F21) is consistent with downstream re-encoding or a forged Make field.
EXIF orientation flag orientation
How the image should be rotated/mirrored for display, written by the camera based on the gravity sensor at capture. Cosmetic rather than forensic, but a value that disagrees with the visible orientation may indicate a re-saved file whose pixels were rotated without updating the flag.
EXIF flash status flash
Whether the camera flash fired at capture, and how (auto, forced, suppressed, red-eye reduction). Combined with exposure parameters, it documents lighting conditions; a flash-fired flag on a brightly-lit outdoor scene is unusual but not by itself indicative of tampering.
EXIF white balance mode whiteBalance
Auto vs manual white-balance setting at capture. A capture parameter for forensic completeness; useful when comparing several photos from the same alleged session to confirm consistent camera configuration.
EXIF exposure mode exposureMode
Auto vs manual exposure setting at capture. A capture parameter for forensic completeness; combined with ISO, aperture, and shutter speed it characterizes how the camera was configured at the moment of capture.
EXIF ISO sensitivity iso
Sensor sensitivity setting at capture (e.g. 100, 800, 6400). Capture parameter; cross-references with shutter speed and aperture to validate consistency. Editors that strip and re-write the EXIF block sometimes lose this value.
EXIF f-number (aperture) fNumber
Lens aperture at capture (e.g. f/2.8, f/8). Capture parameter recorded by the camera; useful in chain-of-custody analysis when comparing photos that should come from the same lens or session.
EXIF exposure time (shutter speed) exposureTime
Shutter speed at capture (e.g. 1/250 s). Capture parameter recorded by the camera; combined with f-number, ISO, and focal length it forms the exposure triangle visible in image-EXIF cross-checks.
EXIF focal length focalLength
Lens focal length at capture, in millimetres of the sensor's native format. Useful when verifying the lens used; an abrupt change between consecutive frames of a single session is unusual but not by itself a tampering signal.
EXIF lens model lensModel
Manufacturer-declared lens identifier (e.g. "EF24-70mm f/2.8L II USM"). Cross-references with focal length and aperture range; a lens that does not exist or is incompatible with the declared camera body is a strong tampering signal.
EXIF color space colorSpace
Color profile assigned by the camera or editor (sRGB, Adobe RGB, ProPhoto). Forensically informational; an unexpected color space (e.g. ProPhoto on a smartphone capture) is consistent with a desktop-editor round-trip.
IPTC capture date iptcDateCreated
Capture date stored in the IPTC IIM block, often used by photo agencies and newsrooms. Mismatch with EXIF DateTimeOriginal beyond 24 hours (see F01) suggests manual editing or partial metadata rewrite.
XMP create date xmpCreateDate
Creation date stored in the Adobe XMP block. Mismatch with EXIF DateTimeOriginal beyond 24 hours (see F02) is a re-encoding or tampering signal.
XMP edit history xmpHistory
Chain of edit actions recorded by Adobe-compatible tools. Useful provenance signal — its presence is not a tampering indicator by itself, but it documents what was done to the file and when.

Editorial metadata (IPTC)

IPTC headline iptcTitle
Short headline written into the IPTC IIM block, often added by photo agencies and editorial workflows. Mismatch with XMP title (see F20) signals a multi-tool chain of custody where each tool wrote into its own metadata block.
IPTC caption iptcCaption
Long-form caption from the IPTC IIM block, typical of newsroom and stock-photo workflows. Its presence indicates a curated/published image rather than a raw camera capture.
IPTC keywords iptcKeywords
Keyword tags from the IPTC IIM block, used by digital asset management systems for search and categorization. Their presence signals a managed editorial workflow; absence is normal for camera originals.
IPTC author / by-line iptcAuthor
Photographer or author credit from the IPTC IIM block (the "by-line" field). Cross-references with XMP creator and EXIF author; disagreement signals a multi-tool chain of custody or a delegated re-export.
IPTC copyright notice iptcCopyright
Copyright string written into the IPTC IIM block, often added at editorial export. Independent of XMP copyright; the two should usually agree, and disagreement may indicate a partial re-write of the metadata blocks.
IPTC credit line iptcCredit
Credit line (e.g. "Photo: Reuters / Joe Smith") from the IPTC IIM block. Editorial provenance signal; useful in fact-checking workflows when verifying the claimed source of an image.
IPTC source iptcSource
Original source/agency identifier from the IPTC IIM block (distinct from the credit line). Editorial provenance signal; disagreement with the credit line is unusual and worth noting.
IPTC city iptcCity
City of capture from the IPTC IIM block, typically added in editorial post-processing. Privacy-sensitive location signal; cross-references with GPS coordinates and a divergence is a strong indicator of metadata editing.
IPTC state / province iptcState
State or province of capture from the IPTC IIM block. Privacy-sensitive location signal; combined with city and country it gives a coarse geographic claim that should be consistent with GPS when both are present.
IPTC country iptcCountry
Country of capture from the IPTC IIM block, typically a 2-letter ISO code or full name. Privacy-sensitive location signal; cross-references with GPS coordinates to validate the claimed geographic context.

XMP metadata

XMP creator xmpCreator
Photographer or author credit from the XMP block (Dublin Core dc:creator). Independent of IPTC author and EXIF artist; the three usually agree on a camera original and may legitimately diverge on a multi-tool edited file.
XMP modify date xmpModifyDate
Last modification timestamp from the XMP block. Distinct from xmpCreateDate and from EXIF dateTime; mismatch with the file system modification date can indicate the file was metadata-edited without rewriting the bytes.
XMP rating (1–5 stars) xmpRating
User rating from 1 to 5 stars (or -1 for rejected) from the XMP block. Indicates the file passed through a digital asset management workflow; cosmetic rather than forensic, but its presence rules out a raw camera capture.
XMP color label xmpLabel
User-assigned color label (e.g. "Red", "Green") from the XMP block. Used by Lightroom and similar DAM tools to mark workflow status. Indicates the file passed through such a workflow.

Location

GPS coordinates gps
Latitude and longitude embedded in the EXIF GPS block. High-fidelity location data — consider whether to publish the file with these still embedded (the "Include GPS" toggle removes them from the certificate; see F15).
GPS date stamp gpsDateStamp
Day-precision UTC date associated with the GPS fix. When it disagrees with EXIF DateTimeOriginal beyond 24 hours (see F17), GPS may have been injected after capture or the EXIF date may have been rewritten in isolation.

PDF documents

Page count pages
Number of pages in the document. Useful as a baseline integrity check: a page count that disagrees with what you expect to receive can indicate a swapped file.
PDF version pdfVersion
PDF specification version declared in the file header (e.g. "1.4", "1.7", "2.0"). Useful for reader compatibility; a version that disagrees with the producer's expected output may indicate a re-save through a non-default toolchain.
PDF producer producer
Library that wrote the PDF bytes. Distinct from the creator (the application that originated the content). Examples: "Adobe PDF Library 17.0", "Skia/PDF m120". Sudden changes between revisions of the same document indicate a re-save through a different toolchain.
PDF creator creator
Application that originated the PDF content. Examples: "Microsoft Word", "LibreOffice", "Pages". A creator that disagrees with the producer is normal (Word generated, then re-saved by Acrobat) but worth noting in chain-of-custody analysis.
PDF author author
Document author from the PDF Info dictionary. Self-declared and trivially editable; useful as a chain-of-custody hint but never as proof of authorship. Often left as "Unknown" or the system username of the machine that exported the PDF.
PDF title title
Document title from the PDF Info dictionary. Often defaults to the filename of the source document; an empty or system-default title is normal, while a custom title indicates the author actively set it during export.
PDF subject subject
Document subject from the PDF Info dictionary (a short summary of contents). Rarely populated; its presence indicates the author actively used document properties during export.
PDF keywords keywords
Comma-separated keyword list from the PDF Info dictionary. ExistBefore certificates also store machine-readable structured data here ("existbefore-pdf-meta-vN"); their absence on a real-world PDF is normal.
PDF creation date creationDate
Timestamp the document creator wrote into the PDF Info dictionary. Not signed; trivial to back-date. Useful as a self-declared anchor only — combine with the ExistBefore attestation timestamp for trustworthy temporal evidence.
PDF modification date modificationDate
Timestamp of the last save. When this precedes the creation date (see F03), the metadata is logically impossible and has been forged or reset.
PDF encryption encryption
Indicates the PDF is protected by a password, an owner permission, or a DRM scheme. Informational signal only; encrypted PDFs may still have all the forensic anomalies of unencrypted ones.
PDF fast-web-view (linearized) linearized
Linearized PDFs are reorganized so the first page can be displayed before the entire file is downloaded. Common in web-served PDFs; absence is normal for PDFs produced by office suites.
PDF embedded JavaScript javascript
Whether the PDF contains active JavaScript (forms, animations, dynamic behaviour). A security and authorship signal: static documents rarely need JavaScript, and its presence (see F07) warrants inspection in a sandboxed reader.
PDF embedded files embeddedFiles
Whether the PDF carries other files inside (attachments, source data). Common in technical documents that bundle their source spreadsheet; their presence (see F09) warrants inspection because attachments can carry executables or sensitive data.
PDF form fields formFields
Whether the PDF includes interactive AcroForm or XFA form fields. Indicates a form/template document rather than a static report; cosmetic rather than forensic, but useful when verifying the document type.
PDF annotations annotations
Whether the PDF has annotations (sticky notes, highlights, ink, comments). Indicates the file went through an annotation/review workflow after creation; cosmetic rather than forensic but useful chain-of-custody context.
PDF incremental updates incrementalUpdates
Whether the PDF carries one or more incremental updates (revisions appended to the original bytes). Their presence (see F08) means the file was modified after initial creation and the previous revision is recoverable from the byte stream.

Media (audio & video)

Media duration duration
Total playback duration of the media file (audio or video) in seconds. Useful as a baseline integrity check: a duration that disagrees with the expected content length may indicate a truncated or extended file.
Media codec codec
Compression scheme used by the media file (e.g. H.264, H.265, AAC, Opus). Useful when verifying the toolchain that produced the file; an unusual codec for the declared device is consistent with a re-encoding step.
Video codec videoCodec
Video stream compression scheme (e.g. H.264 / AVC, H.265 / HEVC, AV1, VP9). Independent of the audio codec; abrupt changes between segments of a single source file are consistent with a re-encoding chain.
Audio codec audioCodec
Audio stream compression scheme (e.g. AAC, Opus, MP3, FLAC). Independent of the video codec; useful when validating that an audio-video file came from the expected toolchain.
Video framerate framerate
Frames per second of the video stream (e.g. 24, 25, 29.97, 60). Capture parameter recorded at encoding time; cross-references with the declared device to validate that the framerate is one the device natively supports.
Media bitrate bitrate
Average bitrate of the media stream in bits per second. Useful as a quality/compression indicator; a bitrate that disagrees with the declared codec preset can indicate a re-encoding step that changed the quality envelope.
Video rotation rotation
Display rotation flag for the video stream (0°, 90°, 180°, 270°). Written by the device based on its orientation at capture. A value that disagrees with the visible orientation can indicate a re-encode that rotated pixels without updating the flag.
Audio artist artist
Artist tag from the audio file metadata (ID3, Vorbis comment, MP4 atom). Self-declared; useful as a chain-of-custody hint but never as proof of authorship.
Audio album album
Album tag from the audio file metadata. Self-declared; combined with year and track number it characterizes the source publication of the audio file.
Audio year year
Release year tag from the audio file metadata. Self-declared and trivially editable; mismatch with the modification timestamp can indicate the metadata was set after the fact.
Copyright string from the audio file metadata. Self-declared; useful in chain-of-custody analysis when validating the claimed publisher.
Audio comment comment
Free-form comment field from the audio file metadata. Often used by encoders to declare themselves (e.g. "Lavf58.76.100"); useful as a toolchain fingerprint.
Audio sample rate sampleRate
Sample rate of the audio stream in Hertz (e.g. 44100, 48000). Capture parameter; an unusual sample rate for the declared format is consistent with a re-sampling step in the encoding chain.
Audio channel count channels
Number of audio channels (1 = mono, 2 = stereo, 6 = 5.1 surround). Capture parameter; useful when validating that the channel layout matches the expected source recording configuration.
Audio genre genre
Genre tag from the audio file metadata. Self-declared and cosmetic; useful as a chain-of-custody hint but never forensically authoritative.
Audio track number trackNumber
Track number within the album, from the audio file metadata. Self-declared; combined with album and year it characterizes the source publication.
Audio encoder encoder
Software that encoded the audio file (e.g. "LAME 3.100", "libfdk_aac"). Toolchain fingerprint; useful when validating that the encoder matches the expected production workflow.

Office documents

Office revision number revision
Number of times the document has been saved. Revision 1 with thousands of words (see F12) is unusual for authored content and consistent with a paste from another source.
Office total editing time totalTime
Cumulative minutes the document has been open in editing mode. Zero minutes with substantial content (see F11) is consistent with paste-from-external or plagiarized content.
Office word count words
Word count tracked by the office suite. Cross-references with totalTime and revision number to assess whether the editing footprint matches the apparent work product.
Office paragraph count paragraphs
Number of paragraphs tracked by the office suite. Cross-references with word count, total editing time, and revision number to assess whether the editing footprint matches the apparent work product.
Office last-modified-by lastModifiedBy
Username of the account that last saved the document, recorded by the office suite. Privacy-sensitive identity signal; useful in chain-of-custody analysis to determine whether the editor matches the declared author.
Office company company
Organization name configured on the editing machine, embedded by the office suite at save time. A chain-of-custody hint; the value can leak the editor's affiliation even when the visible content is anonymized.
Office manager manager
Manager name configured on the editing machine, embedded by some office suites in document properties. Rarely populated; a chain-of-custody hint when present.
Office creation date created
Timestamp the office suite recorded when the document was first created. Self-declared; trivial to back-date by changing the system clock. Combine with the ExistBefore attestation timestamp for trustworthy temporal evidence.
Office modification date modified
Timestamp of the last save in the office suite. When this precedes the creation date (see F03), the metadata is logically impossible and has been forged or reset.
Office application name application
Name and version of the office suite that wrote the file (e.g. "Microsoft Office Word", "LibreOffice/7.6"). Toolchain fingerprint; useful when verifying that the file is consistent with the declared editing workflow.
Office template template
Path to the template the document was based on (e.g. "Normal.dotm"). Often left as the default; a custom template path can leak the editor's filesystem layout and affiliation.
Office content status contentStatus
Document status declared by the author (e.g. "Draft", "Final", "In review"). Self-declared; useful as a workflow hint but never forensically authoritative.

C2PA Content Credentials

C2PA Content Credentials validation c2paValidation
Result of validating the embedded C2PA Content Credentials manifest. "Valid" means the cryptographic chain checks out for this file. Anything else (see F06) is a stronger negative signal than no manifest at all.
C2PA claim generator c2paClaimGenerator
Software that produced the C2PA Content Credentials manifest (e.g. "Adobe Photoshop 25.1", "Leica M11-P firmware 2.0"). Toolchain fingerprint authenticated by the manifest signature when validation succeeds.
C2PA title c2paTitle
Title declared inside the C2PA Content Credentials manifest (distinct from the file's other title fields). Authenticated by the manifest signature when validation succeeds.
C2PA author c2paAuthor
Author/creator declared inside the C2PA Content Credentials manifest. Authenticated by the manifest signature when validation succeeds; one of the strongest authorship signals available for digital media.
C2PA copyright notice c2paCopyright
Copyright string declared inside the C2PA Content Credentials manifest. Authenticated by the manifest signature when validation succeeds; independent of EXIF / IPTC / XMP copyright fields.
C2PA actions log c2paActions
List of editing actions recorded inside the C2PA Content Credentials manifest (e.g. "c2pa.created", "c2pa.color_adjusted"). Provenance signal authenticated by the manifest signature when validation succeeds.
C2PA signer issuer c2paSignerIssuer
Issuer of the X.509 certificate that signed the C2PA Content Credentials manifest. Identifies the trust anchor for the C2PA chain; useful when verifying the file's signer is in the expected trust list.
C2PA signature time c2paSignatureTime
Timestamp at which the C2PA Content Credentials manifest was signed. Authenticated by the manifest signature when validation succeeds; provides cryptographically-anchored temporal evidence independent of EXIF/XMP/file system dates.

Universal forensic

File header (first 64 bytes) rawHeader
First 64 bytes of the file, hex-encoded. Used to verify that the bytes match the magic-bytes signature for the claimed MIME type (see F19). Universal across file types — extracted at the dispatcher boundary regardless of MIME.

How findings cite glossary entries

The Forensic Integrity Findings rules (F01–F21) reference glossary entries by name. For example, F21 ("device byte order disagreement") cites make and exifByteOrder. Use these anchors when sharing analysis with collaborators or in legal correspondence.

Italian version: glossario campi forensi. See also: how it works, technical overview, FAQ.