2.10.0

Improve WAV parser by focusing on performance rather than on attempting a best-effort when extracting metadata from files that do not strictly follow the format spec.

2.9.0

Improve WAV parser by performing a best-effort when extracting metadata from files that do not strictly follow the format spec.

2.8.0

Add support for Ruby 3.2 and 3.3.

2.7.2

Improved stability for mp4 parser when dealing with corrupted FTYP boxes.

2.7.1

Fixed bug with WAV file wrongly parsed as MP3.

2.7.0

JSON format support.

2.6.0

Prevent the default loading of thumbnails on TIFF-based formats to improve I/O.

2.5.0

Add avc1 and xavc as brand codes in the mp4 format parser to allow more file types to be parsed correctly.

2.4.5

Disable udta ISOBMFF box parsing, since their contents are not guaranteed to be consistent with the spec.

2.4.4

Prevent infinite loops when parsing ISOBMFF boxes with size = 0 (meaning that the box extends to the end of the file).

2.4.3

Improve resiliency in ISOBMFF parsing to missing mandatory boxes and fields.
Simplify ISOBMFF frame rate calculations.
Refactor.

2.4.2 (yanked)

Added support for PDF 2.0
Expanded test coverage for PDF parsing

2.4.1 (yanked)

Revert change where variable frame rates in MOV and MP4 files would result in an array value for frame_rate.

2.4.0 (yanked)

Adapt the ISOBMFF based decoder for parsing MOV and MP4 parsing.
Fix MOV/MP4 issues:
- MP4 files being misidentified as MOV files.
- Dimensions being miscalculated when files include multiple tracks or transformations.

2.3.0

Add support for RW2 files.

2.2.1

Bug fix for CR3 files being misidentified as MOOV.

2.2.0

Add support for CR3 files.
Add ISO base file format decoding functionality.

2.1.0

Require minimum 2.6 ruby version.
Bring back 2.6 to test matrix, we have jruby there which is still compatible with 2.6
Drop ks dependency.

2.0.0

Drop explicit support for Ruby <2.7.
Drop faraday dependencies.
Loosen version constraints on other dependencies.
Update measurometer metrics for consistency and clarity.

1.7.0

Add support for ARW files.

1.6.0

Add support for AAC files.

1.5.0

Add support for NEF files.

1.4.2

Fix MP3Parser taking precedence when parsing WEBP files.

1.4.1

Skip Exif chunks that are malformed during WEBP parsing.

1.4.0

Add support for WEBP lossy, lossless and extended file formats.

1.3.0

Add heif_parser and support for HEIF and HEIC formats. Exif parsing is still missing.

1.2.1

Resolve bug when stts atom is nil

1.2.0

Add support for codecs in moov_parser for video metadata

1.1.0

Add support for frame_rate in moov_parser

1.0.0

Dropping support for Ruby 2.2.X, 2.3.X and 2.4.X
MP3: Fix negative length reads in edge cases by bumping id3tag version to v0.14.2

0.29.1

Fix handling of 200 responses with parse_http as well as handling of very small responses which do not need range access

0.29.0

Add option headers: to FormatParser.parse_http

0.28.0

Change FormatParser.parse_http to follow HTTP redirects

0.27.0

Add #content_type on Result return values which makes sense for the detected filetype

0.26.0

Add support for M3U format files

0.25.6

Fix FormatParser.parse (with results: :first) to be deterministic

0.25.5

DPX: Fix DPXParser to support images without aspect ratio

0.25.4

MP3: Fix MP3Parser to return nil for TIFF files
Add support to ruby 2.7

0.25.3

MP3: Fix parser to not skip the first bytes if it's not an ID3 header

0.25.2

Hotfix Moov parser

0.25.1

MOV: Fix error "negative length"
MOV: Fix reading dimensions in multi-track files
MP3: Fix parse of the Xing header to not raise errors

0.25.0

MP3: add suport to id3 v2.4.x
JPEG: Update gem exifr to 1.3.8 to fix a bug

0.24.2

Update gem id3tag to 0.14.0 to fix MP3 issues

0.24.1

Fix MP3 frames reading to jump correctly to the next bytes

0.24.0

The TIFF parser will now return :arw as format for Sony ARW files insted of :tif so that the caller can decide whether it wants to deal with RAW processing or not

0.23.1

Updated gem exifr to fix problems related to jpeg files from Olympos microscopes, which often have bad thumbnail data

0.23.0

Add ActiveStorage analyzer which can analyze ActiveStorage blobs. Enable it by setting config.active_storage.analyzers.prepend FormatParser::ActiveStorage::BlobAnalyzer
Ignore empty ID3 tags and do not allow them to overwrite others
Update the id3tag dependency so that we can fallback to UTF8 instead of raising an error when parsing MP3 files

0.22.1

Fix Zip parser to not raise error for invalid zip files, with an invalid central directory

0.22.0

Adds option stringify_keys: true to #as_json methods (fix #151)

0.21.1

MPEG: Ensure parsing does not inadvertently return an Integer instead of Result|nil
MPEG: Scan further into the MPEG file than previously (scan 32 1KB chunks)
MPEG: Ensure the parser does not raise an exception when there is no data to read for scanning beyound the initial header

0.21.0

Adds support for MPEG video files

0.20.1

Make sure EXIF results work correctly with ActiveSupport JSON encoders

0.20.0

Correctly tag the license on Rubygems as MIT (Hippocratic) for easier audit

0.19.0

Improve handling of Sony ARW files (make sure the width/height is correctly recognized)
Update Travis matrix and gitignore

0.18.0

Mark m4v as one of the filename extensions likely to parse via the MOOV parser
Adopt Hippocratic license v. 1.2 Note that this might make the license conditions unacceptable for your project. If that is the case, you can use the 0.17.X branch of the library which stays under the original, exact MIT license.

0.17.0

Remove parser factories. A parser should respond to likely_match? and call. If a parser has to be instantiated anew for every call the parser should take care of instantiating itself.
Add support for BMP files with core headers (older version of the BMP format)

0.16.2

All EXIF: Deal with EXIF orientations that get parsed as an array of [Orientation, nil] due to incorrect padding

0.16.1

All EXIF: Make sure the 0 orientation does not get silently treated as orientation 8, mislabling images which are not rotated as being rotated (orientation changed)
All EXIF: Make sure the 0 orientation (unknown) is correctly passed and represented
JPEG: Make sure multiple EXIF tags in APP1 markers get handled correctly (via overlays)

0.16.0

Add filename_hint keyword argument to FormatParser.parse. This can hint the library to apply the parser that will likely match for this filename first, and the other parsers later. This helps avoiding extra work when parsing less-popular file formats, and can be optionally used if the caller knows the filename of the original file. Note that the filename is only that: a hint, it helps apply parsers more efficiently but does not specify the actual format of the file that is going to be detected.

0.15.1

Relax the "ks" dependency version since we do not need the constraint to be so strict

0.15.0

Allow setting :priority when registering a parser, to make sure certain parsers are applied earlier - depending on detection confidence and file format popularity at WT.

0.14.1

Care caching: Clear pages more deliberately instead of relegating them to GC
JPEG: Clear the EXIF buffer explicitly

0.14.0

PDF: Reduce the PDF parser to the basic binary detection (PDF/not PDF) until we have a better/more robust PDF parser
MP3: Fix the byte length of MPEG frames calculation to correctly account for ID3V1 and ID3V2 instead of ID3V1 twice
MP3: Remove the workaround for id3tag choking on non-matching genre strings (bumps dependency on id3tag)
Use Measurometer provided by the measurometer gem
Ogg: Add support for the Ogg format

0.13.6

Make all reads in the MOOV decoder strict - fail early if reads are improperly sized
Disable parsing for udta atoms in MP4/MOV since we do not have a good way of parsing them yet

0.13.5

Use the same TIFF parsing flow for CR2 files as it seems we are not very reliable yet. The CR2 parser will need some work.

0.13.4

Make sure JSON data never contains NaN, fix the test that was supposed to verify that but didn't
Forcibly UTF-8 sanitize all EXIF data when building JSON

0.13.3

Add a fixture to make sure all parsers can cope with an empty file when using parse_http
Terminate the ZIP parser early with empty input
Terminate the MP3 parser early with empty or too small input

0.13.2

Handle BMP files with pixel array offsets larger than 54

0.13.1

Avoid ZIP checks in the JPEG parser which are no longer necessary

0.13.0

Replace the homegrown ID3 parser with id3tag - this introduces id3tag as a dependency in addition to exifr, but the gains are substantial.

0.12.4

Ensure JPEG recognition only runs when the JPEG SOI marker is detected at the start of file. Previously the JPEG parser would scan for the marker, sometimes finding it (appropriately) in places like... MP3 album artwork inside ID3 tags. Or Keynote documents. Or whatnot - lots of things have JPEG thumbnails embedded.

0.12.3

Make sure all strings going to the JSON representations of parse results are encoded as UTF-8 or escaped

0.12.2

Make sure the VERSION constant is available in the loaded gem. Previously the constant would be made available by Bundler when developing the library - since it loads the .gemspec which, in turn, requires the version.rb file, but when used as a gem the version.rb file would not end up being loaded.

0.12.1

Reinstate support for Ruby 2.2.0
Fix support for JRuby 9.0

0.12.0

Relay upstream status from RemoteIO in the status_code attribute (returns an Integer)

0.11.0

Add Image#display_width_px and Image#display_height_px for EXIF/aspect corrected display dimensions, and provide those values from a few parsers already. Also make full EXIF data available for JPEG/TIFF in intrinsics[:exif]
Adds limits_config option to FormatParser.parse() for tweaking buffers and read limits externally

0.10.0

Adds the format_parser_inspect binary for parsing a file from the commandline and returning results in JSON
Adds the FormatParser.parse_at(path) convenience method

0.9.4

Fix a TIFF parsing regression introduced in 0.3.1 that led to all TIFFs being incorrectly parsed

0.9.3

Fix a JPEG parsing regression introduced in 0.9.1

0.9.2 (yanked)

Make sure MP3 parser returns nil when encountering infinite duration
Do not read JPEG APP1 markers that contain no EXIF data
Explicitly replace Float::INFINITY values in AttributesJSON with nil as per JSON convention
Make sure the cached pages in Care are explicitly deleted after each parse call (should help GC)
Raise the pagefaults restriction to 16 to cope with "too many useless markers in JPEGs" scenario once more

0.9.1 (yanked)

Perf: Make JPEG parser bail out earlier if no marker is found while scanning through 1024 bytes of data

0.9.0

Add a parser for the BMP image file format

0.8.0

Add Measurometer for applying instrumentation around FormatParser operaions. See documentation for usage.

0.7.0

Configure read limits / pagefault limits centrally so that those limits make sense together

0.6.0

Double the cache page size once more
We no longer need exifr/jpeg
Fix EXIF parsing in JPEG files
Reject Keynote documents in JPEG parser

0.5.2

Do not raise EXIFR errors for keynote files
Correct broken comment for the audio nature

0.5.1

Raise the cache page size during detection
Fix ZIP entry filename parsing

0.5.0

Add FLAC parser
Add parse_atom_children_and_data_fields support
Add basic detection of Office files
Optimize EOCD signature lookup

0.4.0

Adds a basic PDF parser
Make sure root: and to_json without arguments work
ZIP file format support

0.3.5

Fix the bug with EXIF dimensions being used instead of pixel dimensions

0.3.4

Pagefault limit
Add seek modes required by exifr

0.3.3

Implement a sane to_json as well

0.3.2

Add default as_json
Test on 2.5.0

0.3.1

Remove post install warning
Moved aiff_parser_spec.rb to spec/parsers
CR2 file support
Add require 'set' to format_parser.rb
Use register_parser for natures/fmts

0.3.0

Reverse API changes to support :first as default and add opts to parse_http
Implement and comply with rubocop
JPEG parser and Care fixes
Add format and count options to parse_http
Return first result as default
Use hashes for MOOV atom default fields

0.2.0

Implement parser DSL

0.1.7

Fix read(0) on Care::IOWrapper, introduce top-level tests

0.1.6

Fix mp3 parsing bug
Add MOOV parser

0.1.5

Add FDX parser
Remove dry-structs
New interface updates

0.1.4

Add WAV parser

0.1.3

Add MP3 parser
Add FileInformation#intrinsics
Disallow negative Care offsets

0.1.2

Introduce a restrictive IO subset wrapper
Switch rewind for seek in exif parser
Prep for OSS release
Add fuzz spec
Improve orientation parsing
Optimisation for PNG and invalid input protection on JPEG

0.1.1

Add AIFF parser

0.1.0

Add parsers for PNG, JPG, TIFF, PSD
Add GIF parser
Add DPX parser

Files

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

2.10.0

2.9.0

2.8.0

2.7.2

2.7.1

2.7.0

2.6.0

2.5.0

2.4.5

2.4.4

2.4.3

2.4.2 (yanked)

2.4.1 (yanked)

2.4.0 (yanked)

2.3.0

2.2.1

2.2.0

2.1.0

2.0.0

1.7.0

1.6.0

1.5.0

1.4.2

1.4.1

1.4.0

1.3.0

1.2.1

1.2.0

1.1.0

1.0.0

0.29.1

0.29.0

0.28.0

0.27.0

0.26.0

0.25.6

0.25.5

0.25.4

0.25.3

0.25.2

0.25.1

0.25.0

0.24.2

0.24.1

0.24.0

0.23.1

0.23.0

0.22.1

0.22.0

0.21.1

0.21.0

0.20.1

0.20.0

0.19.0

0.18.0

0.17.0

0.16.2

0.16.1

0.16.0

0.15.1

0.15.0

0.14.1

0.14.0

0.13.6

0.13.5

0.13.4

0.13.3

0.13.2

0.13.1

0.13.0

0.12.4

0.12.3

0.12.2

0.12.1