--nextPart54087508.ELBvalPpTO
Content-Type: text/plain;
charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline
Quoth Ilmari Heikkinen:
> On 9/13/07, Konrad Meyer <konrad@tylerc.org> wrote:
> >
> > Any chance this could be expanded to add FLAC and OGG support?
> >
> > Thanks!
> > --
> > Konrad Meyer <konrad@tylerc.org> http://konrad.sobertillnoon.com/
> >
>=20
> Done! Along with a preliminary list of supported formats and some cleanup.
> Thanks for the request!
>=20
>=20
> tarball: http://dark.fhtr.org/repos/metadata/metadata-0.2.tar.gz
> git: http://dark.fhtr.org/repos/metadata/
>=20
>=20
> Changelog:
>=20
> README:
> * added flacinfo, wmainfo, mp4info and ogginfo to the list of
> dependencies
> * prelim list of supported formats
> lib/metadata/extract.rb:
> * .ps.gz support
> * list archive contents
> * remove null fields from output
> * support for flac and ogg
> * untested support for wma and m4a
> * Audio.Bitrate now in kbps to match shared-filemetadata-spec
>=20
>=20
> Description
> -----------
>=20
> This package `Metadata' comes with a library called `metadata' and
> a small program called `mdh'.
>=20
> The library probes files for their metadata (e.g. jpeg dimensions
> and camera make, mp3 artist, pdf word count) and returns the metadata
> as a Hash.
>=20
> Mdh can print out file metadata as YAML and package the metadata
> with the file.
>=20
> This package has many dependencies since there is no single universal
> metadata header format that all files use. Blame resource forks, filena=
me
> extensions, bags of bytes and mimetypes.
>=20
>=20
> Usage
> -----
>=20
> # print out metadata header
> mdh -p myfile.jpg
>=20
> # create myfile.jpg.mdh, which consists of metadata header + myfile.jpg
> mdh myfile.jpg
>=20
> # print out metadata header from mdh file
> mdh -e -p myfile.jpg.mdh
>=20
> # strip out metadata header from mdh file and save it to myfile.jpg
> mdh -e myfile.jpg.mdh
>=20
> irb> Metadata.extract('myfile.jpg')
> irb> Metadata.extract_text('myfile.jpg')
> irb> Pathname.new("myfile.jpg").metadata
>=20
>=20
> List of supported formats
> -------------------------
>=20
> Audio:
> Successfully tested with:
> mp3, flac, ogg, wav
> Should also work:
> wma, m4a
>=20
> Video:
> What you manage to make mplayer play, which can be just about anythin=
g.
> Then again, missing title and author data, etc. (do videos even have=
=20
those?)
> Successfully tested with:
> wmv, mov, divx, xvid, flv, ogm, mpg
>=20
> Images:
> Should handle pretty much anything (apart from XCF and ORF.)
> Successfully tested with:
> jpeg, png, gif, nef, dng, crw, pef, psd
>=20
> Documents:
> Successfully tested with:
> pdf, ppt, odp, sxi, ps, ps.gz, html, txt
> Should work:
> - OpenOffice docs work to some degree (personally, I'm using unoconv =
to
> convert OO docs to temp PDFs for the text & dimensions extraction, =
so
> those bits of data are missing.)
> - MS Office docs to some degree (ppt at least, doc and xls should wor=
k=20
too,
> dimensions missing due to the above temp PDF -thing.)
>=20
> Others:
> Whatever extract spits out on the five or six bits of metadata I'm us=
ing
> from it. Archive contents at least.
>=20
> Requirements
> ------------
>=20
> * Ruby 1.8
>=20
> * Tons of metadata extraction programs and libs,
> list of gems:
> flacinfo-rb
> wmainfo-rb
> MP4info
> list of debian packages:
> dcraw
> libimlib2-ruby
> extract
> libimage-exiftool-perl
> poppler-utils
> mplayer
> html2text
> imagemagick
> unhtml
> pstotext
> antiword
> catdoc
> shared-mime-info
> vorbis-tools
>=20
> * You do want to install the latest versions of dcraw and
> shared-mime-info to be able to handle camera raw images.
> http://cybercom.net/~dcoffin/dcraw/
> http://freedesktop.org/wiki/Software/shared-mime-info
>=20
> * Python + chardet library
> http://chardet.feedparser.org/
>=20
> Install
> -------
>=20
> De-compress archive and enter its top directory.
> Then type:
>=20
> ($ su)
> # ruby setup.rb
>=20
> These simple step installs this program under the default
> location of Ruby libraries. You can also install files into
> your favorite directory by supplying setup.rb some options.
> Try "ruby setup.rb --".
>=20
>=20
> License
> -------
>=20
> Ruby's
>=20
>=20
> Ilmari Heikkinen <ilmari.heikkinen gmail com>
> http://fhtr.blogspot.com
Hmm, am I not seeing it (just using 'mdh -p') or can metadata.rb extract
stuff like artist, title, album, track, and whatnot from ogg/flac?
=2D-=20
Konrad Meyer <konrad@tylerc.org>
http://konrad.sobertillnoon.com/
--nextPart54087508.ELBvalPpTO
Content-Type: application/pgp-signature; name=signature.asc
Content-Description: This is a digitally signed message part.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
iD8DBQBG6tYICHB0oCiR2cwRAvfQAKDH0paIyE+d2pV5rw+Xgt BcjKW5vQCggiiV
yV8vD3aAT+PlYIdnC7NzyE0=
=DxwV
-----END PGP SIGNATURE-----
--nextPart54087508.ELBvalPpTO--