Discussion: [ANN] Metadata 0.3
Afficher un message
Vieux 15/09/2007, 09h33   #2
Konrad Meyer
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: [ANN] Metadata 0.3

--nextPart6653061.QBXH3ZSY6e
Content-Type: text/plain;
charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

Quoth Ilmari Heikkinen:
> On 9/15/07, Konrad Meyer <konrad@tylerc.org> wrote:
> > Quoth Ilmari Heikkinen:
> > > On 9/14/07, Konrad Meyer <konrad@tylerc.org> wrote:
> > > > Hmm, am I not seeing it (just using 'mdh -p') or can metadata.rb=20

extract
> > > > stuff like artist, title, album, track, and whatnot from ogg/flac?
> > >
> > > It should at least. If you're having trouble, lemme know
> > >

> > Yeah, I'm having some trouble. I have latest metadata (0.2).
> >
> > [snip]
> >
> > Any ideas?

>=20
> Yeah, I failed at using git. Jeez. Sorry about that.
> Here's 0.3, it oughta work:
>=20
> tarball: http://dark.fhtr.org/repos/metadata/metadata-0.3.tar.gz
> git: http://dark.fhtr.org/repos/metadata
>=20
>=20
> On 9/15/07, darren kirby <bulliver@badcomputer.org> wrote:
> > Hi Ilmari!
> >
> > Just wanted to mention that despite the name, wmainfo will parse anythi=

ng
> > wrapped in an ASF audio/video container format[0], so, you could use it=

to
> > parse wmv movies as well if your user didn't have mplayer installed.
> >
> > [0] http://en.wikipedia.org/wiki/Advanced_Systems_Format
> >

>=20
> Thanks for the pointer!
> I made it merge the wmainfo output to the mplayer output for wmv and asf.
>=20
>=20
> Description
> -----------
>=20
> This package `Metadata' comes with a library called `metadata' and
> a small program called `mdh'.
>=20
> The library probes files for their metadata (e.g. jpeg dimensions
> and camera make, mp3 artist, pdf word count) and returns the metadata
> as a Hash.
>=20
> Mdh can print out file metadata as YAML and package the metadata
> with the file.
>=20
> This package has many dependencies since there is no single universal
> metadata header format that all files use. Blame resource forks, filena=

me
> extensions, bags of bytes and mimetypes.
>=20
>=20
> Usage
> -----
>=20
> # print out metadata header
> mdh -p myfile.jpg
>=20
> # create myfile.jpg.mdh, which consists of metadata header + myfile.jpg
> mdh myfile.jpg
>=20
> # print out metadata header from mdh file
> mdh -e -p myfile.jpg.mdh
>=20
> # strip out metadata header from mdh file and save it to myfile.jpg
> mdh -e myfile.jpg.mdh
>=20
> irb> Metadata.extract('myfile.jpg')
> irb> Metadata.extract_text('myfile.jpg')
> irb> Pathname.new("myfile.jpg").metadata
>=20
>=20
> List of supported formats
> -------------------------
>=20
> Audio:
> Successfully tested with:
> mp3, flac, ogg, wav
> Should also work:
> wma, m4a
>=20
> Video:
> What you manage to make mplayer play, which can be just about anythin=

g.
> Then again, missing title and author data, etc. (do videos even have=

=20
those?)
> Successfully tested with:
> wmv, mov, divx, xvid, flv, ogm, mpg
>=20
> Images:
> Should handle pretty much anything (apart from XCF and ORF.)
> Successfully tested with:
> jpeg, png, gif, nef, dng, crw, pef, psd
>=20
> Documents:
> Successfully tested with:
> pdf, ppt, odp, sxi, ps, ps.gz, html, txt
> Should work:
> - OpenOffice docs work to some degree (personally, I'm using unoconv =

to
> convert OO docs to temp PDFs for the text & dimensions extraction, =

so
> those bits of data are missing.)
> - MS Office docs to some degree (ppt at least, doc and xls should wor=

k=20
too,
> dimensions missing due to the above temp PDF -thing.)
>=20
> Others:
> Whatever extract spits out on the five or six bits of metadata I'm us=

ing
> from it. Archive contents at least.
>=20
> Requirements
> ------------
>=20
> * Ruby 1.8
>=20
> * Tons of metadata extraction programs and libs,
> list of gems:
> flacinfo-rb
> wmainfo-rb
> MP4info
> list of debian packages:
> dcraw
> libimlib2-ruby
> extract
> libimage-exiftool-perl
> poppler-utils
> mplayer
> html2text
> imagemagick
> unhtml
> pstotext
> antiword
> catdoc
> shared-mime-info
> vorbis-tools
>=20
> * You do want to install the latest versions of dcraw and
> shared-mime-info to be able to handle camera raw images.
> http://cybercom.net/~dcoffin/dcraw/
> http://freedesktop.org/wiki/Software/shared-mime-info
>=20
> * Python + chardet library
> http://chardet.feedparser.org/
>=20
> Install
> -------
>=20
> De-compress archive and enter its top directory.
> Then type:
>=20
> ($ su)
> # ruby setup.rb
>=20
> These simple step installs this program under the default
> location of Ruby libraries. You can also install files into
> your favorite directory by supplying setup.rb some options.
> Try "ruby setup.rb --".
>=20
>=20
> License
> -------
>=20
> Ruby's
>=20
>=20
> --
> Ilmari Heikkinen <ilmari.heikkinen gmail com>
> http://fhtr.blogspot.com


Any chance you could wrap this up as a gem? It's not something I care
strongly about, and I don't know how complicated the process is, but I think
it would ease installation for some users.

=2D-=20
Konrad Meyer <konrad@tylerc.org> http://konrad.sobertillnoon.com/

--nextPart6653061.QBXH3ZSY6e
Content-Type: application/pgp-signature; name=signature.asc
Content-Description: This is a digitally signed message part.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)

iD8DBQBG64q7CHB0oCiR2cwRAo7PAKCd+VEEC1laxOSyE9fgwT Vxty08RgCggAQC
Ud/k87bjXEpRm23L2gRyPyA=
=qXTO
-----END PGP SIGNATURE-----

--nextPart6653061.QBXH3ZSY6e--

  Réponse avec citation
 
Page generated in 0,08090 seconds with 9 queries