PHWinfo banniere

Titres
PORTAIL ANNUAIRE ARTICLES COMPARATEUR HÉBERGEURS DEVIS FORUMS RÉDUCTEUR D'URL
Précédent   PHWinfo > Forums Hébergement > Forum Serveur - Sécurité et techniques > comp.unix.shell > Re: Text database?
S'inscrire FAQ Membres Recherche Messages du jour Marquer les forums comme lus
comp.unix.shell Using and programming the Unix shell.

Re: Text database?

Réponse
 
LinkBack Outils de la discussion
Vieux 18/03/2008, 16h48   #1
Edward Rosten
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: Text database?

On Mar 17, 1:54 am, saneman <asdf...@asd.com> wrote:
> I am trying to implement a text recognition module. But I need some
> character to train the algorithms with. Does anyone know of a free
> online database that contains characters?


You'd probably be better off asking on sci.image.processing, where you
were posting in the first place. That said, this is a reasonable place
for the following point:

You are presumable after a database of images of characters. You could
synthesize one by rasterizing a number of fonts (automatically) and
then adding various kinds of noise or various distortions.

I have a program for generating rasterizaton from here:

http://linuxfromscratch.org/pipermai...ry/004748.html

look at links-2.1pre32-italic.patch.gz

You can run this patch on an empty directory, to extract the relavent
files.

To add distortions, you may wish to experiment with pnmscale,
pnmrotate, pgmnoise and pnmshear to add distortions. To be honest,
comp.unix.shell is also a good place for this kind of commandline
stuff, so I've cross posted there as well. Maybe some imagemagick
expert can weigh in on adding errors automatically.

-Ed
--
(You can't go wrong with psycho-rats.)(http://mi.eng.cam.ac.uk/~er258)

/d{def}def/f{/Times s selectfont}d/s{11}d/r{roll}d f 2/m{moveto}d -1
r 230 350 m 0 1 179{ 1 index show 88 rotate 4 mul 0 rmoveto}for/s 12
d f pop 235 420 translate 0 0 moveto 1 2 scale show showpage

  Réponse avec citation
Vieux 20/03/2008, 21h20   #2
saneman
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: Text database?

Edward Rosten wrote:
> On Mar 17, 1:54 am, saneman <asdf...@asd.com> wrote:
>> I am trying to implement a text recognition module. But I need some
>> character to train the algorithms with. Does anyone know of a free
>> online database that contains characters?

>
> You'd probably be better off asking on sci.image.processing, where you
> were posting in the first place. That said, this is a reasonable place
> for the following point:
>
> You are presumable after a database of images of characters. You could
> synthesize one by rasterizing a number of fonts (automatically) and
> then adding various kinds of noise or various distortions.
>
> I have a program for generating rasterizaton from here:
>
> http://linuxfromscratch.org/pipermai...ry/004748.html
>
> look at links-2.1pre32-italic.patch.gz
>
> You can run this patch on an empty directory, to extract the relavent
> files.
>
> To add distortions, you may wish to experiment with pnmscale,
> pnmrotate, pgmnoise and pnmshear to add distortions. To be honest,
> comp.unix.shell is also a good place for this kind of commandline
> stuff, so I've cross posted there as well. Maybe some imagemagick
> expert can weigh in on adding errors automatically.
>
> -Ed
> --
> (You can't go wrong with psycho-rats.)(http://mi.eng.cam.ac.uk/~er258)
>
> /d{def}def/f{/Times s selectfont}d/s{11}d/r{roll}d f 2/m{moveto}d -1
> r 230 350 m 0 1 179{ 1 index show 88 rotate 4 mul 0 rmoveto}for/s 12
> d f pop 235 420 translate 0 0 moveto 1 2 scale show showpage
>


This here was just what I needed:

http://yann.lecun.com/exdb/mnist/

which is also used on the below pages:

http://www.bcl.hamilton.ie/~barak/te...hw1/index.html
http://www.iro.umontreal.ca/~lisa/tw...nistVariations
http://www.int.tu-darmstadt.de/mlu/index.html
  Réponse avec citation
Réponse


Outils de la discussion

Règles de messages
Vous ne pouvez pas créer de nouvelles discussions
Vous ne pouvez pas envoyer des réponses
Vous ne pouvez pas envoyer des pièces jointes
Vous ne pouvez pas modifier vos messages

Les balises BB sont activées : oui
Les smileys sont activés : oui
La balise [IMG] est activée : oui
Le code HTML peut être employé : non
Trackbacks are oui
Pingbacks are oui
Refbacks are oui


Fuseau horaire GMT +1. Il est actuellement 02h40.


Édité par : vBulletin® version 3.7.3
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.2.0 RC5 Tous droits réservés.
Version française #16 par l'association vBulletin francophone
PHWinfo est un site Éducation Sans Frontières
Ad Management by RedTyger
©Tous droits réservés par les parties respectives
Page generated in 0,08986 seconds with 10 queries