|
|
|
|
||||||
| comp.unix.shell Using and programming the Unix shell. |
![]() |
|
|
LinkBack | Outils de la discussion |
|
|
#1 |
|
Messages: n/a
Hébergeur: |
On Mar 17, 1:54 am, saneman <asdf...@asd.com> wrote:
> I am trying to implement a text recognition module. But I need some > character to train the algorithms with. Does anyone know of a free > online database that contains characters? You'd probably be better off asking on sci.image.processing, where you were posting in the first place. That said, this is a reasonable place for the following point: You are presumable after a database of images of characters. You could synthesize one by rasterizing a number of fonts (automatically) and then adding various kinds of noise or various distortions. I have a program for generating rasterizaton from here: http://linuxfromscratch.org/pipermai...ry/004748.html look at links-2.1pre32-italic.patch.gz You can run this patch on an empty directory, to extract the relavent files. To add distortions, you may wish to experiment with pnmscale, pnmrotate, pgmnoise and pnmshear to add distortions. To be honest, comp.unix.shell is also a good place for this kind of commandline stuff, so I've cross posted there as well. Maybe some imagemagick expert can weigh in on adding errors automatically. -Ed -- (You can't go wrong with psycho-rats.)(http://mi.eng.cam.ac.uk/~er258) /d{def}def/f{/Times s selectfont}d/s{11}d/r{roll}d f 2/m{moveto}d -1 r 230 350 m 0 1 179{ 1 index show 88 rotate 4 mul 0 rmoveto}for/s 12 d f pop 235 420 translate 0 0 moveto 1 2 scale show showpage |
|
|
|
#2 |
|
Messages: n/a
Hébergeur: |
Edward Rosten wrote:
> On Mar 17, 1:54 am, saneman <asdf...@asd.com> wrote: >> I am trying to implement a text recognition module. But I need some >> character to train the algorithms with. Does anyone know of a free >> online database that contains characters? > > You'd probably be better off asking on sci.image.processing, where you > were posting in the first place. That said, this is a reasonable place > for the following point: > > You are presumable after a database of images of characters. You could > synthesize one by rasterizing a number of fonts (automatically) and > then adding various kinds of noise or various distortions. > > I have a program for generating rasterizaton from here: > > http://linuxfromscratch.org/pipermai...ry/004748.html > > look at links-2.1pre32-italic.patch.gz > > You can run this patch on an empty directory, to extract the relavent > files. > > To add distortions, you may wish to experiment with pnmscale, > pnmrotate, pgmnoise and pnmshear to add distortions. To be honest, > comp.unix.shell is also a good place for this kind of commandline > stuff, so I've cross posted there as well. Maybe some imagemagick > expert can weigh in on adding errors automatically. > > -Ed > -- > (You can't go wrong with psycho-rats.)(http://mi.eng.cam.ac.uk/~er258) > > /d{def}def/f{/Times s selectfont}d/s{11}d/r{roll}d f 2/m{moveto}d -1 > r 230 350 m 0 1 179{ 1 index show 88 rotate 4 mul 0 rmoveto}for/s 12 > d f pop 235 420 translate 0 0 moveto 1 2 scale show showpage > This here was just what I needed: http://yann.lecun.com/exdb/mnist/ which is also used on the below pages: http://www.bcl.hamilton.ie/~barak/te...hw1/index.html http://www.iro.umontreal.ca/~lisa/tw...nistVariations http://www.int.tu-darmstadt.de/mlu/index.html |
|
![]() |
| Outils de la discussion | |
|
|