#1
Hi,
I'm getting funky characters when parsing HTML with Hpricot. How can I remove them?

Does anyone have a fix or workaround for this?

Thanks in advance,
Suja
-- Posted via http://www.ruby-forum.com/.
#2
Hi Suja,
two suggestions:
- check the encoding used by the page you're hashpricoting (doh, think I just invented a verb, or what).
- run puts $KCODE to see whether you're running in Unicode or not.

If you are hashpricoting a page encoded in UTF-8 but $KCODE is set to NONE (or if the page is in Latin-1 but $KCODE is set to U), then you'll have to convert the text to the other encoding, using iconv for instance.

cheers
Thibaut
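To make the conversion step concrete, here is a minimal sketch. Note that $KCODE and iconv belong to Ruby 1.8; on Ruby 1.9 and later $KCODE no longer has any effect and Iconv was dropped from the standard library in 2.0, so this sketch swaps in String#encode instead. The sample bytes, the helper name to_utf8 and the choice of Windows-1252 as the source encoding are all assumptions for illustration, not something confirmed about the page in question.

```ruby
# Hypothetical sample: bytes as they might arrive from a page that is
# actually Windows-1252, where byte 0x97 is a long dash.
raw = "CHAMPAIGN \x97 Effective today".b

# Hypothetical helper: label the bytes with their real encoding,
# then transcode them to UTF-8.
def to_utf8(bytes, source_encoding)
  bytes.dup.force_encoding(source_encoding).encode("UTF-8")
end

# Treating the same bytes as UTF-8 yields an invalid string, which is
# what typically shows up as a "funky character".
misread = raw.dup.force_encoding("UTF-8")
puts misread.valid_encoding?

fixed = to_utf8(raw, "Windows-1252")
puts fixed.valid_encoding?
puts fixed
```

If the page declares its encoding in a Content-Type header or meta tag, that value would be the natural thing to pass as source_encoding.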
#3
Suja JS wrote:
> Hi,
> I'm getting funky characters, when parsing html using Hpricot.
> How to remove this funky character?
>
> Anyone have a fix / workaround for this?
>
> thanks in advance,
> Suja

Could you describe these 'funky characters'?
#4
Lee Jarvis wrote:
> Suja JS wrote:
>> Hi,
>> I'm getting funky characters, when parsing html using Hpricot.
>> How to remove this funky character?
>>
>> Anyone have a fix / workaround for this?
>>
>> thanks in advance,
>> Suja
>
> Could you describe these 'funky characters'?

Like '�' in this text:

"By Mike Monson CHAMPAIGN � Effective today the city of Champaign is closing three bridges and posting load limits on three others."
#5
> "By Mike Monson CHAMPAIGN ? Effective today the city of Champaign is
> closing three bridges and posting load limits on three others."

Hint, hint: http://www.news-gazette.com/news/loc...s_limits_loads

The minus character you see after CHAMPAIGN is not a regular "-".
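One way to act on that hint, sketched for Ruby 1.9 or later and assuming the text has already been decoded to valid UTF-8: list the non-ASCII characters by code point, then swap typographic dashes for a plain "-". The sample string and the guess that the character is U+2014 (or U+2013) are assumptions; which dash the page really uses is not stated in this thread.

```ruby
# Hypothetical sample text; U+2014 stands in for the dash after CHAMPAIGN.
text = "CHAMPAIGN \u2014 Effective today"

# Collect every character outside the ASCII range and print its code point.
non_ascii = text.each_char.reject(&:ascii_only?)
non_ascii.each { |c| puts format("U+%04X", c.ord) }

# Map both the en dash (U+2013) and the long dash (U+2014) to a regular "-".
cleaned = text.tr("\u2013\u2014", "-")
puts cleaned
```

Printing the code points first shows which character is actually present before anything is replaced.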