Afficher un message
Vieux 06/01/2008, 02h10   #2
s.ross
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: html parser with regex, how to solve?

Don't use the regex. Let hpricot do what it's good at:

$ irb
>> require 'rubygems'
>> require 'hpricot'
>> html = '<h3 class="zmp">149.00</h3>'
>> doc = Hpricot.parse(html)
>> ele = doc.search('h3.zmp')
>> puts ele.text

=> 149.00

In your code, your @content will be searchable the same way. Hpricot
will give you a collection of all h3's with class 'zmp'.

http://code.whytheluckystiff.net/doc/hpricot/

Hope this s.


On Jan 5, 2008, at 4:07 PM, Luiz Vitor Martinez Cardoso wrote:

> Yeah,
>
> I`m trying to develop a simple application using ruby (when this
> works i
> will pass to rails). I need get the source code from a URL, and find
> for
> this string:
>
> <h3 class="zmp">$299.99</h3>
>
> wow, but i need search for not only 149.00, but for all possible
> numbers, my
> friend suggest this:
>
> <h3 class="zmp">*$\d+\.\d{2}.*</h3>
>
> i think this works! but i need other thing... look my code:
>
> #!/usr/bin/ruby
>
> require 'hpricot'
> require 'open-uri'
>
> @content = Hpricot(open("
> http://www.newegg.com/Product/Product.aspx?Item=N82E16855101066"))
>
> now how i can find for <h3 class="zmp">*$\d+\.\d{2}.*</h3> ?
>
> @content.search("<h3 class="zmp">*$\d+\.\d{2}.*</h3>") is broken ;(
>
> how i can solved this?
>
>
> thanks for you attention,
> Luiz Vitor Martinez Cardoso.
>
>
>
> --
> Regards,
> Luiz Vitor Martinez Cardoso [Grabber].
> (11) 8187-8662
>
> rubz.org - engineer student at maua.br



  Réponse avec citation
 
Page generated in 0,05376 seconds with 9 queries