nested scraping

Subscribe to nested scraping 1 post, 1 voice

 
Avatar internets 2 posts

I have this data I can scrape by class name. I have it nested within div’s by state;

I want to have a list something like: [code]

<state_list> for each <state> Alaska for each store within the state: <store> <store_name></storename> <store_address></storeaddress> <store_phone></storephone> </store> </state> <state> next state. <store_name> etc</storename> </state </state_list>

[/code]

here is my code to scrape, I can scrape it all in a list, I’m just not clear on where to stick the do end’s, I think. [code] fetch ‘path_to_file.html’ [/code]

state "/html/body/div[@class='coop_list_state']/p[@class='state']/a[1]" 
   coop "/html/body/div[@class='coop_list_state']/p[@class='coop_name']" 
     address "/html/body/div[@class='coop_list_state']/p[@class='street_address']" 
     city_and_state "/html/body/div[@class='coop_list_state']/p[@class='city_state']" 
     phone "/html/body/div[@class='coop_list_state']/p[@class='phone']" 
     fax "/html/body/div[@class='coop_list_state']/p[@class='coop_fax']" 
     email "/html/body/div[@class='coop_list_state']/p[@class='email']" 
     website "/html/body/div[@class='coop_list_state']/p[@class='coop_website']"