![]() ![]() Now, let's see how to set up the explicit task for capturing the data we want. So to make the extraction process more efficient, we should gather the URLs of these feature topics. In this tutorial, I will take as an example to show you how to save a list of given URLs and then extract detailed data from those web pages.Īssuming that we would like to capture tutorials from several features as below, saying AJAX, Export, Pagination. URL Loop List proves very useful when we prefer to run the crawler with a given list of URLs, either for a more fast and efficient data scraping process or we just happen to need data from some specific web pages. URL Loop List can be applied into web pages with similar content layout, since these web pages can share the same extraction process, like data fields, data format and etc. Then, we can extract data from the web page. Each time Octoparse clicks an URL in the loop list, we will visit a new web page by fetching the URL. URL loop list refers to crawling web pages by a given list of URLs. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |