01. The script should use Scrapy so that it runs fast.
02. I will use this on my Ubuntu PC.
03. The script must include resume logic, so that if it is interrupted it can restart from the point where it was interrupted.
04. The script should save content in .html format with only one <p> </p> element, and all content should go inside that p tag.
05. The script should save each and every scraped link.
06. On rerun, the script should skip files that have already been scraped.
07. Once the script starts, it should begin scraping from the first link.
08. Once the script finds 10 consecutive already-scraped links, it should jump to the last link that is already scraped and
start scraping from the next link.
09. The script should create a folder named 1, and inside it two folders should be created named Question and Answer.
10. Both folders should contain HTML files with the same names; for example, [login to view URL] should be in both folders with the same content.
11. If there are any readable attachments, the script should also save their content into the same HTML file.
12. HTML file names would be [login to view URL], [login to view URL], [login to view URL], etc.
13. If a folder named 1 already exists and contains the Question and Answer folders, a new folder named 2 should be created and
the files should be saved there.
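As a rough illustration of items 04, 09, 10 and 13, here is a minimal sketch in Python of how the output folders and HTML files might be handled. The function names next_run_dir and save_page, the name parameter, and the choice to HTML-escape the text are assumptions made for this example and are not part of the requirements. It also assumes that item 13 extends past folder 2 (3, 4, and so on) if those folders exist as well.

```python
import html
from pathlib import Path


def next_run_dir(base: Path) -> Path:
    """Pick the first numbered folder (1, 2, ...) that does not already
    contain both a Question and an Answer folder, then create them."""
    n = 1
    while (base / str(n) / "Question").is_dir() and (base / str(n) / "Answer").is_dir():
        n += 1
    run_dir = base / str(n)
    for sub in ("Question", "Answer"):
        (run_dir / sub).mkdir(parents=True, exist_ok=True)
    return run_dir


def save_page(run_dir: Path, name: str, text: str) -> None:
    """Write the same content, wrapped in a single <p> element,
    to identically named files in Question and Answer."""
    # Escaping is an assumption: it keeps stray "<" or "&" in the
    # scraped text from opening extra tags inside the one <p>.
    body = "<p>" + html.escape(text) + "</p>"
    for sub in ("Question", "Answer"):
        (run_dir / sub / f"{name}.html").write_text(body, encoding="utf-8")
```

Note that choosing a new numbered folder on every start would work against the skip-on-rerun behaviour in item 06, so a real script would probably need a rule for when to reuse the latest folder instead.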
Link to target:
[login to view URL]
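The resume behaviour in items 03, 05, 06, 07 and 08 could be sketched roughly as follows. This is only one possible reading of the requirements: the log file, the function names load_done, mark_done and links_to_scrape, and the streak parameter are all hypothetical, and the list of links is assumed to be available in a fixed order. Scrapy also has a JOBDIR setting that persists its request queue between runs, which may cover part of item 03; the sketch below does not depend on it.

```python
from pathlib import Path
from typing import Iterator, List, Set


def load_done(log_path: Path) -> Set[str]:
    """Read the links saved by earlier runs (one link per line)."""
    if not log_path.exists():
        return set()
    lines = log_path.read_text(encoding="utf-8").splitlines()
    return {line.strip() for line in lines if line.strip()}


def mark_done(log_path: Path, link: str) -> None:
    """Append a link to the log as soon as its page has been saved."""
    with log_path.open("a", encoding="utf-8") as fh:
        fh.write(link + "\n")


def links_to_scrape(links: List[str], done: Set[str], streak: int = 10) -> Iterator[str]:
    """Walk the links from the first one and yield those still to scrape.

    Already-scraped links are skipped. After `streak` consecutive
    already-scraped links, jump to just past the last scraped link.
    """
    last_done = max((i for i, link in enumerate(links) if link in done), default=-1)
    run = 0  # length of the current run of already-scraped links
    i = 0
    while i < len(links):
        if links[i] in done:
            run += 1
            if run == streak:
                # Everything after last_done is unscraped by definition.
                i = last_done + 1
                continue
        else:
            run = 0
            yield links[i]
        i += 1
```

With this reading, any unscraped links that sit between the 10-link streak and the last scraped link are skipped by the jump, which is what item 08 appears to ask for.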
We have extensive experience in Python coding and can complete this task for you quickly. We are relatively new to this site, but rest assured that we deliver quality work on time!
Hi,
I would like to write this script for you. The task is straightforward. I regularly crawl large amounts of data from online marketplaces, so the script will be fast. I hope to work with you soon.