Python web scraper using EC2 and S3, with files rewritten by a text rewriter API
£20-250 GBP
Paid on delivery
Create a Python scraper that scrapes web pages and stores the scraped content on S3 as text files. The files are then rewritten by a text rewriter API.
Key steps are:
- Scrape links from specific websites based on keywords, dates and the number of articles requested
- Create a source list of URLs in a text file from these links
- For each URL in the source list, extract the text content of the web page and output it in text format
- Write these source articles to an S3 bucket folder
Once all articles are retrieved:
- Run each article through a text rewriter API
- Place the rewritten articles into a different S3 bucket and folder
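The first two steps (selecting links and writing the source list) could be sketched roughly as below. This is only an illustration: the `Link` record, the rule of matching keywords against the link title, and the one-URL-per-line file layout are assumptions, not part of the brief.

```python
from dataclasses import dataclass
from datetime import date
from pathlib import Path


@dataclass(frozen=True)
class Link:
    """A candidate article link (hypothetical shape, not defined in the brief)."""
    url: str
    title: str
    published: date


def select_links(candidates, keywords, start, end, limit):
    """Return up to `limit` links, newest first, whose title contains any
    keyword (case-insensitive) and whose date lies in [start, end].
    Duplicate URLs are dropped."""
    wanted = [k.lower() for k in keywords]
    seen = set()
    selected = []
    for link in sorted(candidates, key=lambda l: l.published, reverse=True):
        if len(selected) >= limit:
            break
        if link.url in seen:
            continue
        if not (start <= link.published <= end):
            continue
        if not any(k in link.title.lower() for k in wanted):
            continue
        seen.add(link.url)
        selected.append(link)
    return selected


def write_source_list(links, path):
    """Write the source list as a text file, one URL per line (assumed layout)."""
    Path(path).write_text("".join(f"{l.url}\n" for l in links), encoding="utf-8")


def read_source_list(path):
    """Read the source list back, skipping blank lines."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]
```

Keeping the selection logic separate from the file I/O means the keyword, date and count rules can be checked without touching the network or S3.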
The source article text file will contain:
* The article URL
* The content of the article
The rewritten article text file will contain only:
* The rewritten content of the article
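As a rough illustration of the two file layouts, the sketch below extracts visible text from a page using only the standard library and then formats the source and rewritten files. The function names are hypothetical, and placing the URL on the first line followed by a blank line is an assumed layout; the brief only says the file holds the URL and the content.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of script/style tags."""
    SKIP = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text outside skipped tags, with whitespace collapsed.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(" ".join(data.split()))


def html_to_text(html):
    """Return the page's visible text (including the <title>), one chunk per line."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


def source_article_file(url, content):
    """Source file body: URL, blank line, then content (assumed layout)."""
    return f"{url}\n\n{content}\n"


def rewritten_article_file(rewritten):
    """Rewritten file body: the rewritten content only."""
    return f"{rewritten}\n"
```

A real scraper would likely need site-specific rules to drop navigation and footer text; this sketch keeps everything that is not inside a script or style tag.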
The path format for rewritten articles should be:
s3/folder/[login to view URL]
e.g. s3/20230912/[login to view URL]
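One possible way to build such a path and upload a file is sketched below. The example suggests the folder is a date in YYYYMMDD form. The file name is not visible in the brief, so the hash-based name used here is purely a placeholder assumption, as is reading the leading `s3` as the bucket. The upload uses boto3's `put_object`; credentials are expected to come from the EC2 instance's IAM role.

```python
import hashlib
from datetime import date


def article_key(run_date, url, suffix=".txt"):
    """Build an object key of the form YYYYMMDD/<name>.txt.

    The <name> part is a hypothetical choice (a short hash of the URL);
    the brief does not show the real file name.
    """
    folder = run_date.strftime("%Y%m%d")
    name = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]
    return f"{folder}/{name}{suffix}"


def upload_text(bucket, key, text, client=None):
    """Upload `text` as a UTF-8 object. A client can be injected for testing."""
    if client is None:
        import boto3  # third-party; imported lazily so the helpers above work without it
        client = boto3.client("s3")
    client.put_object(
        Bucket=bucket,
        Key=key,
        Body=text.encode("utf-8"),
        ContentType="text/plain; charset=utf-8",
    )
```

Source and rewritten articles would go through the same helper with different bucket names, for example `upload_text("my-rewritten-bucket", article_key(date.today(), url), text)`, where the bucket name is again hypothetical.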
Required implementation tools:
Use a scraping API such as [login to view URL] to act as a proxy to grab links and articles.
Use an article paraphrase API such as [login to view URL] to rewrite articles.
These have free-trial or low-cost options for development.
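Since the actual services are not named here, the following is only a generic sketch of how the two API calls might look using the standard library. The endpoints, the `api_key` and `url` query parameters, the bearer-token header, and the `"text"` / `"rewritten"` JSON fields are all hypothetical and would need to be replaced with whatever the chosen providers document.

```python
import json
import urllib.parse
import urllib.request

# Hypothetical endpoints; substitute the real services' documented URLs.
SCRAPER_ENDPOINT = "https://scraper.example.com/v1/"
REWRITER_ENDPOINT = "https://rewriter.example.com/v1/rewrite"


def build_scrape_url(target_url, api_key, endpoint=SCRAPER_ENDPOINT):
    """Build the proxy request URL (assumed query-parameter style)."""
    query = urllib.parse.urlencode({"api_key": api_key, "url": target_url})
    return f"{endpoint}?{query}"


def fetch_via_proxy(target_url, api_key, timeout=30):
    """Fetch a page's HTML through the scraping API acting as a proxy."""
    with urllib.request.urlopen(build_scrape_url(target_url, api_key), timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def build_rewrite_request(text, api_key, endpoint=REWRITER_ENDPOINT):
    """Build a JSON POST request for the rewriter (assumed request shape)."""
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )


def rewrite_text(text, api_key, timeout=60):
    """Send text to the rewriter and return the assumed 'rewritten' field."""
    with urllib.request.urlopen(build_rewrite_request(text, api_key), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))["rewritten"]
```

Separating request construction from the network call keeps the provider-specific details in one place, so swapping in a different scraping or paraphrase service should only touch the two `build_*` functions.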
You need expertise in:
Python
EC2
S3
IAM
Linux
You will use your AWS account for development
On conclusion of the project you will supply:
1. Python code for the solution
2. Documentation describing how to implement the solution
3. Demonstration of code working
4. Assistance with any implementation issues
Project ID: #37230224
About the project
Awarded to:
Hi There, I am capable of doing the above-mentioned job effectively and efficiently without any hassle. I have the required expertise and have worked on very similar projects in the past. I am confident that I am the More
58 freelancers are bidding on average £170 for this job
I can make a web scraper as per your requirement. Please initiate a message for further discussion.
I can make you a smart system which will perform the described task in 48 hrs Hey there, i am developer from the UK with over 9 years experience in web development. Upon reading your project description this seems lik More
"Hi, I am an expert in Python, AWS and scraping. Can create features based on your requirement Skills: NodeJs, Python, PHP Please message me for further discussion. Thanks!"
Hi there, ★★★ Scrapping / Python / Selenium Expert ★★★ 10+ Years of Experience ★★★ I've read requirements and ready to scrape web pages, stores scraped files on S3 as text. Some major works we do: ✔️ Product Websites More
Hello, ✵✬✭✮ As senior scraper developer, I can complete your project perfectly!✵✬✭✮⚝ I have much experiences in EC2 field.(i have installed nodejs and python on ec2) I can get this done at short time, because i have a More
Python expert. I can do it. As 9+ years experiences in these field. I can give good quality work. I have read the guidelines of your work.I believe that i can provide you the best quality works you are anticipating fro More
Hi Sir, As a highly skilled and experienced, I am confident that I can provide the high-quality work you need, but i have some doubts regarding the same, lets discuss in detail make it clear & then i am ready to start More
I understand that you are looking for a Python web scraper that scrapes web pages, stores scraped files on S3 as text, and then rewrites the files using an article paraphrase API tool. I am confident that I can help y More
I understand that you are looking for someone to create a Python web scraper that scrapes web pages, stores scraped files on S3 as text, and then rewrites these articles using an article paraphrase api tool. With my ex More
Hi sir. I'm excited about your project and confident in my ability to deliver your project . I'm committed to exceeding your expectations and ready to start from right away . Let's connect and discuss the next steps! T More
Hello there, I am mohamed and I am a highly experienced Python developer with extensive experience in machine learning, AWS, and data scraping technologies. With a deep understanding of Python programming, I have devel More