Compile frequency of keywords on all articles of certain websites

$30-250 CAD

Closed
Posted over 8 years ago

Paid on delivery
We are looking for a web scraping script that searches through all articles on a number of websites for certain keywords and outputs the article’s entire contents, the frequency of the keywords, and other metadata. We are also looking for a script that compiles the 50 most frequent words in those articles by month. More specific details are provided below.

Output
• CSV with one row for each article and columns for the following features (see “article summary” tab in attachment for template):
  o Date
  o Website
  o Article title
  o URL
  o Location of website headquarters
  o Article contents
  o Frequency of keyword 1 in article body
  o Presence of keyword 1 in article title (true/false or 1/0)
  o Repeat frequency and presence measures for other keywords
• CSV with one row for each of the top 50 most frequent words and columns for the following features (see “top 50 monthly” tab in attachment for template):
  o Date (month-year)
  o Keyword
  o Frequency
• Web scraping script(s) in Python or R that produced the output

Scope
• Articles containing at least one of the keywords below
• Articles from 2014 to 2015 inclusive
• For the “top 50” exercise, please exclude all common words, as listed on the website [login to view URL]

Keywords
o Ad/advertising/advertiser
o Ad tech/advertising tech/ad technology/advertising technology
o Ad exchange
o Ad network
o Web publisher/online publisher/digital publisher
o Real time bidding/real-time bidding/RTB/real time auction/real-time auction
o Demand side platform/demand-side platform/DSP
o Supply side platform/supply-side platform/SSP
o Data management platform/data-management platform/DMP
o Programmatic
o Programmatic advertising
o Programmatic direct
o Programmatic guaranteed
o Programmatic reserved
o Programmatic premium
o Programmatic non-reserved
o Programmatic buying
o Programmatic selling
o Programmatic real-time bidding/programmatic real time bidding/programmatic RTB
o Preferred deal
o Targeting
o Geotargeting
o Behavioral targeting
o Retargeting/re-targeting
o Cross-device tracking/cross device tracking/cross-device targeting/cross device targeting
o Revenue optimization
o Monetize/monetizing/monetization
o Native advertising/native ad/native advertisement
o Mobile advertising/mobile ad/mobile advertisement
o Sponsored content/sponsored post
o Branded content
o Content recommendation
o Advertorial
o Viewability
o In-feed/in feed
o In-stream/in stream
o Direct sale
o Banner ad/banner advertising/banner advertisement
o Display ad/display advertising/display advertisement
o Malvertising/malicious advertising/malicious ad/malware
o Ad fraud/advertising fraud/impression fraud/click fraud
o Ad-blocking/ad blocking/ad-blocker/ad blocker
o Adblock
o Bot traffic/non-human traffic/non human traffic
o Ad stacking
o Whitelist/white list/whitelisting
o Blacklist/black list/blacklisting
o Rich media
o Waterfall/waterfalling
o Tag management
o In-app advertising/in app advertising
o Audiences
o Measurement
o Digital advertising/digital ad/digital advertisement
o Big data
o Third-party data/third party data
o Click-thru rate/click thru rate/click-through rate/click through rate/click rate/CTR
o Cost-per-action/cost per action/CPA
o Cost-per-click/cost per click/CPC
o Cost-per-install/cost per install/CPI
o Cost-per-thousand/cost per thousand/cost-per-mille/cost per mille/CPM
o Effective cost-per-thousand/effective cost per thousand/effective cost-per-mille/effective cost per mille/eCPM
o Demand fill
o Private marketplace/private exchange
o Privacy

Websites
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
• [login to view URL]
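As a rough illustration of the two outputs requested above, the counting logic could be sketched in Python along these lines. This is only a sketch under stated assumptions, not the script that was delivered: the function names (`summarize_article`, `top_words_by_month`), the three sample keyword groups, the tiny stop-word set (a stand-in for the hidden common-words list), and the ISO `YYYY-MM-DD` date format are all hypothetical, and fetching the articles from the websites is left out entirely.

```python
import re
from collections import Counter, defaultdict

# Hypothetical subset of the keyword groups; each group lists its spelling variants.
KEYWORD_GROUPS = {
    "real time bidding": ["real time bidding", "real-time bidding", "RTB",
                          "real time auction", "real-time auction"],
    "ad exchange": ["ad exchange"],
    "programmatic": ["programmatic"],
}

# Placeholder stop words; the posting refers to an external list whose URL is hidden.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "for", "on"}


def build_pattern(variants):
    """Compile one case-insensitive regex matching any variant as a whole term."""
    # Longest variants first, so the alternation prefers the longer phrase.
    parts = sorted((re.escape(v) for v in variants), key=len, reverse=True)
    return re.compile(r"(?<!\w)(?:" + "|".join(parts) + r")(?!\w)", re.IGNORECASE)


PATTERNS = {name: build_pattern(v) for name, v in KEYWORD_GROUPS.items()}


def summarize_article(title, body):
    """Return body frequency and title presence (1/0) for every keyword group."""
    row = {}
    for name, pattern in PATTERNS.items():
        row[f"{name} (body frequency)"] = len(pattern.findall(body))
        row[f"{name} (in title)"] = int(bool(pattern.search(title)))
    return row


def top_words_by_month(articles, n=50):
    """Return {'YYYY-MM': [(word, count), ...]} with the n most frequent words."""
    counts = defaultdict(Counter)
    for article in articles:
        month = article["date"][:7]  # assumes ISO dates such as '2014-03-01'
        words = re.findall(r"[a-z']+", article["body"].lower())
        counts[month].update(w for w in words if w not in STOP_WORDS)
    return {month: counter.most_common(n) for month, counter in counts.items()}
```

Each dictionary returned by `summarize_article` could be merged with the article's date, website, title, URL, and contents and written out with `csv.DictWriter` to form one row of the “article summary” file; the per-month lists from `top_words_by_month` map onto the “top 50 monthly” file in the same way.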
Project ID: 8343646

About the project

8 proposals
Remote project
Active 9 yrs ago

8 freelancers are bidding on average $940 CAD for this job
Hi, this would be quite an interesting project for me to build. It will consist of two parts: the analysis scripts and the actual scraper. For the scraper to work efficiently on those sites, you will need proxies; otherwise they will ban your server's IP for making too many automated requests. In any case, proxy costs will be several times higher than the cost of the script itself, so you may want to look for third-party proxy providers or let me handle data retrieval (my prices are usually cheaper than third parties').
$252 CAD in 3 days
5.0 (47 reviews)
7.3
Dear Sir, I'm delighted to let you know that I have done data scraping from many sites with PHP-cURL, Node.js, and Selenium. I scrape the data from the website and then write it to a MySQL database or an Excel, CSV, or XML file. I have worked on many similar projects and have extensive experience in data mining. I have written hundreds of web scrapers which scrape millions of pages each day. I'm ready to fulfill your requirements. I can finish this task in a short time, with the best quality. I can assure 100% accuracy. Please give me the opportunity to do the work. With Kind Regards, Debdulal Roy Proshanta
$305 CAD in 3 days
4.9 (35 reviews)
6.1
Hello, I am sajidmahmud. I have seen your project and can do it to your satisfaction. I have 3 years of experience in all kinds of data processing work: Web Scraping, Web Search, Excel, SEO, Internet Marketing, Social Media, Leads, Telemarketing, and Photoshop. I would like to assure you that I will do the job according to your expectations. I have a highly creative data entry studio. Just give me your job and see for yourself. Close communication and ON-TIME delivery ensure a long-term relationship with our valued clients. We believe goodwill in business is worth more than anything else. I provide drafts and unlimited revisions until you are satisfied. I do thorough research on every project to get the best result as well as optimum client satisfaction. Thanks
$200 CAD in 5 days
4.8 (105 reviews)
5.3
Hello Sir/Madam, I am ready to start your job as per your requirements. Thanks for this great opportunity. If you like, we can discuss further; I am awaiting your positive response. I have been working in the field of Data Entry, Excel, Web Search, Web Scraping, Leads, Data Processing, etc. for the last 4 years. I am a good fit for your project, as I am always sincere with my clients and my work. I aim to bring a modern but experienced approach to your projects. Allow me to provide you good services based on my experience and bring your vision to life. Looking forward to hearing from you. I will prove my efficiency, reliability, and ON-TIME WORK DELIVERY with 100% accuracy. Thanks
$155 CAD in 3 days
4.4 (38 reviews)
5.2
Hello. I am interested in your project. I have a lot of experience in web scraping, and I have developed software similar to this program before. Please check my portfolio and work history. If you hire me, you can expect good results at a fair price. Best regards. Yknox.
$412 CAD in 3 days
5.0 (16 reviews)
4.5
Hi, I'm Juan. I'm an aerospace engineering student who's very experienced with data scraping and Python scripting. I would need to know which pages you want to get the data from. I'm looking forward to working with you. Best regards, Juan
$222 CAD in 6 days
5.0 (3 reviews)
2.9
I CAN DO WELL AND LOW BUDGET
$30 CAD in 1 day
0.0 (0 reviews)
0.0
Hello, I will design a Perl script to emulate a browser accessing these sites and parse the returned HTML for article text for analysis. Since each site has its own interface, a custom parser must be designed for each one. If you want to work closer to your budget, then I recommend that we choose a single source as proof of concept. My proposal is to deliver the script. It shall be your responsibility to run the script to access articles and process the article text. My proposal assumes that the site produces HTML output. My script(s) shall not be responsible for changes to the interface. These sites will likely limit the number of articles you may access. I will not be responsible should your access become blocked. A milestone payment for the full budget for your project must be deposited with this site before your offer can be accepted. Alan Idler Chief Software Architect Idleswell Software Creations
$5,694 CAD in 60 days
0.0 (0 reviews)
0.0
Hello, I am a PHP programmer at the NetCreate company. I have done a lot of scraping projects like yours. If you are interested in my offer, please contact me. Regards.
$200 CAD in 10 days
0.0 (0 reviews)
0.0
The proposal has not been provided yet
$277 CAD in 3 days
0.0 (0 reviews)
0.0

About the client

Canada
0.0
0
Member since Aug 26, 2015

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)