
Content Scraper - Aggregator

$100-500 USD

Cancelled
Posted about 15 years ago

Paid on delivery
We are a team of engineers specialized in SEO consulting. The goal is to **crawl** blogs and forums and **save** their content into a database.

## Deliverables

## Going to the data

A list of blogs will be given. Each blog must be crawled through its archives, and every article retrieved. Sometimes a search results page will be given instead; in that case, open each result and treat it as a new blog to crawl.

A list of forums (mostly phpBB) will be given, with a login and password. You will need to fetch the topics and convert them into articles: the first post in a topic is the "content", and the other posts are the "comments".

A list of Usenet newsgroups will be given. You will retrieve their messages through Google Groups or a news system (NNRP access). Each first post is an article; its replies are comments.

In all cases, an article shorter than X characters won't be downloaded.

## Getting data

Each article in a blog will be a new row in the main table. You may also create other tables as needed. If you think a table with one row per blog would be useful, create it.

Main table fields:

* id: primary key, auto-increment
* title
* content (the whole article, not the whole web page)
* release date of the article
* source URL (also used as a UNIQUE key, so that if we crawl the site again, the same article is not saved twice)
* tags, if any (possibly in a separate table)
* categories, if any (possibly in a separate table)
* user comments (in a separate table): nickname, date, content
* images, if the article has any. They will be stored in a directory named after the id field. Images can be fetched with system("wget ...")

## Technical

The table name, the class(es) to use, and some downloading functions are predefined or will be changed. Each blog or blog platform will obviously need to be parsed differently. You can store the parsing rules in a table or in the code, as you wish.
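To make the storage rules above concrete, here is a minimal sketch of the main table, the comments table, the UNIQUE source URL check, and the minimum-length filter. It is only an illustration under stated assumptions: the posting does not name a language or database, so Python with SQLite is assumed, and the names `articles`, `comments`, `save_article`, `save_topic`, and the value of `MIN_LENGTH` (standing in for "X characters") are all hypothetical. Tags, categories, and image downloads are left out.

```python
import sqlite3

# Hypothetical stand-in for the unspecified "X characters" threshold.
MIN_LENGTH = 200

# Assumed table and column names; the posting says the real names are predefined.
SCHEMA = """
CREATE TABLE IF NOT EXISTS articles (
    id          INTEGER PRIMARY KEY AUTOINCREMENT,
    title       TEXT NOT NULL,
    content     TEXT NOT NULL,          -- whole article, not whole web page
    released_at TEXT,
    source_url  TEXT NOT NULL UNIQUE    -- prevents saving an article twice
);
CREATE TABLE IF NOT EXISTS comments (
    id         INTEGER PRIMARY KEY AUTOINCREMENT,
    article_id INTEGER NOT NULL REFERENCES articles(id),
    nickname   TEXT,
    posted_at  TEXT,
    content    TEXT NOT NULL
);
"""


def open_db(path=":memory:"):
    """Open (or create) the database and make sure both tables exist."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def save_article(conn, title, content, released_at, source_url, comments=()):
    """Store one article and its comments.

    Returns the new article id, or None if the article is too short
    or its source URL was already stored by an earlier crawl.
    """
    if len(content) < MIN_LENGTH:
        return None
    cur = conn.execute(
        "INSERT OR IGNORE INTO articles (title, content, released_at, source_url) "
        "VALUES (?, ?, ?, ?)",
        (title, content, released_at, source_url),
    )
    if cur.rowcount == 0:  # UNIQUE(source_url) rejected the row
        return None
    article_id = cur.lastrowid
    conn.executemany(
        "INSERT INTO comments (article_id, nickname, posted_at, content) "
        "VALUES (?, ?, ?, ?)",
        [(article_id, c["nickname"], c["date"], c["content"]) for c in comments],
    )
    conn.commit()
    return article_id


def save_topic(conn, topic_title, topic_url, posts):
    """Map a forum topic or newsgroup thread onto an article:
    the first post is the content, the remaining posts are comments."""
    if not posts:
        return None
    first, replies = posts[0], posts[1:]
    return save_article(
        conn, topic_title, first["content"], first["date"], topic_url, replies
    )
```

With this shape, crawling the same site again is harmless: calling `save_article` a second time with an already stored `source_url` inserts nothing and returns `None`.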
Project ID: 3777884

About the project

7 proposals
Remote project
Active 15 yrs ago

7 freelancers are bidding on average $288 USD for this job
$382.50 USD in 20 days | 4.9 (367 reviews) | 7.0
$425 USD in 20 days | 4.9 (91 reviews) | 6.1
$289 USD in 20 days | 4.9 (28 reviews) | 4.8
$408 USD in 20 days | 4.7 (25 reviews) | 4.5
$255 USD in 20 days | 4.9 (4 reviews) | 3.1
$170 USD in 20 days | 0.0 (3 reviews) | 0.0
$85 USD in 20 days | 0.0 (0 reviews) | 0.0

About the client

SOFIA, Bulgaria
5.0
5
Member since Jan 17, 2008

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)