Need an expert developer to develop a file scraper to scrape firmware files and their info from various vendors' websites

€12-18 EUR / hour

Completed
Posted over 4 years ago

A Python-based CLI script that can download every product's firmware (including all versions) from the web pages of a predefined list of vendors and store the information (metadata) in SQLite [login to view URL] mandatory metadata fields include (Manufacturer, Model, Version, Type, Name, Release Date (if available), Download link, calculated SHA-2 hash of the file), e.g. (Cisco, Video Surveillance 6030 IP Camera, 2.7.0, IP Camera, [login to view URL], 21/08/2015, "link").

There is a non-mandatory binary field that indicates whether the device is discontinued, depending on whether the vendor mentions this on the website. The firmware files themselves will be stored in the file system and referenced by index ID in SQLite. The arguments to the script should be a comma-separated list of vendor names or the location of a text file containing the vendor names. The server where the script will run has no GUI components, so the script should run the browser in headless mode.

Solution Scope

1. A script will be written per vendor. This is required because each vendor website has its own implementation of the firmware download page. However, effort will be made to identify and implement reusable components, if any.
2. The script will only download new firmware that has been added by the vendor. The first execution will therefore download all available firmware, while subsequent runs will download only firmware added since then. This will be achieved by analysing the data available in SQLite and skipping files that have already been downloaded and processed.
3. Each vendor provided will be analysed manually to identify the following, which is required to develop the script:
   a. URL of the firmware download page
   b. Credential requirements (simple signups, specific signups, no signups)
   c. Any CAPTCHA on the page
   d. Any honeypot traps
4. If credentials are required to download the firmware and a simple sign-up is sufficient, the sign-up will be done manually as part of the manual analysis, using a Gmail account dedicated to this work.
5. The script will try to imitate human-like behaviour (to a limit) while scraping the web page and will use Tor, so that any scraper/crawler detection logic on the vendor site can be evaded. This will be achieved by adding random delays and random view times, and by avoiding the honeypot traps found during manual analysis.

Solution Brief

A Python, Selenium and SQLite based solution will be developed with the following features/components:

1. File Management Module: Responsible for storing and managing the downloaded files and metadata. Firmware and installer files will be stored on the filesystem in a structured folder hierarchy. Metadata of the files will be stored in SQLite and will refer to the stored files through their paths on the file system and the file index/name.
2. Vendor Scrapers: A Python Selenium based scraper will be written for each vendor, responsible for downloading the files and grabbing the metadata from the vendor's site. It will use the file management module to store the files and to write the metadata to SQLite.
3. Configuration File: All configuration for the framework (including vendor-specific settings such as credentials, URL, etc.) will be stored in a JSON file that can easily be modified manually.
4. Execution Script: The configuration file can be set up to define the polling interval for each vendor scraper. When the execution script is run, it schedules each vendor script individually according to the polling interval defined in the config.

Deliverable:

1) Python source code, including comments in the code explaining each function and its details. We should be able to give any required input as an argument and execute it as a one-line command in the Linux terminal.
2) Dependencies
3) Manual to install, configure and use the scraper
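To make the storage and "only download new firmware" requirements more concrete, here is a minimal sketch of what the metadata store and the vendor argument handling might look like. It is not taken from the delivered project. It assumes SHA-256 as the SHA-2 variant and uses the download link as the key for recognising files that were already processed. The table name, column names and function names (`firmware`, `parse_vendors`, `sha256_of`, `open_db`, `already_downloaded`, `record_firmware`) are all hypothetical.

```python
import hashlib
import sqlite3
from pathlib import Path

# Hypothetical schema covering the mandatory and optional fields from the brief.
SCHEMA = """
CREATE TABLE IF NOT EXISTS firmware (
    id            INTEGER PRIMARY KEY AUTOINCREMENT,  -- index ID referenced by the file store
    manufacturer  TEXT NOT NULL,
    model         TEXT NOT NULL,
    version       TEXT NOT NULL,
    type          TEXT NOT NULL,
    name          TEXT NOT NULL,
    release_date  TEXT,                               -- optional ("if available")
    download_link TEXT NOT NULL UNIQUE,               -- assumed key for de-duplication
    sha256        TEXT NOT NULL,
    discontinued  INTEGER,                            -- optional: 0, 1, or NULL if unknown
    file_path     TEXT NOT NULL
);
"""


def parse_vendors(arg):
    """Accept 'a,b,c' or the path of a text file listing vendor names."""
    path = Path(arg)
    if path.is_file():
        text = path.read_text(encoding="utf-8")
        names = text.replace(",", "\n").splitlines()
    else:
        names = arg.split(",")
    return [n.strip() for n in names if n.strip()]


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large firmware images are not loaded at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def open_db(db_path):
    """Open (or create) the SQLite database and make sure the table exists."""
    conn = sqlite3.connect(db_path)
    conn.execute(SCHEMA)
    return conn


def already_downloaded(conn, download_link):
    """Return True if a row for this download link is already stored."""
    row = conn.execute(
        "SELECT 1 FROM firmware WHERE download_link = ?", (download_link,)
    ).fetchone()
    return row is not None


def record_firmware(conn, meta, file_path):
    """Insert one metadata row for a downloaded file and return its index ID."""
    cur = conn.execute(
        "INSERT INTO firmware (manufacturer, model, version, type, name,"
        " release_date, download_link, sha256, discontinued, file_path)"
        " VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
        (
            meta["manufacturer"], meta["model"], meta["version"],
            meta["type"], meta["name"], meta.get("release_date"),
            meta["download_link"], sha256_of(file_path),
            meta.get("discontinued"), str(file_path),
        ),
    )
    conn.commit()
    return cur.lastrowid
```

A vendor scraper would call `already_downloaded` before fetching each link and `record_firmware` after saving the file, so a second run skips everything stored by the first. Keying on the download link is only one option; a vendor that reuses URLs across versions would need a different key, such as manufacturer, model and version together.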
Project ID: 23139167

About the project

14 proposals
Remote project
Active 4 yrs ago

Awarded to:
Hi, I'm a Python developer from Athens, Greece, experienced in web scraping and all the related libraries. I have completed similar projects in the past and can provide evidence in chat. I am interested in your project, capable of accurate delivery, and waiting for your call. Thanks, Pandelis
€14 EUR in 20 days
4.9 (18 reviews)
5.3
14 freelancers are bidding on average €16 EUR/hour for this job
Hi there, I am a web scraping expert from Bosnia & Herzegovina, Europe. I have carefully gone through your requirements and I would like to help you with this project! I can start immediately and finish it within the agreed deadline. Check out my profile and former clients' feedback; that will let you know everything about me. Please feel free to contact me so that we can discuss further details. Thank you for taking the time to read my proposal. I am looking forward to hearing from you. Best regards, Miljan
€20 EUR in 40 days
4.9 (170 reviews)
7.9
Hi. Sure, we can create such a script. Is all this data available for all vendors? Do you provide links to all vendors, or do we need to find them? Please provide some vendors and links and I will run some tests.
€20 EUR in 40 days
5.0 (138 reviews)
7.7
Dear Sir, Our team has extensive experience in Python, Linux, web scraping, SQLite and Selenium, so we can successfully complete this project. Having the required skills, we will be glad to help you. We have 20 years of strong experience in outsourced web development, software development and managed services, and we provide 24/7 technical support. We also guarantee Canadian quality. Please review our profile to find detailed information about our company and our portfolio. Feel free to contact us to discuss all details. P.S. As we see from the details of your profile, you are located in Europe. Please note that our technical offices are located in Ukraine, i.e. you will be able to work with our developers in almost the same time zone. Best regards, Sales department, Tangram Canada Inc.
€15 EUR in 5 days
4.9 (99 reviews)
7.4
Hello, I can help you with your project: Need an expert developer to develop a file scraper to scrape firmware files and their info from various vendors websites. I have gone through your job posting and am very interested in working with you. I am an expert in this field and have already completed several projects like this. For evidence you can see my profile. Please visit: https://www.freelancer.com/u/schoudhary1553 I have an excellent command of English. I am a hard worker, productive and worthy of your attention. I hope I would be the right candidate for this post. Awaiting an affirmative response from you. Kind regards, Sandeep
€18 EUR in 40 days
5.0 (87 reviews)
7.1
Yes, I have read your job description. I will help you develop a Python-based CLI script that can download all products' firmware, including all versions, from web pages. I am a professional Python expert with 8+ years of rich experience in this domain. I am familiar with the following Python frameworks and libraries: Django, TensorFlow and related libraries such as webpack-bundle-tracker and django-webpack-loader, as well as NPM, pandas, Scrapy, Beautiful Soup and Selenium. Please let me know a convenient time to discuss the scope of your project. Warm regards, Mithu
€12 EUR in 40 days
5.0 (4 reviews)
3.5
Hi, I'd be happy to help you with this. I have experience in Python, scraping and Selenium. I'm also experienced in networking and information security, so I know the likely security features and workarounds and can search for creative solutions. Finally, I'm a very experienced software developer and can write a clean, well-architected application for you so it can grow with your business. Looking forward to hearing from you.
€15 EUR in 40 days
5.0 (2 reviews)
2.9
**Ensure Affordable Price and Quality Work** Dear Client, our response to each deliverable: 1) Python source code including comments in the code explaining each function and its details, with any required input given as an argument and executed as a one-line command in the Linux terminal: you will certainly get clean code. 2) Dependencies: can you please elaborate on this deliverable? 3) Manual to install, configure and use the scraper: you will definitely be provided with a manual. We are professionals in web scraping and can provide you with the best results in a short period of time. We have a team of experienced full-stack Python developers. To keep it short, we can have a detailed discussion over chat, where we will give you the exact time estimate and our steps for executing the tasks. Looking forward to your valuable response. Thanks, ExiverP
€13 EUR in 40 days
5.0 (1 review)
2.2
Hello, I have thoroughly read the features/components of the script that you want to develop, and I can develop a file scraper to scrape firmware files and their info from various vendors' websites as per the given requirements. I would also like to inform you that I am highly skilled with Python's Flask, Django, Quart and Scrapy frameworks, as I have more than 5 years of experience with them. I am available 40+ hours a week and will be available for work at any time. I can work for 3-4 hours during GMT time. Let me know when you are available to take this conversation to the next level. Waiting for your valuable response. Thanks, Sanjeev
€15 EUR in 40 days
0.0 (0 reviews)
0.0
I have a lot of experience in creating and deploying Python/Selenium scripts (that run on a headless browser) on local machines or AWS servers.
€30 EUR in 20 days
0.0 (0 reviews)
0.0
Hi, I have more than 4 years of experience scraping data from sites using Python. I have built many automated scrapers which use proxy servers to bypass web blocks. I can also implement multiprocessing to scrape data faster. Let me know the URL of the site you want to scrape and I will provide you with demo data within a few hours. Feel free to contact me with any further queries. Regards, Lovejeet
€12 EUR in 30 days
0.0 (0 reviews)
0.0

About the client

Brussels, Belgium
4.9
5
Payment method verified
Member since Sep 20, 2019

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)