Create algorithm to extract data from the web and deliver as a CSV file, according to a template
R$90-750 BRL
Completed
Posted over 7 years ago
Paid on delivery
Create an algorithm that:
Stage 1. Access a form on the web;
Stage 2. For each combination of values in three of the form's listboxes, select those values and press the "Display" button ("Mostra", in Portuguese);
Stage 3. Wait for the results page to load;
Stage 4. After the page is loaded, press the "Copy as CSV" button ("Copia como CSV", in Portuguese);
Stage 5. After all CSV files are saved (one for each combination of listbox values specified below), create a single CSV file that combines all the data, according to the attached Excel template.
The details of each stage are given below:
******************************************************
Stage 1.
Access a form in the following website:
[login to view URL]
******************************************************
Stage 2.
For each combination of values in three of the form's listboxes, select those values and press the "Display" button ("Mostra", in Portuguese).
How to select the values:
- Choose one value of the listbox "Line" ("Linha", in Portuguese)
- Choose "Birth Month" ("Mês do Nascimento", in Portuguese) in the listbox "Column" ("Coluna", in Portuguese): always keep this fixed
- Choose "Mother's residence" ("Nasc. resid da mãe", in Portuguese) in the listbox "Content" ("Conteúdo", in Portuguese): always keep this fixed
- Choose one value of the listbox "Available Periods" ("Períodos Disponíveis", in Portuguese)
- Choose one value of the listbox "Municipality" ("Município", in Portuguese) in "Available selections" ("Seleções Disponíveis", in Portuguese)
Important: Always keep the checkbox "Show zeroed lines" ("Exibir linhas zeradas", in Portuguese) checked so that all tables have the same size (they show a "-" where the value is zero).
Then press the "Display" button ("Mostra", in Portuguese).
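The selection rules above could be captured in a small helper that builds the form fields for one query. This is only a sketch: the field names ("linha", "coluna", and so on) and the "on" value for the checkbox are hypothetical placeholders, since the real names used by the form are not given here and would have to be read from the page itself.

```python
from urllib.parse import urlencode

# Fixed selections from Stage 2 (labels as quoted in the posting).
FIXED_COLUMN = "Mês do Nascimento"
FIXED_CONTENT = "Nasc. resid da mãe"


def build_form_fields(line, period, municipality):
    """Return the form fields for one query.

    All dictionary keys are hypothetical; inspect the real form
    to find the actual field names and option values.
    """
    return {
        "linha": line,
        "coluna": FIXED_COLUMN,            # always fixed
        "conteudo": FIXED_CONTENT,         # always fixed
        "periodo": period,
        "municipio": municipality,
        "exibir_linhas_zeradas": "on",     # keep "Show zeroed lines" checked
    }


fields = build_form_fields("Sexo", "2014", "EXAMPLE_MUNICIPALITY")
body = urlencode(fields)  # URL-encoded body, ready to be sent with the form
```

Keeping the two fixed values as constants means only the three varying selections have to be passed in for each query.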
******************************************************
Stage 3.
A new page containing a table will be loaded.
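One way to implement the waiting step is a generic polling helper with a timeout. The sketch below is not tied to any particular browser-automation or HTTP library; the `condition` callable is an assumption and would be whatever check tells you the results table is present.

```python
import time


def wait_until(condition, timeout=30.0, interval=0.5):
    """Poll `condition` until it returns a truthy value or `timeout` seconds pass.

    Returns True if the condition was met, False if the wait timed out.
    """
    deadline = time.monotonic() + timeout
    while True:
        if condition():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

A False result can be used to retry the query or to log the combination for a later pass.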
******************************************************
Stage 4.
After the page is loaded, press the "Copy as CSV" button ("Copia como CSV", in Portuguese) and save the file.
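Because the procedure is repeated many times, it may help to save each result under a predictable file name so that an interrupted run can resume without downloading the same table twice. The directory layout and the helper names below are assumptions made for illustration, not part of the requirements.

```python
import re
import unicodedata
from pathlib import Path


def slug(text):
    """Turn a label such as 'Instrução da mãe' into a safe file-name part."""
    ascii_text = (
        unicodedata.normalize("NFKD", text)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    return re.sub(r"[^a-z0-9]+", "_", ascii_text.lower()).strip("_")


def csv_path(out_dir, line, period, municipality):
    """Hypothetical layout: <out_dir>/<line>/<period>/<municipality>.csv"""
    return Path(out_dir) / slug(line) / str(period) / f"{slug(municipality)}.csv"


def save_csv(out_dir, line, period, municipality, csv_text):
    """Write one result; return False (and do nothing) if it already exists."""
    path = csv_path(out_dir, line, period, municipality)
    if path.exists():
        return False
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(csv_text, encoding="utf-8")
    return True
```

If two municipalities ever produced the same slug, one file would be skipped as already saved, so a unique code would be a safer key if the listbox exposes one.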
The algorithm must repeat this procedure for:
* Years 1995 to 2014 (20 years) of the listbox "Available Periods" ("Períodos Disponíveis", in Portuguese)
* All municipalities (~5,500) of the listbox "Municipality" ("Município", in Portuguese) in "Available selections" ("Seleções Disponíveis", in Portuguese)
* The following values (14 values) of the listbox "Line" ("Linha", in Portuguese):
- Mother's age (Portuguese: Idade da mãe)
- Mother's schooling (Portuguese: Instrução da mãe)
- Mother's marital status (Portuguese: Estado civil da mãe)
- Pregnancy duration (Portuguese: Duração da Gestação)
- Pregnancy type (Portuguese: Tipo de Gravidez)
- Delivery type (Portuguese: Tipo de Parto)
- Prenatal consultation (Portuguese: Consultas pré-natal)
- Gender of the baby (Portuguese: Sexo)
- Race of the baby (Portuguese: Cor/raça)
- Apgar, 1st minute (Portuguese: Apgar 1o minuto)
- Apgar, 5th minute (Portuguese: Apgar, 5o minuto)
- Birthweight (Portuguese: Peso ao nascer)
- Congenital anomaly (Portuguese: Anomalia congênita)
- Type of anomaly (Portuguese: Tipo de Anomalia Congenita)
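The full set of queries is the Cartesian product of the 14 "Linha" values, the 20 years, and the municipalities, which comes to 14 × 20 × ~5,500, or roughly 1.54 million requests. A minimal sketch of that enumeration follows; the labels are copied from the list above and may not match the option text on the site character for character, and "Peso ao nascer" is assumed to be the label for Birthweight.

```python
from itertools import product

# Years 1995 to 2014 inclusive.
YEARS = [str(year) for year in range(1995, 2015)]

# The 14 "Linha" values, as listed in the posting.
LINES = [
    "Idade da mãe",
    "Instrução da mãe",
    "Estado civil da mãe",
    "Duração da Gestação",
    "Tipo de Gravidez",
    "Tipo de Parto",
    "Consultas pré-natal",
    "Sexo",
    "Cor/raça",
    "Apgar 1o minuto",
    "Apgar, 5o minuto",
    "Peso ao nascer",            # assumed label for Birthweight
    "Anomalia congênita",
    "Tipo de Anomalia Congenita",
]


def iter_jobs(municipalities):
    """Yield every (line, year, municipality) combination to query."""
    return product(LINES, YEARS, municipalities)


def count_jobs(n_municipalities):
    """Number of queries for a given number of municipalities."""
    return len(LINES) * len(YEARS) * n_municipalities


print(count_jobs(5500))  # 1540000
```

At this volume, pacing the requests and being able to resume a partial run matter more than raw speed.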
******************************************************
Stage 5.
After saving all CSV files, create a single CSV file that combines all the data, according to the attached Excel template.
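The layout of the Excel template is not shown here, so the sketch below covers only the generic part of this stage: concatenating saved files that share a header into one output file, writing the header once. The semicolon delimiter and UTF-8 encoding are assumptions and should be adjusted to whatever the saved files actually use.

```python
import csv


def combine_csv_files(paths, out_path, delimiter=";"):
    """Concatenate CSV files with identical headers into `out_path`.

    Returns the number of data rows written. Raises ValueError if a
    file's header differs from the first header seen.
    """
    header = None
    n_rows = 0
    with open(out_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out, delimiter=delimiter)
        for path in paths:
            with open(path, newline="", encoding="utf-8") as src:
                reader = csv.reader(src, delimiter=delimiter)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip empty files
                if header is None:
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(f"header mismatch in {path}")
                for row in reader:
                    writer.writerow(row)
                    n_rows += 1
    return n_rows
```

Keeping "Show zeroed lines" checked in Stage 2 is what makes the identical-header check reasonable, since every table then has the same shape.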
Hello,
I can do this for you very fast. I am very experienced with web scraping and with CSV/Excel manipulation. I took a look at the website and I can do it.
Find me on chat and let's make this.
R$400 BRL in 3 days
5.0 (7 reviews)
2.9
11 freelancers are bidding on average R$552 BRL for this job
Hi,
I am a Python developer with more than 4 years of experience. I have worked on several projects related to web scraping and data mining, and I have developed many useful scripts and apps for tasks similar to yours.
I have taken a close look at the website, and it looks fine to me. Please allow me to suggest one thing for the final output: how about combining all the CSV files into a single CSV in which each row contains all the input parameters? For example:
Linha,Coluna, Conteúdo,PERÍODOS DISPONÍVEIS, etc ...
I mean I combine the input values in each row.
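As a rough illustration of this idea, each downloaded table could have its query parameters prepended as extra columns before the tables are merged. The function name and the parameter keys below are hypothetical.

```python
def tag_rows(rows, params):
    """Prepend the query parameters to a table's header and data rows.

    `rows` is a list of lists whose first item is the header row;
    `params` maps a column name to the value used for that query.
    """
    if not rows:
        return []
    header, *data = rows
    keys = list(params)
    values = [params[key] for key in keys]
    tagged = [keys + header]
    for row in data:
        tagged.append(values + row)
    return tagged


table = [["Sexo", "Jan"], ["Masc", "3"]]
tagged = tag_rows(table, {"Linha": "Sexo", "Período": "2014"})
# tagged == [["Linha", "Período", "Sexo", "Jan"], ["Sexo", "2014", "Masc", "3"]]
```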
I am looking forward to getting your feedback.
Kind regards,
Ramzi Trabelsi
Hi there, I have read the project description. I will write a web scraper to get this data from the website. Let me know and we can discuss the details. Thanks.
Hello Sir,
How are you?
I have read the project and understand your requirements. I am highly trained in data entry, web search, and web scraping, with strong knowledge of Excel. I can also do manual or automated data scraping work. I will have some questions when we discuss the project.
Please feel free to contact me directly to discuss this project further. I am always online on Skype and Gtalk/Google Hangouts, where we can discuss the details quickly if you are interested.
Skype: shafaqat-11
Best Regards,
Shafaqat Ali
Hi,
I have more than 14 years of data scraping experience and I am an expert in this kind of work.
I have completed more than 290 projects.
Please look at the feedback left by my employers to know more about my work. Waiting for your positive response.
Thanks.
Hi
I have been in this industry for 5 years and such jobs are my daily practice. I can assure you that if you work with me once, you will always work with me for these kinds of projects.