Find Jobs
Hire Freelancers

Java web scrapping improve

$250-750 USD

Fechado
Publicado há quase 4 anos

$250-750 USD

Pago na entrega
Request details I developed a Java program to scrap information from a website. The architecture of the solution involves: 1) using Java Selenium to send requests to the webpage via Chrome Webdriver to trigger authentication and authenticated requests; 2) routing the requests from Chrome (headless) to Java BrowserMobProxy to capture three HTTP headers (Authorization, X-CSRF-TOKEN, and Cookie) and one query string (without these, the server after some requests starts responding 512); and 3) use these 4 elements in HTTPs requests from Java directly to the webpage (i.e. without Selenium, Chrome, and BrowserMobProxy involved) to retrieve the desired information. This program does the basic functionality of extracting the information but has a few problems: It depends on an external non-Java component: Chrome WebDriver It depends on Java Selenium and Java BrowserMobProxy, two dependencies that I would like to remove It is not optimized (too much refresh and too long sleep periods) relatively to the limit upon which the Webpage (Cloudfare) starts responding 429 errors. Thus, the retrieval of the information is taking much more time than needed. Deliverables You will get the current program Java code and you will need to solve the problems above. To do so, you will need to: A. Find out how to authenticate and refresh the 3 headers and the query string without depending on Selenium, Chrome Webdriver, and BrowserMobProxy. As most of this data is likely generated in JavaScript, you will need knowledge about JavaScript and how to execute JavaScript from within Java or convert the JavaScript code to Java (preferable solution). B. You will need to identify the limit upon which the Webpage (behind Cloudfare) starts responding 429 errors. You will need to tune the refresh frequency of the headers and sleep periods to the limit identified. You will need to demonstrate the benefits of your changes by extracting the information currently extracted by the program and measuring how long it takes. Note: you will need to create your own login/password in the webpage. No additional requirements exist to register.
ID do Projeto: 26818026

Sobre o projeto

8 propostas
Projeto remoto
Ativo há 4 anos

Quer ganhar algum dinheiro?

Benefícios de ofertar no Freelancer

Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
8 freelancers estão ofertando em média $491 USD for esse trabalho
Avatar do Usuário
Hello, I am pleasure with your job as detailed. Thank you for the job posting. It’s a pleasure to meet you. I’d really like to work with you on this one if possible! I do have a couple of questions, but first I’d like to make you an offer and some background so you can check my work out. I have been developing kind of project within 4+ years so I’m fluent experience to handle project. You’ll get all the expected stuff like a great professional service and a fast turnaround, at a bit less, and I get a bit more exposure. If the above offer sounds like something you would be interested in, I’d love to hear from you. Best regards, Adebayo
$250 USD em 2 dias
4,9 (48 avaliações)
6,2
6,2
Avatar do Usuário
Dear Employer, I have read the project details and confident to work on improving java web scraping. I have extensive knowledge on java, javascript, python, software,etc . Kindly message me so that we can discuss more about the work. Regards Lucky
$667 USD em 3 dias
5,0 (35 avaliações)
5,2
5,2
Avatar do Usuário
Hi, sir. I have carefully checked your requirements and I was glad that I've already done this kind of projects before. I'd love to share more detail with you over chat and I'm sure that you'll be interested in them. I also have very much experience in all skills you wanted for this project, Web Scraping, JavaScript, Java, Software Architecture, Python, so I think I can be the best candidate. Please contact me so that we can talk in detail. Thank you in advance!
$555 USD em 6 dias
5,0 (2 avaliações)
4,1
4,1
Avatar do Usuário
Hi, how are you doing? I hope you're doing well! I am a professional Web Scraper for the last 7 years. I am confident to complete your project. Regards! Sergei.
$450 USD em 4 dias
0,0 (0 avaliações)
0,0
0,0
Avatar do Usuário
Hi please hire me Relevant Skills and Experience Did the automation testing in selenium using java
$556 USD em 10 dias
0,0 (0 avaliações)
0,0
0,0

Sobre o cliente

Bandeira do(a) ROMANIA
Băilești, Romania
5,0
1
Membro desde mar. 8, 2020

Verificação do Cliente

Obrigado! Te enviamos um link por e-mail para que você possa reivindicar seu crédito gratuito.
Algo deu errado ao enviar seu e-mail. Por favor, tente novamente.
Usuários Registrados Total de Trabalhos Publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Carregando pré-visualização
Permissão concedida para Geolocalização.
Sua sessão expirou e você foi desconectado. Por favor, faça login novamente.