Web Scraping Specialist - EU Remote CET
at SEON Technologies
Budapest, Közép-Magyarország, Hungary -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 18 Jan, 2025 | Not Specified | 19 Oct, 2024 | N/A | Selenium,Data Extraction,Dynamic Websites,Python,Data Cleaning,Web Scraping | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
SEON is the leading fraud prevention system of record, catching fraud before it happens at any point across the customer journey. Trusted by over 5,000 global companies, we combine your company’s data with our proprietary real-time signals to deliver actionable fraud insights tailored to your business outcomes. We deliver the fastest time to value in the market through a single API call, enabling quick and seamless onboarding and integration. By analyzing billions of transactions, we’ve prevented $200 billion in fraudulent activities, showcasing why the world’s most innovative companies choose SEON.
SEON seeks a skilled Web Scraping Specialist to join our team in building cutting-edge anti-money laundering (AML) solutions. The data you extract from open-source web platforms will improve SEON’s fraud prevention and risk detection tools. By gathering and analyzing key information from public websites, you will strengthen our ability to detect and prevent illicit activities in financial transactions.
Our AML team specializes in the development of an anti-money laundering product suite. Our primary objectives revolve around enhancing the efficiency of collecting and managing data about individuals subject to diverse AML regulations, including sanctions, prominent public figures, financial supervision penalties, and warrant lists. Our diligent efforts involve continuously extracting data from over 300 sources and approximately 4,000 websites. In rendering this data searchable, we employ sophisticated techniques to address the intricacies of various languages and transcription nuances. Our focus on meticulous handling aims to minimize false positive results with the support of advanced Natural Language Processing (NLP) tools.
This is a remote role, and the ideal candidate will be based in the European Union, CET.
Responsibilities:
- Develop and maintain a scalable in-house built scraping pipeline using Python.
- Implement web scraping solutions using tools like Selenium, BeautifulSoup, or similar libraries.
- Troubleshoot, optimize and enhance existing scraping workflows and tools.
- Cooperation with data scientists and colleagues in developing in-house built data consolidation tools to clean and organize scraped data to ensure it is accurate, reliable, and ready for analysis.
- Manage and utilize third-party proxy services to ensure effective data extraction, bypassing anti-scraping mechanisms.
- Apply advanced client-faking techniques (e.g., user-agent rotation, CAPTCHA solving, IP masking) to avoid detection.
- Collaborate with data engineers and other team members to integrate data into pipelines or systems.
- Stay updated on the latest developments in web scraping, proxies, and anti-scraping techniques.
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Other
Information Technology
Graduate
Proficient
1
Budapest, Hungary