
Closed
Paid on delivery
Hello, I need a web crawler/scraper that operates on the darkweb and clearweb. It should:
- ignore nofollow directives
- use a configurable number of concurrent threads
- index everything it finds into a backend database (including the content of websites, archives, text, documents, etc.)
- seek out and crawl sites based on certain keywords, using different search engines to find sites to crawl
- generate random darkweb addresses to find and crawl sites
- have a CLI and a GUI, and either accept commands sent over SSH or come in a server and client version
- support configuring crawl depth, concurrent crawling threads, and other throttling settings
- support whitelisting and blacklisting of domains and TLDs
- ideally have a basic GUI, though I'd be happy to help design that, and it is not 100% necessary so long as there is an effective, interactive CLI
- build a searchable index of domains/subdomains/IPs/other basic info about the sites, and an index of the content/information within the sites crawled
- offer configurable options on what data to index, by CLI flags and by regex
- ideally offer API integration, allowing us to punch in our Shodan, Censys, Criminal IP, etc. API keys to use those sites as data sources; being able to use APIs to retrieve data from major OSINT sources such as Facebook would be excellent but is not strictly necessary at this point
- be able to operate as a distributed task
- be able to connect multiple different cloud and local storage devices for indexed information
- provide a web interface for the GUI, featuring index information and stats, configuration options, and the ability to search the indexed information for keywords or filter it by type, file extension, regex, or domain
- leave room for data analytics and ML/AI, which we will want to integrate into the program (possibly down the road); we would like the program to get better at sniffing out sensitive data as it continuously crawls, scrapes, and indexes its way through the darkweb
We are running a darkweb monitoring service whose capabilities we want to expand to improve the service offered to our clients. Basically, what we need the software to do is find and database any sensitive information/personal information/financial information/business information/identity documents/related material that has been leaked onto the darkweb (or clearweb), so that we can notify our clients that their information is vulnerable. Due to the nature of the service we provide, we need this solution to be extremely thorough at finding this information but also extremely secure in indexing and storing it.
Project ID: 39197903
51 proposals
Remote project
Active 1 yr ago
51 freelancers are bidding on average $520 CAD for this job

Hello, I understand that you need a robust web crawler and scraper for both the dark web and clear web, focused on indexing sensitive information while ensuring security. My approach will include developing a configurable system with concurrent threads, customizable crawl depth, and black/whitelist features to filter domains. I'll implement both CLI and a basic GUI, along with API integration for enhanced data sources. The goal is to provide an effective tool for your darkweb monitoring service, allowing for thorough and secure indexing of sensitive data, which will greatly benefit your clients. What specific database technology would you like to use for indexing the data collected by the crawler? Thanks, Muhammad Awais
$750 CAD in 14 days
9.0

⭐⭐⭐⭐⭐ Build a Powerful Web Crawler for Darkweb and Clearweb Monitoring ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project needs and see you're looking for a web crawler/scraper for both darkweb and clearweb. Look no further; Zohaib is here to assist you! My team has completed 50+ similar projects focused on web crawling and data indexing. I will create a solution that indexes everything from websites to documents and allows for customizable search parameters. ➡️ Why Me? I can easily handle your web scraping project as I have 5 years of experience in web crawling and data analysis. My expertise includes Python, database management, and API integration. Additionally, I have a strong grip on technologies like cloud storage and data security. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. I'm excited to explore how we can enhance your darkweb monitoring service. ➡️ Skills & Experience: ✅ Web Crawling ✅ Data Indexing ✅ Python Programming ✅ API Integration ✅ Database Management ✅ Cloud Storage ✅ CLI and GUI Development ✅ Data Security ✅ Regex Configuration ✅ Keyword Searching ✅ SSH Command Support ✅ Data Analytics Waiting for your response! Best Regards, Zohaib
$350 CAD in 2 days
7.6

As the CEO and Founder of Digital Screencast, I can wholeheartedly assure you that I possess the aptitude and experience necessary to build the advanced web crawler and scraper you're seeking. Having spent 7+ years as a Full Stack Developer with companies such as Metlife GOSC and DXC technologies, I am no stranger to complex web scraping projects. My expertise lies in Web Scraping, Web Automation, and writing useful scripts, the very skills that your project requires. Not only do I offer proven competency in utilizing PHP and other software architecture tools for this task, but I also bring to the table experience with databases (including Access Database and MySQL) which will be vital for indexing everything your crawler retrieves. Moreover, my extensive knowledge of Python broadens my ability to create configurable options for indexing data by CLI flags and regex, much in line with your project requirements. In addition to technical proficiency, I'm committed to tailoring my service around my clients' needs. If chosen for this role, you can expect not just excellent work quality but also superior speed and accuracy underlined by pay-as-you-like model after delivery. My end goal is always creating a long-term, mutually satisfying relationship with my clients - ensuring a commitment not just to finishing a project but exceeding expectations throughout. Let's bring your dark web monitoring service into an exciting new frontier together!
$500 CAD in 7 days
7.8

Hi there, I'm bidding on your project "Advanced Web Crawler and Scraper Development". Let's dive in and have a meeting; I am an expert in this area. Please leave a message on my chat so we can discuss the budget and deadline of the project. I have read your project description and I'm confident I can do this project for you perfectly. Regards, Usama
$750 CAD in 3 days
7.3

I have extensive experience working on similar projects involving advanced web crawling and scraping solutions.
1. Technical Approach:
- Develop a web crawler/scraper that operates on both the dark web and clear web.
- Implement the ability to ignore nofollow directives.
- Enable configurable concurrent threading for efficient crawling.
- Store all indexed data in a backend database.
- Use keyword-based site discovery and search engine integration.
- Generate random dark web addresses for crawling.
- Provide CLI and GUI interfaces with SSH support.
- Allow configuration of crawl depth, threads, and throttling settings.
- Support domain whitelisting/blacklisting and TLD filtering.
- Build searchable indexes of domains, subdomains, IPs, and content.
- Integrate API support for data retrieval from sources like Shodan and Censys.
- Implement distributed task capabilities and multiple storage device connections.
- Develop a web interface for indexing information, stats, and search functionality.
2. Technologies:
- Python for backend development.
- Scrapy framework for web crawling.
- Elasticsearch for data indexing and searching.
- Docker for containerization.
- Flask for web interface development.
- Integration with Shodan, Censys, and other APIs.
3. Testing and Integration Plan:
- Conduct unit testing for each module.
- Perform integration testing to ensure seamless functionality.
- Beta testing with a select group of users for feedback.
- Continuous integration to maintain code quality.
- Provide thorough documentation for user readiness.
4. Performance and Scalability Optimizations:
- Implement efficient data structures for indexing.
- Optimize crawling algorithms for speed.
- Utilize cloud services for scalability and storage.
- Monitor performance metrics for tuning.
By following this technical approach, utilizing relevant technologies, and adhering to a rigorous testing and integration plan, we will deliver a reliable and user-ready solution that meets the client's requirements.
$750 CAD in 7 days
7.4

Hello, I am excited about your project for developing an advanced web crawler and scraper tailored for both the dark web and clear web. With over 5 years of experience in web scraping, software architecture, and database integration, I can deliver a robust solution that meets all your requirements. Your need for configurable concurrent threads and effective CLI/GUI interfaces aligns perfectly with my expertise. I can design the scraper to index a wide range of information while ensuring it operates securely, taking into account the importance of sensitive data handling. Furthermore, I will implement whitelisting and blacklisting features for domains and TLDs to enhance your monitoring capabilities. Additionally, I can incorporate API integrations to enrich data sources and build a user-friendly web interface for managing indexed information. My background in machine learning will also enable us to enhance the tool's capability to identify sensitive data over time, making it an invaluable asset for your dark web monitoring service. Could you please clarify any specific programming languages or frameworks you prefer for the development of this web crawler and scraper? Thanks, Rashid
$700 CAD in 30 days
6.8

Through my AI and software development expertise, I can deliver the advanced web crawler and scraper that your project demands. My strong background in Machine Learning (ML), PHP, and Web Scraping aligns perfectly with the technical requirements of your project. Having honed my skills in developing comprehensive AI solutions and generating tailored, scalable frameworks, I am well-equipped to design a powerful web crawling tool that indexes everything it finds into a secure backend database. Beyond just meeting your stated needs, I seek to enhance the functionality of your monitoring service by integrating data analytics and ML/AI into the program. This will allow for continuous improvement in sniffing out sensitive data as the tool crawls through both darkweb and clearweb domains. Additionally, my proficiency in API integration can facilitate connections to key data sources like Shodan, Facebook, Censys, or Criminal IP, further enriching your intelligence-gathering capabilities. Moreover, having built cloud-based solutions using Kubernetes and Docker for big data management, I'm confident about connecting multiple storage devices efficiently to handle the indexed information.
$750 CAD in 7 days
7.4

Hello, I trust you're doing well. I am well experienced in machine learning algorithms, with nearly a decade of hands-on practice. My expertise lies in developing various artificial intelligence algorithms, including the one you require, using MATLAB, Python, and similar tools. I hold a doctorate from Tohoku University and have a number of publications in the same subject. My portfolio, which showcases my past work, is available for your review. Your project piqued my interest, and I would be delighted to be part of it. Let's connect to discuss in detail. Warm regards. Please check my portfolio link: https://www.freelancer.com/u/sajjadtaghvaeifr
$500 CAD in 7 days
7.0

Hello. I’m thrilled to have the chance to propose my services for your project. With a strong background in Scraping, I’m eager to help bring your vision to life. I look forward to the possibility of working together. Cheers.
$500 CAD in 3 days
6.6

Hello, I came across your project and found it truly interesting. With over six years of hands-on experience in this field, I have successfully delivered high-quality solutions to clients worldwide. My dedication to excellence is reflected in the 150+ positive reviews from satisfied clients. I’d love to bring this expertise to your project and ensure outstanding results. However, I do have a few important points I’d like to clarify to align perfectly with your vision. Let’s connect via chat, where I can also share relevant examples of my past work. I'm looking forward to hearing back from you! Best Regards, Divu.
$750 CAD in 8 days
6.2

Hello, I hope you are doing well! This is Efan. I checked your project details carefully. I have over 8 years of experience with Web Scraping, Software Architecture, PHP, Machine Learning (ML), and Web APIs, and I can update you shortly. Cheers, Efan
$250 CAD in 4 days
6.0

Hi There, I’m excited to offer you an exclusive 25% discount on all my services, making it easier than ever to develop a powerful and efficient web crawler and scraper tailored to your needs! Need an Advanced Web Crawler and Scraper? I specialize in building high-performance data extraction tools that can efficiently collect, parse, and structure web data from multiple sources. Whether you require real-time scraping, automated data collection, CAPTCHA bypassing, or API integrations, I’ll develop a robust and scalable solution to streamline your data acquisition process. Take advantage of this limited-time offer, and let’s build your custom web scraping tool today! Best regards, Sohail Jamil
$250 CAD in 1 day
6.4

With my extensive background in Machine Learning, PHP, and Software Architecture, as well as my passion for innovation, I am confident that I can deliver the advanced web crawler and scraper you need to revolutionize your darkweb monitoring service. Over my 6+ years of experience, I've built intricate and efficient systems similar to what you're envisioning, systems that would certainly give you an edge where data analytics matters. I'll ensure your crawler not only finds but effectively indexes all important elements on the targeted web into a backend database, while also allowing data extraction through different mediums. I'm well-versed in building ML modules for intrusion detection and data classification, and I will apply these skills to refining your web scraper so that it fetches better indexed information each time it's used. The use of differing search engines and configurable parameters for domain filtering is right within my sphere of expertise. Connecting multiple storage devices for efficient indexing doesn't faze me either. Moreover, I deeply understand how sensitive the nature of your service is. Consequently, I'll prioritize security at all levels, from database setup to retrieval mechanisms, ensuring your clients' information remains confidential without compromise. It is indeed an exciting project to work on, and with my skills, passion, and dedication I can give you a solution that will completely elevate your darkweb monitoring service.
$500 CAD in 5 days
6.1

Hello, thank you for sharing the details of your project. After reviewing it, I’m confident that I can deliver exactly what you need. However, I do have a few important questions that I’d like to clarify before moving forward. Could you please connect with me via chat so we can discuss further? I’d also be happy to share my recent work that aligns with your requirements. Looking forward to hearing from you soon! Neha
$700 CAD in 7 days
5.7

Hello, I’ve reviewed your project and am confident my expertise in PHP, Web Scraping, Software Architecture, Machine Learning (ML), and Web API makes me a great fit. I have years of hands-on experience in these areas and am passionate about delivering high-quality results that truly make an impact. Let’s connect and chat. I’d love the chance to discuss how I can bring value to your project! Regards, Umair
$250 CAD in 3 days
5.3

Hi there, I’ve carefully read your project description, "Advanced Web Crawler and Scraper Development", and I'm really interested in this job. I’m a full stack engineer with 8+ years of experience and can offer the best quality and highest performance within your timeline. I’m ready to discuss your project and can start immediately. I'd like to talk about your proposal via chat. I will wait for your reply. Thanks! Yurii.
$450 CAD in 7 days
5.2

Hi there, good afternoon. I am Talha. I have read your project details and saw that you need help with PHP, Web Scraping, Software Architecture, Web API, and Machine Learning (ML). I am pleased to present my proposal, highlighting our extensive experience and proven track record in delivering exceptional results. Our portfolio showcases past projects that demonstrate our ability to meet and exceed client expectations, and testimonials from satisfied clients attest to our professionalism, dedication, and the quality of our work. Please note that the initial bid is an estimate, and the final quote will be provided after a thorough discussion of the project requirements or upon reviewing any detailed documentation you can share. Could you please share any available detailed documentation? I'm also open to further discussions to explore specific aspects of the project. Thanks. Regards, Talha Ramzan
$250 CAD in 7 days
5.3

Hey, the web crawler will be built using Python, with Scrapy for efficient crawling and BeautifulSoup for HTML parsing. It will support both Tor and I2P networks, using Stem and Requests to access darkweb sites, while also crawling the clearweb. The backend database will use PostgreSQL for scalable indexing, with Elasticsearch for full-text search capabilities. The crawler will support multi-threading with asyncio and threading to allow configurable concurrent crawling. It will feature a command-line interface for control, with SSH support for remote execution, and a basic web GUI using Flask for monitoring and searching indexed data. Crawling depth, rate limits, whitelisting/blacklisting, and regex-based filtering will be configurable via YAML/JSON settings. An API integration layer will allow connections to Shodan, Censys, and other OSINT sources. Indexed data will be encrypted and stored securely across multiple cloud/local storage backends. Future-proofing for AI/ML-based pattern recognition will be ensured by designing modular data processing pipelines using TensorFlow. Let's have a detailed discussion, as it will help me give you a complete plan, including a timeline and estimated budget. I will share my portfolio in chat. I look forward to hearing from you. Thanks. Regards, Mughira
$500 CAD in 7 days
5.1

Hello! What is the expected scale of data to be indexed? What is your priority regarding speed versus thoroughness of the crawl? Are there any specific legal or ethical considerations for accessing and storing darkweb data that need to be addressed? This project is very interesting and aligns well with my skills, especially in secure data handling and complex web scraping. Let's discuss the specifics to provide an accurate quote.
$750 CAD in 1 day
4.7

Hello, I have good experience scraping a variety of websites with Python. Contact me to discuss more project details. Thanks.
$500 CAD in 7 days
4.8

Calgary, Canada
Member since Mar 12, 2025