Scraping web pages nutch jobs

Bộ lọc

Tìm kiếm gần đây của tôi
Lọc theo:
Ngân sách
đến
đến
đến
Nhiều kỹ năng
Ngôn ngữ
    Tình trạng công việc
    260 scraping web pages nutch công việc được tìm thấy, giá USD

    I'm trying to run a web crawling application in my site using Nutch, but I;m getting this error message. I need some expert to help me to solve this issue. For more details please visit this site [đăng nhập để xem URL] Max budget for solving this issue is $30 USD.

    $36 (Avg Bid)
    $36 Giá đặt trung bình
    22 lượt đặt giá

    I need somebody to take the code of nutch [đăng nhập để xem URL] and do the following. 1. Rename all the files, folder , names etc to remove nutch and replace with our own name. 2. Delete from the code all the opensource licenses etc to make the size smaller. 3. Create API for the software similar to ([đăng nhập để xem URL])

    N/A
    N/A
    0 lượt đặt giá

    Nutch is a Java based Web-Search engine. While it can run on clusters of hundreds of machines it can also be run on a single host and can provide search results via a few JSP pages provided with nutch. Crawling would be accomplished by something like `./bin/nutch crawl [đăng nhập để xem URL] -dir crawl -depth 2 -topN 30000` and the HTML interface by

    $595 (Avg Bid)
    $595 Giá đặt trung bình
    5 lượt đặt giá

    I need somebody to take the code of nutch [đăng nhập để xem URL] and do the following. 1. Rename all the files, folder , names etc to remove nutch and replace with our own name. 2. Delete from the code all the opensource licenses etc to make the size smaller. 3. Create API for the software similar to ([đăng nhập để xem URL])

    N/A
    N/A
    0 lượt đặt giá

    Can you prove you know hadoop, nutch and aws in a realworld application. If so please bid.

    $30 (Avg Bid)
    $30 Giá đặt trung bình
    1 lượt đặt giá

    We want the coder to have experience with: 1. Running Nutch or similar end-to-end web crawling and indexing software preferably on a grid. 2. Using multiple machines with Amazon's EC2. For this task, we need user to tune the settings in Nutch/Hadoop to get optimal performance with the crawling of around 10 million URLs across around 100 amazon machines

    $100 - $400
    $100 - $400
    0 lượt đặt giá

    I have project to install / setup only all codes coming from WikiaSearch ( Nutch + Wikipedia + Social Engine) the project they close down for search engine with human/user control ranking system, i will need about 10 to 20 hours of arranging the codes and installing in in few location for testing of distributed crawling and indexing, if you can do please

    $443 (Avg Bid)
    $443 Giá đặt trung bình
    4 lượt đặt giá

    I have project to install / setup only all codes coming from WikiaSearch ( Nutch + Wikipedia + Social Engine) the project they close down for search engine with human/user control ranking system, i will need about 10 to 20 hours of arranging the codes and installing in in few location for testing of distributed crawling and indexing, if you can do please

    $200 - $300
    Đã niêm phong
    $200 - $300
    0 lượt đặt giá
    Custom Nutch Search Engine Đã kết thúc left

    I have project to install / setup only all codes coming from WikiaSearch ( Nutch + Wikipedia + Social Engine) the project they close down for search engine with human/user control ranking system, i will need about 10 to 20 hours of arranging the codes and installing in in few location for testing of distributed crawling and indexing, if you can do please

    $701 (Avg Bid)
    $701 Giá đặt trung bình
    6 lượt đặt giá

    I have project to install / setup only all codes coming from WikiaSearch ( Nutch + Wikipedia + Social Engine) the project they close down for search engine with human/user control ranking system, i will need about 10 to 20 hours of arranging the codes and installing in in few location for testing of distributed crawling and indexing, if you can do

    N/A
    N/A
    0 lượt đặt giá
    Webinterface for Nutch Đã kết thúc left

    We need a webapplication that is based on Nutch, see [đăng nhập để xem URL] The purpose with the application is to get a webbased interface for Nutch that can be used to create indexes on a set of specified pages and retrieve information from these [đăng nhập để xem URL] basically we would like a webapplication that is the controller of a webcrawler

    $1827 (Avg Bid)
    $1827 Giá đặt trung bình
    5 lượt đặt giá
    Web interface for Nutch Đã kết thúc left

    ...is based on Nutch, see [đăng nhập để xem URL] The purpose with the application is to get a webbased interface for Nutch that can be used to create indexes on a set of specified pages and retrieve information from these pages Basically the webapplication should work as a webbased configuration interface for Nutch where you can

    $382 (Avg Bid)
    $382 Giá đặt trung bình
    4 lượt đặt giá

    I am looking for an offshore resource who is comfortable configuring hadoop and nutch on amazon web services. You must have experience of configurations supporting 100m+ pages and 1TB+ of data on aws using ebs and the latest versions of code. This will be a long-term, several hours per week support job to provide round the clock cover with existing

    $382 (Avg Bid)
    $382 Giá đặt trung bình
    4 lượt đặt giá

    For finishing an eCommerce application we require a TOP Java developer full-time for the period of 2 months. Thereafter he must be able to do the regular maintenance...Thereafter he must be able to do the regular maintenance and further development of the project in long term bases. Required skills: Java, J2EE, Hibermate, Strut, Lucene, Nutch, My/SQL

    $1322 (Avg Bid)
    $1322 Giá đặt trung bình
    26 lượt đặt giá

    I am looking for an offshore resource who is comfortable configuring hadoop and nutch on amazon web services. You must have experience of configurations supporting 100m+ pages and 1TB+ of data on aws using ebs and the latest versions of code. This will be a long-term, several hours per week support job to provide round the clock cover with existing

    $470 (Avg Bid)
    $470 Giá đặt trung bình
    5 lượt đặt giá
    need java programmers Đã kết thúc left

    I want a java coder to modify a Nutch install I have currently on my dedicated server [đăng nhập để xem URL] Its currently installed and running with just a basic installation. I want to enhance it with the following features: 1) URL contribution and rating system. A way for visitors to suggest new URLs for keywords, people should

    $368 (Avg Bid)
    $368 Giá đặt trung bình
    3 lượt đặt giá
    Web spider and search engine Đã kết thúc left

    ...for help building a web spider / scraper and search engine. Most likely Nutch and Solr or Herirtix and Solr. Need to collect data and images and use this for the search. In total we are spidering 1500+ websites to build our search engine. Some might require a login… Applicant needs to be familiar with Web Spiders or Web Scrapers and search

    $750 - $1500
    Đã niêm phong
    $750 - $1500
    28 lượt đặt giá
    need java programmer Đã kết thúc left

    I want a java coder to modify a Nutch install I have currently on my dedicated server [đăng nhập để xem URL] Its currently installed and running, but I want parts of it modified 1) URL contribution and rating system. A way for visitors to suggest new URLs for keywords, people should be able to rate (with stars) existing results, also button

    $246 (Avg Bid)
    $246 Giá đặt trung bình
    4 lượt đặt giá
    Need Quick XML Feed Scrape Đã kết thúc left

    Need Quick Scrape If you haven't worked with me before please don't. If you are familiar with scraping on a larger level... nutch, etc... send me a message so we can talk. I m just looking for XML feed to be parsed though right now. Thanks P :) ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete

    $4 (Avg Bid)
    $4 Giá đặt trung bình
    1 lượt đặt giá
    modify nutch java Đã kết thúc left

    I have a nutch install up and running I want to add this to it. Basically, a way for users who visit suggest new URLs to show for queries. Wikia has source code which you may use all of if helpful. source code: [đăng nhập để xem URL] URL contribution and rating system. A way for visitors to suggest new URLs for keywords, people should be able to

    $127 (Avg Bid)
    $127 Giá đặt trung bình
    1 lượt đặt giá

    I have a nutch install up and running, I want to build some things on top of it. Nutch is in Java, I'm unsure if the following request has to be in Java as well or if it can be in php/mysql. Next to each of my requests is a link to source code you may use and implement from a similar project, not all of it is 100% complete which is why I cant just use

    $297 (Avg Bid)
    $297 Giá đặt trung bình
    1 lượt đặt giá
    Need Nutch Engineer Đã kết thúc left

    Have you worked with Nutch before and can set it up + modify it? We are looking for something fairly simple and need your help.? Please respond back with an example of work you have done with it.? ? Understanding and knowledge of cheapest way possible to host would help too.? Extra bonus. Thanks, LJMIII ## Deliverables 1) Complete and

    $100 - $500
    $100 - $500
    0 lượt đặt giá
    279419 Integrate Nutch and Solr Đã kết thúc left

    ...to help me integrate nutch and solr to build a search engine. So far I have successfully installed both on my server in CentOS 5.2. However, I have difficulty making them integrated. I want to use Nutch as web crawler and Solr as indexing tool. I need you to do: 1. Integrate the already installed Nutch and Solr 2. Build web interface for query

    N/A
    N/A
    0 lượt đặt giá
    P2P Search Engine (idea) Đã kết thúc left

    ...Client/Server will categorize/index/sort web pages and do most things that a search engine would do like Nutch for example. The idea is that each P2P client indexes and categorizes and there is a remote connection from each client ( will need to be encrypted ) that serves the pages as per search basis from the Web Server. The idea is creating a supercomputer

    $538 (Avg Bid)
    $538 Giá đặt trung bình
    3 lượt đặt giá

    We are looking for someone who understands hadoop and nutch to join our team. We need you to write some plugins for hadoop/nutch based on our requirements. Ideally you will be an expert in hadoop and can add real value to our project by helping us get this right. Please get in touch if you are interested and have the required skills. We can discuss

    $1191 (Avg Bid)
    $1191 Giá đặt trung bình
    18 lượt đặt giá
    need search for my website Đã kết thúc left

    ...like PHP. Some suggestions. ### Open source search engines DataparkSearch Egothor Gonzui Grub Ht://dig Isearch Lucene Lemur Toolkit & Indri Search Engine mnoGoSearch Namazu Nutch OpenFTS Sciencenet (for scientific knowledge, based on YaCy technology) Sphinx SWISH-E Terrier Search Engine Wikia Search Xapian YaCy Zettair ## Deliverables 1) Complete

    $31 (Avg Bid)
    $31 Giá đặt trung bình
    2 lượt đặt giá

    i need someone who can make a c or c++ search engine for me for a niche area of files/websites on the internet. this will be like nutch but using only c or c++ indexer, spider, search, (complete package)

    N/A
    N/A
    0 lượt đặt giá
    clone of searchme Đã kết thúc left

    I am looking for somebody to take the nutch code ( nutch is an opensource search software, with code available for free download ). I need a developer to take the nutch code and create an interface/visual display method along the same lines as [đăng nhập để xem URL] ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well

    $100 - $500
    $100 - $500
    0 lượt đặt giá
    install nutch on mediatemple Đã kết thúc left

    I need a coder to install a nutch set up on my media temple server and have it crawl one website. Thanks ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows? (depending on the nature? of the deliverables):

    $106 (Avg Bid)
    $106 Giá đặt trung bình
    3 lượt đặt giá
    249334 vertical search engine Đã kết thúc left

    ...vertical search engine that searches a finite list of websites[<2000 sites] and indexes all the pages from these websites. the search engine spider must crawl the list of websites thrice in 24 hours. the search engine spider has to crawl and index all the pages only of a particular [đăng nhập để xem URL] doesnt need to crawl any external websites that url has links to

    N/A
    N/A
    0 lượt đặt giá
    Web crawler and a search engine Đã kết thúc left

    ...should be done using java and apache nutch. ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows? (depending on the nature? of the deliverables): a)? For web sites or? other server-side deliverables

    PHP
    $2489 (Avg Bid)
    $2489 Giá đặt trung bình
    7 lượt đặt giá
    build a simple search engine Đã kết thúc left

    Nutch install to search and regularly crawl [đăng nhập để xem URL] I'd like some information about how much we can customize nutch after the initial install. For example, how difficult is it to add social features like rating search results, letting users add URLs to add to the crawl? [đăng nhập để xem URL] has a lot of these features and is based on

    $42 (Avg Bid)
    $42 Giá đặt trung bình
    1 lượt đặt giá
    Install spider to crawl my site Đã kết thúc left

    Nutch install to search and regularly crawl [đăng nhập để xem URL] I'd like some information about how much we can customize nutch after the initial install. For example, how difficult is it to add social features like rating search results, letting users add URLs to add to the crawl? [đăng nhập để xem URL] has a lot of these features and is based on

    $30 - $100
    $30 - $100
    0 lượt đặt giá
    build nutch based search engine Đã kết thúc left

    Id like to have a search engine built as a proof of concept, meaning, it doesnt need to support a large number of users, and it can be fai...Ideally, I'd like to make one similar to wikia search without all the additional features. I'd like a coder to explain the process in using [đăng nhập để xem URL] as well as nutch for a search engine. Thanks

    $609 (Avg Bid)
    $609 Giá đặt trung bình
    5 lượt đặt giá

    ...[đăng nhập để xem URL] as well as nutch for a search engine. Thanks ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows? (depending on the nature? of the deliverables): a)? For web sites or? other server-side

    $1969 (Avg Bid)
    $1969 Giá đặt trung bình
    6 lượt đặt giá

    Create web services that return query results using Java, Tomcat, Axis and Nutch/Lucene. The web server must have a pre-built index that is updated by some periodic task (which you must write/supply). SOAP based search queries will be sent to it from our suite of web sites (see: [đăng nhập để xem URL]) and results must be returned that

    $918 (Avg Bid)
    $918 Giá đặt trung bình
    4 lượt đặt giá

    ...crawls the web for content relevant for given search terms. For example: Search term: "australian home design" Example process: After querying the major search engines and other relevant sites like Wikipedia and [đăng nhập để xem URL] (for example) the software would extract the most relevant paragraphs for the given search term from the web pages it dee...

    $1105 (Avg Bid)
    $1105 Giá đặt trung bình
    8 lượt đặt giá

    I am looking for some help configuing and running nutch-0.9 on Windows. There are two parts: 1. I need a zipped nutch-0.9 folder, with all the properties configured to run as single instance with hadoop. I already have Cygwin running. If you have it running somewhere, please try it on some other machine and it's works just send me the zip with

    $30 - $250
    Đã niêm phong
    $30 - $250
    2 lượt đặt giá

    We have already an almost running application which we want to redesign and clone using Java / Hibermate – (Spring, Jboss, or Struts)– Lucene – Nutch -. MySQL This application covers all mayor functionalities from the most popular online auctions like eBay, eBid and furthermore has some own specialities like reverse auctions, wanted items, companies

    N/A
    N/A
    0 lượt đặt giá

    We have already an almost running application which we want to redesign and clone using Java / Hibermate ??" (Spring, Jboss, or Struts)??" Lucene ??" Nutch -. MySQL This application covers all mayor functionalities from the most popular online auctions like eBay, eBid and furthermore has some own specialities like reverse auctions, wanted items, companies

    $8666 (Avg Bid)
    $8666 Giá đặt trung bình
    40 lượt đặt giá

    We have already an almost running application which we want to redesign and clone using Java / Hibermate – (Spring, Jboss, or Struts)– Lucene – Nutch -. MySQL This application covers all mayor functionalities from the most popular online auctions like eBay, eBid and furthermore has some own specialities like reverse auctions, wanted items, companies

    min $3000
    Đã niêm phong
    min $3000
    39 lượt đặt giá

    We have a Web Portal written in Perl in which users can submit a company profile and introduce the URL from their Web Site. This URL should be crawled and indexed in order to build a vertical search engine. When visiting a company profile other visitors shall be able to search from this Page the main Web Site from this particular company. Every user

    $750 - $1500
    Đã niêm phong
    $750 - $1500
    10 lượt đặt giá

    We have a Web Portal written in Perl in which users can submit a company profile and introduce the URL from their Web Site. This URL should be crawled and indexed in order to build a vertical search engine. When visiting a company profile other visitors shall be able to search from this Page the main Web Site from this particular company. Every user

    $850 (Avg Bid)
    $850 Giá đặt trung bình
    2 lượt đặt giá

    I need to setup a WikiPedia Mirror with lucene (nutch/solr) search. The web site should have a text box with full support for lucene query language

    $550 (Avg Bid)
    $550 Giá đặt trung bình
    5 lượt đặt giá

    ...org/java/docs/[đăng nhập để xem URL]> ? ? ? 2. Results with local links to matched wikipedia articles should be retuned (number of results should be configurable) ? ? ? ? 3. Solr/Nutch separate or in combination should be used for? the project ? ? ? ? 4. It seems that not much of programming is required for thje project but setup and configuration of

    $850 (Avg Bid)
    $850 Giá đặt trung bình
    1 lượt đặt giá

    ...anything... so probably in c++, php/mysql front end. i dont like perl because its a pain to install its modules. its ok to use open source ports of nutch lucene in c++ question? is the zend/php port of nutch lucene a very fast solution? is it recommended? remember this is just to get a working prototype! so it needs to be CHEAP and simple!

    $5 (Avg Bid)
    $5 Giá đặt trung bình
    1 lượt đặt giá

    I need someone to look at the nutch index code and tell me know it works. I know the properties in nutch-0.9/conf/[đăng nhập để xem URL] boost the weight of curtain elements on a page when that page is getting ranked in the index. I need to understand all the factors how a page is ranking in the the nutch index. currently this is what I know

    $424 (Avg Bid)
    $424 Giá đặt trung bình
    2 lượt đặt giá

    ...can work to integrate a better search engine onto our business social networking site. We would like to use free search software, such as Apache Nutch or Apache SOLR, to start. The former indexes web pages, and the latter indexes arbitrary data (in comma-separated or xml format), both using the same underlying engine. We would set this up on our

    $750 - $1500
    Đã niêm phong
    $750 - $1500
    4 lượt đặt giá

    Vui lòng Đăng Ký hoặc Đăng Nhập để xem thông tin.

    Nồi Bật Đã niêm phong