This is a revised posting of the project we posted earlier today [url removed, login to view]
We are looking for someone who can build us a completely custom search engine based on the same idea as Google. We need our entire website built from scratch, with a search database covering billions of websites. You will be building both the website and the database.
We need a crawler that is capable of crawling 10,000,000+ websites per month when running 24/7.
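To put that crawl target in perspective, here is a rough calculation of the sustained rate it implies. It assumes a 30-day month and no downtime; both are our simplifications for illustration, and the function name is made up for this sketch.

```python
# Rough throughput needed to crawl 10,000,000 sites in one month, running 24/7.
SITES_PER_MONTH = 10_000_000
SECONDS_PER_MONTH = 30 * 24 * 60 * 60  # assumption: 30-day month, no downtime


def required_rate(sites: int, seconds: int) -> float:
    """Average number of sites that must be crawled per second."""
    return sites / seconds


rate = required_rate(SITES_PER_MONTH, SECONDS_PER_MONTH)
print(f"{rate:.2f} sites per second")  # prints "3.86 sites per second"
```

Any real crawler would need headroom above this average to cover retries, slow hosts, and maintenance windows.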
We need a custom search results algorithm to be designed and implemented. Please see the notes regarding the algorithm below.
Our homepage should be very similar to Google's: text only, with no pictures, and it should load extremely fast even for users on slow dial-up connections.
We need a program similar to Google AdWords to be created and implemented. It should be based on pay per click, with users bidding for top positions. This program should be fully self-operated. We need ALL of the features that Google AdWords has. We understand that this is an extremely large project in itself. This program alone is budgeted at over $13,000.
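As a minimal sketch of the "bid for top positions" idea only, the snippet below orders ads for a keyword by their maximum cost-per-click bid. The `Bid` class, the `rank_ads` function, and the three-slot limit are hypothetical names and values chosen for illustration; this is not how Google AdWords itself prices or ranks ads.

```python
from dataclasses import dataclass


@dataclass
class Bid:
    advertiser: str
    keyword: str
    max_cpc_cents: int  # the most this advertiser will pay per click


def rank_ads(bids: list[Bid], keyword: str, slots: int = 3) -> list[Bid]:
    """Return the highest bids for a keyword, best position first."""
    matching = [b for b in bids if b.keyword == keyword]
    # sorted() is stable, so equal bids keep their submission order.
    ranked = sorted(matching, key=lambda b: b.max_cpc_cents, reverse=True)
    return ranked[:slots]


bids = [
    Bid("alpha", "flights", 40),
    Bid("bravo", "flights", 75),
    Bid("charlie", "hotels", 90),
    Bid("delta", "flights", 55),
]
top = rank_ads(bids, "flights")
print([b.advertiser for b in top])  # prints ['bravo', 'delta', 'alpha']
```

A full program would also need account balances, click tracking, and fraud checks, which are outside this sketch.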
We need an admin backend where we can manipulate the search results, place sites on a blacklist, and manually add sites to or remove sites from our index.
We need a page where webmasters can submit their site for our crawler to visit. This feature needs to be built into the crawler.
We need a premium page where a website can pay $100 for a crawl to be completed within 3 hours of payment verification. This feature also needs to be built into the crawler.
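One way the free and premium submissions could share a single crawl queue is sketched below. Paid submissions always come out first and carry a deadline three hours after payment verification. The `CrawlQueue` class and its method names are hypothetical, and the approach is only an assumption about how a developer might structure it.

```python
import heapq
import itertools
from datetime import datetime, timedelta

PREMIUM_DEADLINE = timedelta(hours=3)  # from the requirement above


class CrawlQueue:
    """Priority queue: paid submissions first, then oldest first."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # unique tie-breaker

    def submit(self, url, submitted_at, paid=False):
        tier = 0 if paid else 1  # lower tier is popped first
        due = submitted_at + PREMIUM_DEADLINE if paid else None
        entry = (tier, submitted_at, next(self._counter), url, due)
        heapq.heappush(self._heap, entry)

    def next_job(self):
        """Return (url, deadline); deadline is None for free submissions."""
        _tier, _when, _seq, url, due = heapq.heappop(self._heap)
        return url, due


queue = CrawlQueue()
verified = datetime(2024, 1, 1, 12, 0)
queue.submit("http://free.example", datetime(2024, 1, 1, 9, 0))
queue.submit("http://paid.example", verified, paid=True)
print(queue.next_job())  # the paid site, due at 15:00 the same day
```

For paid jobs, `submitted_at` is meant to be the payment verification time, so the deadline is measured from the right moment.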
We will be using four load-balanced, extremely high-end servers to handle the load that the site will eventually generate. We currently have about 4,000 gigabytes of storage on SCSI drives in a RAID 5 array, Quad Xeon 2.8 HT processors, and dedicated unmetered bandwidth connections.
You MUST be in DAILY contact with us via AIM or Yahoo Messenger with reports of what you have completed.
We need direct phone numbers where we can contact you.
Please let us know how many developers you will assign to the project.
All payments will be via [url removed, login to view] and we will cover all escrow fees. Payments will be released in stages. We will give 10% upfront after you design our very simple text homepage, and then 10% will be released every time you complete 10% of the project, for a total of 10 payments. This is the ONLY method we will agree to.
Special Search Algorithm Information…
This is probably the BIGGEST part of the entire project. We need a special algorithm to be designed to rank our search results based on some of the same principles that Google uses. In addition, our spider should be able to detect spam sites, hate sites, link farms, and FFA (free-for-all) link pages, and automatically put them on our blacklist so that they are excluded from our search results. Below is about the only information that Google will release regarding its search results ranking system. Our algorithm should be based on these same main principles.
[url removed, login to view]
The software behind Google's search technology conducts a series of simultaneous calculations requiring only a fraction of a second. Traditional search engines rely heavily on how often a word appears on a web page. Google uses PageRank™ to examine the entire link structure of the web and determine which pages are most important. It then conducts hypertext-matching analysis to determine which pages are relevant to the specific search being conducted. By combining overall importance and query-specific relevance, Google is able to put the most relevant and reliable results first.
PageRank Technology: PageRank performs an objective measurement of the importance of web pages by solving an equation of more than 500 million variables and 2 billion terms. Instead of counting direct links, PageRank interprets a link from Page A to Page B as a vote for Page B by Page A. PageRank then assesses a page's importance by the number of votes it receives.
PageRank also considers the importance of each page that casts a vote, as votes from some pages are considered to have greater value, thus giving the linked page greater value. Important pages receive a higher PageRank and appear at the top of the search results. Google's technology uses the collective intelligence of the web to determine a page's importance. There is no human involvement or manipulation of results, which is why users have come to trust Google as a source of objective information untainted by paid placement.
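The "votes weighted by the voter's own importance" idea described above can be sketched with a simple iterative calculation. This is a small illustration and not Google's implementation: the damping factor of 0.85, the fixed iteration count, and the handling of pages with no outgoing links are assumptions on our part, and the function name is made up.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iteratively estimate page importance from a link graph.

    links maps each page to the list of pages it links to.
    damping and iterations are assumed values for this sketch.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Each link is a vote worth a share of the voter's rank.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Assumption: a page with no links votes for everyone equally.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


ranks = pagerank({"A": ["C"], "B": ["C"], "C": ["A"]})
print(sorted(ranks, key=ranks.get, reverse=True))  # prints ['C', 'A', 'B']
```

Page C ranks highest because it receives two votes, and page A outranks page B because its single vote comes from the important page C.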
Hypertext-Matching Analysis: Google's search engine also analyzes page content. However, instead of simply scanning for page-based text (which can be manipulated by site publishers through meta-tags), Google's technology analyzes the full content of a page and factors in fonts, subdivisions and the precise location of each word. Google also analyzes the content of neighboring web pages to ensure the results returned are the most relevant to a user's query.
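As a rough illustration of scoring a word by where it appears on the page, the sketch below counts query terms and weights each match by the section it was found in. The section names and weights are hypothetical values picked for the example; Google does not publish its actual factors.

```python
# Hypothetical weights: a match in the title counts more than one in the body.
FIELD_WEIGHTS = {"title": 3.0, "heading": 2.0, "body": 1.0}


def relevance(page_fields, query):
    """Sum weighted counts of each query term across a page's sections."""
    terms = query.lower().split()
    score = 0.0
    for field, text in page_fields.items():
        words = text.lower().split()
        weight = FIELD_WEIGHTS.get(field, 1.0)  # unknown sections count as body
        for term in terms:
            score += weight * words.count(term)
    return score


page = {
    "title": "Cheap flights",
    "body": "book cheap flights and cheap hotels",
}
print(relevance(page, "cheap"))  # prints 5.0
```

A final ranking would combine a relevance score like this with an importance score for the page, as the Google description above outlines.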
We understand that building a search engine as advanced as Google would cost more than $100,000,000 US dollars, and obviously that is not our budget. We want a search engine that looks similar to Google, has all of the same sites that Google has indexed, has an advertising program similar to Google AdWords, and can run self-sufficiently with little admin intervention.
THIS IS AN EXTREMELY LARGE PROJECT…PLEASE BID ACCORDINGLY.
---PERSONALIZED BIDS ONLY PLEASE---