When it comes to the World Wide Web, there are over 800 million pages full of information about different things but people are only able to navigate to about half of these pages when they use Internet search engines according to a new study by a team of computer scientists. With regard to indexing the Web, search engines are making less of an effort to do so. Considering one institute, a computer and communications firm owns it.
A third of all sites were hit by the best search engine and this was discovered by researchers after conducting similar study at the end of 1997 and they were also able to find out that 60 percent of the Web could collectively be covered by the top six search engines. There was a report last February that came from a well known journal saying that what was found was just 42 percent of all sites in a test of 11 top search engines and there was no program that was able to cover more than about 16 percent of the Web. To find search engine marketing australia information see this resource.
Promised by the Web was an effort to equalize access to information but because search engines often tend to index the sites that have more links to them people are able to view these popular sites and not those which may carry loads of relevant data.
At first it was estimated that the amount of Internet information and content resulted to around 320 million pages but there is more that needs to be patrolled since just 14 months later they found out that the number of pages was more than double of their first estimate. Generally, there is 6 trillion bytes of information on the Web but from the library of congress comes 20 trillion bytes. Based from the results gathered by researchers from the random surfing exercise of 2,500 Web sites they did, there were close to 3 million publicly available servers with 289 pages per server.
Still they said that the amount of information available on the Net could be larger because just a few sites may have millions of pages. A series of tests were done on the servers and from these they found out that 2 percent contained pornographic material, 2 percent were personal Web pages, about 83 percent of them contained commercial content company Web pages and catalogues, 6 percent had information about science and education, and 3 percent contained health information. It is not because of the volume but the techniques utilized by search engines that make so much of the Web hard to find. If you want more comprehensive info on online marketing sydney that site will help you.
When it comes to locating pages what the search providers use are user registration and following links and these are their two main methods. A biased sample of the Web is what search engines make according to researchers for they use links to find new pages in turn leading them to find and index pages that have more links to them. In this case, the problem is not about having a lack of the ability to do the indexing, the problem is when resources are made to have other uses for users including valuable services such as free email for example.
There are a lot of people who do not notice what they are seeing and this is because a number of them only make simple information requests according to a search engine expert. What is expected is that this imbalance in cataloguing will go on for some years especially due to the fact that the rate of increase in computer resources will generally be faster as compared with the production of information content by humans to be posted on new sites.




Comments on this entry are closed.