Development Trends of Search Engines

Technical Development Trends of Search Engines

After several years of development and exploration, search engines have moved closer to what users actually need, and the underlying technology has advanced considerably. Recent technical developments fall into the following areas:

First, improving the search engine's understanding of users' queries.

To understand users' queries better, a search engine needs a good query language. Natural-language question answering emerged to overcome the shortcomings of keyword retrieval and directory browsing. Users can enter a simple question, such as "How can I kill computer viruses?". After analyzing the structure and content of the question, the search engine either answers it directly or asks the user to choose again from several candidate questions. Natural language has two advantages: first, it makes interaction over the network more human; second, it makes querying more convenient, direct, and effective. Consider the example above. With keyword search, most people would search for the word "virus", and the results would inevitably include a great deal of irrelevant material, such as descriptions of various viruses and explanations of how viruses are created. With the question "How can I kill a computer virus?", the search engine returns information on how to kill viruses, which improves retrieval efficiency.

Second, processing the retrieval results.

1) Search engine based on link evaluation.

The leading example of a search engine based on link evaluation is Google. Its original "link evaluation system" rests on the idea that the importance of a web page depends on how many other web pages link to it, and especially on links from pages that are themselves deemed "important". This evaluation system closely resembles the idea behind the science citation index. However, because the Internet develops in a commercial environment, the number of links a website receives is closely tied to its commercial promotion, so this evaluation system lacks objectivity to some extent.
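
The idea that a page is important when important pages link to it can be illustrated with a small iterative calculation. The following is a minimal sketch in the spirit of link evaluation, not Google's actual implementation. The function name `link_scores`, the damping value 0.85, and the fixed iteration count are assumptions made for illustration; none of them come from the text above.

```python
def link_scores(links, damping=0.85, iterations=50):
    """Estimate page importance from a link graph.

    links maps each page to the list of pages it links to.
    The damping value and iteration count are illustrative assumptions.
    """
    # Collect every page that appears as a source or as a link target.
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    scores = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page starts each round with a small baseline score.
        new_scores = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its score on, split evenly among its links.
                share = damping * scores[page] / len(targets)
                for target in targets:
                    new_scores[target] += share
            else:
                # A page with no outgoing links spreads its score over all pages.
                share = damping * scores[page] / n
                for other in pages:
                    new_scores[other] += share
        scores = new_scores
    return scores


if __name__ == "__main__":
    graph = {"a": ["c"], "b": ["c"], "c": ["a"], "d": ["c"]}
    for page, score in sorted(link_scores(graph).items(), key=lambda kv: -kv[1]):
        print(page, round(score, 3))
```

In this toy graph, page "c" ranks first because three pages link to it, and page "a" ranks above "b" and "d" because its single incoming link comes from the highly ranked "c".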

2) Search engine based on visit popularity.

The representative search engine based on popularity is Direct Hit. Its basic idea is that the websites most people choose to visit are the most important ones. It looks at which websites thousands of users actually selected from earlier search results and how much time they spent on those websites, and from these statistics it ranks the importance of related websites to determine which ones best meet users' search needs. It therefore has a typical herd-following character. This evaluation system has the same shortcomings as the search engine based on link evaluation.
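
A popularity ranking of this kind can be sketched from a click log. The example below is only an illustration of the idea, not Direct Hit's method: the log format, the function name `popularity_ranking`, and the way visits and time spent are combined (`time_weight`) are all assumptions.

```python
from collections import defaultdict


def popularity_ranking(click_log, time_weight=0.01):
    """Rank sites by how often users chose them and how long they stayed.

    click_log is an iterable of (url, seconds_spent) pairs taken from
    earlier search sessions. The scoring formula is an assumption:
    score = number of visits + time_weight * total seconds spent.
    """
    visits = defaultdict(int)
    dwell_seconds = defaultdict(float)
    for url, seconds in click_log:
        visits[url] += 1
        dwell_seconds[url] += seconds

    scores = {url: visits[url] + time_weight * dwell_seconds[url] for url in visits}
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores, key=lambda url: (-scores[url], url))


if __name__ == "__main__":
    log = [
        ("a.example", 30),
        ("b.example", 300),
        ("a.example", 20),
        ("c.example", 5),
    ]
    print(popularity_ranking(log))
```

Because the score depends only on what earlier users did, a site that is already popular keeps attracting clicks, which is the herd effect described above.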

3) Remove extra redundant information from the retrieval results.

Some surveys point out that too much additional information increases the burden on users. To remove this excess information, retrieval technologies such as user customization and content filtering can be adopted.
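
As a rough illustration of content filtering combined with user customization, the sketch below drops results whose titles contain terms the user has blocked, removes repeated URLs, and moves the user's preferred sites to the front. The result format and the names `filter_results`, `blocked_terms`, and `preferred_sites` are hypothetical.

```python
def filter_results(results, blocked_terms=(), preferred_sites=()):
    """Filter and reorder search results according to user settings.

    results is a list of dicts with "url" and "title" keys (an assumed format).
    """
    seen_urls = set()
    kept = []
    for result in results:
        title = result["title"].lower()
        # Content filtering: skip results that mention a blocked term.
        if any(term.lower() in title for term in blocked_terms):
            continue
        # Redundancy removal: skip URLs already seen (ignoring a trailing slash).
        key = result["url"].rstrip("/")
        if key in seen_urls:
            continue
        seen_urls.add(key)
        kept.append(result)

    # User customization: preferred sites first; the sort is stable,
    # so the original order is kept within each group.
    kept.sort(key=lambda r: not any(site in r["url"] for site in preferred_sites))
    return kept


if __name__ == "__main__":
    hits = [
        {"url": "http://ads.example/virus", "title": "Buy antivirus now"},
        {"url": "http://help.example/remove", "title": "How to remove a virus"},
        {"url": "http://help.example/remove/", "title": "How to remove a virus"},
        {"url": "http://wiki.example/virus", "title": "Computer virus"},
    ]
    for hit in filter_results(hits, blocked_terms=["buy"], preferred_sites=["wiki.example"]):
        print(hit["url"])
```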

Third, determining the information collection scope of search engines and making them more targeted.

1) Vertical topic search engine

There is a huge amount of information on the Internet, and network resources are growing at a tenfold rate. It is difficult for a search engine to collect all the network information on every topic. Even if it did cover every topic, the breadth of topics would make it hard to be accurate and professional in each one, so the search results contain too much junk. As a result, vertical topic search engines, such as those for stocks, weather, and news, have earned a place among search engines through their high objectivity and professionalism. Because they are strongly targeted, users are highly satisfied with the query results. The author believes vertical topic search has great room for development.

2) Searching non-WWW information

Provide retrieval of FTP and other non-web information.

3) Multimedia search engine

Multimedia retrieval mainly includes sound and image retrieval.

Fourth, the technical development of search engines will focus on processing search results in order to provide better-optimized results.

1) Pure search engine

This kind of search engine has no information collection system of its own. It uses other parties' existing index databases and concentrates on the concepts, techniques, and mechanisms of retrieval.

2) Metasearch engine

There are many search engines today, and they differ in collection scope, search mechanism, and algorithm, so users have to learn how to use several of them. Each search engine covers only 30-50% of all WWW resources (Search Engine Watch data). As a result, fewer than 34% of the results that different search engines return for the same search request are duplicates, while the precision of each search engine is below 45%.
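
The two measures mentioned here can be made concrete with a short calculation. The source does not say exactly how its repetition rate was computed, so the sketch below assumes one common reading: the share of distinct results that two engines have in common. Precision is taken in the usual sense, the share of returned results that are relevant. The function names and sample data are illustrative.

```python
def overlap_rate(results_a, results_b):
    """Share of distinct results that both engines returned (assumed definition)."""
    a, b = set(results_a), set(results_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def precision(results, relevant):
    """Share of returned results that are actually relevant."""
    if not results:
        return 0.0
    return sum(1 for r in results if r in relevant) / len(results)


if __name__ == "__main__":
    engine_a = ["u1", "u2", "u3"]
    engine_b = ["u2", "u3", "u4", "u5"]
    print(overlap_rate(engine_a, engine_b))
    print(precision(["u1", "u2", "u3", "u4"], {"u2", "u4", "u9"}))
```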

A metasearch engine forwards the search request submitted by a user to multiple independent search engines, processes the returned results centrally and uniformly, and presents them to the user in a unified format, which is why it is called a search engine on top of search engines. Its main focus is on improving search speed, processing search results intelligently, and offering personalized search settings and a user-friendly search interface, and it has high recall and precision. At present, the more successful metasearch engines include MetaCrawler, Dogpile, and Ixquick.
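
The forwarding and merging steps can be sketched as follows. This is a simplified illustration, not the method used by any of the engines named above. Each underlying engine is represented by a hypothetical function that takes a query and returns a ranked list of URLs, and the merge rule (adding 1/rank for every engine that returns a URL) is an assumption chosen for simplicity.

```python
def metasearch(query, engines):
    """Send one query to several engines and merge the ranked results.

    engines maps an engine name to a callable that takes the query and
    returns a ranked list of URLs. Both the interface and the scoring
    rule (sum of 1/rank across engines) are illustrative assumptions.
    """
    scores = {}
    sources = {}
    for name, search in engines.items():
        for rank, url in enumerate(search(query), start=1):
            # A URL returned by several engines accumulates a higher score.
            scores[url] = scores.get(url, 0.0) + 1.0 / rank
            sources.setdefault(url, []).append(name)

    # Each URL appears once, in a single unified list, best score first.
    merged = sorted(scores, key=lambda url: (-scores[url], url))
    return [(url, sources[url]) for url in merged]


if __name__ == "__main__":
    fake_engines = {
        "engine_x": lambda q: ["u1", "u2", "u3"],
        "engine_y": lambda q: ["u2", "u4"],
    }
    for url, found_by in metasearch("kill computer virus", fake_engines):
        print(url, found_by)
```

Querying several engines widens coverage, which helps recall, while favoring URLs that more than one engine returns is one simple way to push likely relevant results toward the top.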