CN1879107B - Information retrieval based on historical data - Google Patents
Information retrieval based on historical data Download PDFInfo
- Publication number
- CN1879107B CN1879107B CN200480033254.8A CN200480033254A CN1879107B CN 1879107 B CN1879107 B CN 1879107B CN 200480033254 A CN200480033254 A CN 200480033254A CN 1879107 B CN1879107 B CN 1879107B
- Authority
- CN
- China
- Prior art keywords
- document
- described document
- data
- relevant
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0242—Determining effectiveness of advertisements
- G06Q30/0246—Traffic
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Strategic Management (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Finance (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Economics (AREA)
- Marketing (AREA)
- Game Theory and Decision Science (AREA)
- General Business, Economics & Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A system (125) identifies a document and obtains one or more types of history data associated with the document. The system (125) may generate a score for the document based, at least in part, on the one or more types of history data.
Description
Technical field
The present invention relates generally to information retrieval system, and more particularly, relate to at least part of historical data based on relevant with relevant documentation, generate the system and method for Search Results.
Background technology
WWW (" webpage ") comprises bulk information.Search engine helps user by the web document of cataloging, and locates the required part of this information.Conventionally, response user's request, search engine turns back to linking of the document relevant with this request.
Search engine can determining based on customer-furnished search terms (being called as search inquiry) user interest.The target of search engine is based on search inquiry, recognizes the link of high-quality correlated results.Typically, the data bank of the web document of the term of search engine in inquiring about by match search and pre-stored realizes this target.The web document that comprises user search terms is regarded as " hitting " and returns to user.
Ideally, search engine will respond designated user search inquiry, for user provides correlated results.A kind of search engine based on the comparison search inquiry term is identified relevant documentation with the word being included in document.Another kind of search engine use in document, exist search inquiry term because usually identifying relevant documentation.This search engine use with to or determine the relative importance of document from the relevant information of linking of document.
These two kinds of search engines make every effort to provide high-quality search query results.Existence can affect several factors of the outcome quality being generated by search engine.For example, number of site manufacturer raises their grade artificially by spam technology.Meanwhile, can make " expired " document (i.e. long-time those documents that do not upgrade, thereby comprise stale data) grade higher than " newer " document (be those documents of recent renewal, thereby the data that comprise renewal).Under some specific environments, the expired document of higher level has reduced Search Results.
Therefore, still need to improve the quality of the result being generated by search engine.
Summary of the invention
The system and method conforming to the principle of the present invention at least partly historical data based on relevant with document scores to document.This score can be used for improving the Search Results generating together with search inquiry.
According to the aspect conforming to principle of the present invention, provide a kind of method for the document of scoring.The method can comprise identification document and obtain one or more historical datas relevant with described document.The method may further include at least partly based on one or more historical datas, generates the score for described document.
According on the other hand, provide a kind of method for the document of scoring.The method can comprise the life-span of determining the connection data relevant with linked document, and the attenuation function in life-span based on this connection data, carrys out the document that classification links.
Brief description of the drawings
Comprise and form the exemplary embodiments of the invention of accompanying drawing of the part of this instructions, and in conjunction with instructions, explain the present invention.In the drawings:
Fig. 1 is the exemplary network graph that can realize the system and method conforming to principle of the present invention;
Fig. 2 is according to the realization conforming to principle of the present invention, the client computer of Fig. 1 and/or the exemplary plot of server;
Fig. 3 is according to the realization conforming to principle of the present invention, the exemplary functional block diagram of the search engine of Fig. 1; And
Fig. 4 is according to the realization that conforms to principle of the present invention, for the process flow diagram of the exemplary process of the document of scoring.
Embodiment
Following detailed description of the present invention is with reference to accompanying drawing.Same reference numbers in different figure can be identified same or similar element.Meanwhile, following detailed description does not limit the present invention.
The system and method conforming to principle of the present invention for example can use with described document the relevant historical data document of scoring.System and method can use these must assign to provide high-quality Search Results.
" document " as used in this, broad interpretation becomes to comprise any machine readable and the storable works of machine.Document can comprise Email, website, file, combination of files, have one or more files of linking with the embedding of alternative document, newsgroup's notice, blog, web advertisement etc.The in the situation that of the Internet, public document is webpage.Webpage generally includes text message and can comprise the information (such as metamessage, image, hyperlink etc.) of embedding and/or the instruction (such as java script etc.) embedding.Webpage can be corresponding to document or partial document.Therefore, word " webpage " or " document " can exchange use in some cases.In other cases, webpage can refer to partial document, such as subdocument.Webpage is also possible corresponding to more than single document.
In following description, can be to have to the link of other documents and/or from the link of other documents document description.For example, in the time that document is included in the link of another document, link can be called as " forward link ".In the time that document comprises the link from another document, this link can be called as " backward link ".In the time using term " link ", can refer to backward link or forward link.
The example of network structure
Fig. 1 is the exemplary diagram of network 100, wherein, can realize the system and method conforming to principle of the present invention.Network 100 can comprise the multiple client computer 110 that are connected to multiple server 120-140 through network 150.Network 150 can comprise LAN (Local Area Network) (LAN), wide area network (WAN), telephone network, such as network or the combination of network of public switched telephone network (PSTN), Intranet, internet, memory devices, another type.For simplicity, two client computer 110 and three server 120-140 are exemplified as and are connected to network 150.In fact, can there is more or less client-server.Meanwhile, in some instances, client computer can be carried out the function of server, and server can be carried out the function of client computer.
In the realization conforming to principle of the present invention, server 120 can comprise the search engine 125 that can be used by client computer 110.Server 120 can be taken off the data bank (for example webpage), index file of (crawl) document and the storage information relevant with document in taken off document library.The document that can be taken off by server 120 can be stored or safeguard to server 130 and 140.Although server 120-140 is illustrated as corpus separatum, another of one or more execution server 120-140 that also can server 120-140 or multiple function one or more.For example, to be embodied as individual server be possible to two or more server 120-140.Also server 120-140 single can be embodied as to two or more independence (and can be distributed) equipment.
Exemplary Client/server architecture
Fig. 2 is that the exemplary diagram of client computer or server entity (hereinafter referred to as " client/server entity "), can be corresponding to one or more client computer 110 and server 120-140 according to the realization conforming to principle of the present invention.Client/server entity can comprise bus 210, processor 220, primary memory 230, ROM (read-only memory) (ROM) 240, memory device 250, one or more input equipment 260, one or more output device 270 and communication interface 280.Bus 210 can comprise one or more wires, the communication between the parts of permission client/server entity.
As described in detail below, conform to principle of the present invention, client/server entity is carried out some search related operations.Client/server entity can respond to carry out and be included in computer-readable medium, such as the processor 220 of the software instruction in storer 230, and carries out these operations.Computer-readable medium can be defined as one or more physics or logical memory device and/or carrier wave.
Software instruction can, from another computer-readable medium, such as data storage device 250, or through communication interface 280, read in storer 230 from another equipment.The software instruction being included in storer 230 can make processor 220 carry out the process hereinafter described.In addition, can replace or realize the process conforming to principle of the present invention in conjunction with software instruction by hard-wired circuitry.Therefore the realization, conforming to principle of the present invention can be not limited to any particular combinations of hard-wired circuitry and software.
Exemplary search engine
Fig. 3 is according to the realization conforming to principle of the present invention, the exemplary functional block diagram of search engine 125.Search engine 125 can comprise document locator 310, historical parts 320 and grade parts 330.As shown in Figure 3, the one or more of document locator 310 and historical parts 320 can be connected to document information storehouse 340.Document information storehouse 340 can comprise and the information of for example previously taking off in by the addressable database of search engine 125, the document of index and storage is relevant.Historical data, as hereinafter described in more detail, can be associated with each document in document information storehouse 340.Historical data can be stored in document information storehouse 340 or other places.
Exemplary historical data
Document origination date
According to the realization conforming to principle of the present invention, document origination date can be used for generating (or amendment) score relevant with that document.Term " date " is widely used and can comprises thus time and date tolerance at this.As described below, existence can be used for determining several technology of document origination date.Some in these technology can be subject to expect that at them aspect the meaning of the Third Party Effect that improves the score relevant with document be " having deviation ".Other technologies bias free.The combination of any, these technology in these technology or other technologies can be used for determining the origination date of document.
Realize according to one, can be learned first or date of index file by search engine 125, determine the origination date of document.Search engine 125 can be by taking off, submit to from " outside " source to search engine 125 document (or its represent/general introduction), take off or the combination of index technology based on submitting to, or otherwise, find described document.In addition, can be found to first by search engine 125 date of the link of described document, determine the origination date of document.
Realize according to another, can be used as the expression of the origination date of document by the date of territory register documents.Realize according to another, can use at another document, such as in the combination of news article, newsgroup, email list or one or more these documents, the time of reference documents is inferred the origination date of document for the first time.Realize according to another, document at least comprises that the date of threshold number page can be used as the expression of the origination date of document.Realize according to another, can make the origination date of document equal server to deposit the timestamp relevant with described document of document.Other technologies, specifically do not mention at this, or technical combinations also can be used for determining or inferring the origination date of document.
Suppose the example that origination date is the document of yesterday that has by 10 backward link references.Described document can be scored higher than being the document before 10 years by the origination date that has of 100 backward link references, because the former link rate of growth is relatively higher than the latter by search engine 125.Although the spike speed of the growth of backward link number (spiky rate) can be by search engine 125 be used for the scoring factor of document, may be also to send out to attempt a direction of signal search engine 125 and send spam.Therefore, in this case, the score value that in fact search engine 125 can reduce document reduces the impact that sends spam.
Therefore, according to the realization conforming to principle of the present invention, search engine 125 can have been determined by the origination date of document the speed (for example mean value of the time per unit of the link number of some window creation of conduct based on since origination date or during that cycle) of the link that is created to described document.Then, can with this speed described document of scoring, for example, provide larger weight to the document that more often generates link.
In one implementation, search engine 125 can revise document based on link score value as follows:
H=
L/
log(F+2)
Wherein, H refers to the historical link score value of adjusting, the link score value providing for described document can be provided L, its can use based on to/carry out the link of document and be that document distributes any known links score technology of score value (for example, at U.S. patent No.6,285, score technology described in 999) derive, and F passing the time of can referring to measure from the origination date relevant with described document (or window) in this cycle.
For some inquiries, early document is more favourable than new.Therefore, can based on the difference (life-span aspect) of the mean lifetime of result set, adjust the score value of document.In other words, search engine 125 can be determined the life-span (for example using their origination date) of each document in result set, determine the mean lifetime of document, and difference between life-span and mean lifetime based on document, revise the score value of document (plus or minus).
Generally speaking, search engine 125 information based on relevant with the origination date of document at least partly, generates (or amendment) score value relevant with document.
Content update/change
According to the realization conforming to principle of the present invention, the relevant information of mode changing in time with document content can be used to generate (or amendment) score value relevant with that document.For example, the document score that its content is often edited is different from the document that its content remains unchanged in time.The score of the document that meanwhile, relatively many contents are upgraded in time can be different from the document that upgrades in time relatively small amount content.
In one implementation, search engine 125 can generating content to upgrade score (U) as follows:
U=f(UF,UA)
Wherein, f can refer to function, and such as summation or weighted sum, UF can refer to represent how long to upgrade the renewal frequency score of document (or webpage), and UA can refer to represent that document (or webpage) changes how many renewal amount scores in time.UF can determine in multiple modes, comprises the averaging time between renewal, the update times within the appointment time limit etc.
UA also can be defined as the function of one or more factors, such as " newly " relevant with document within a time cycle or the quantity of unique page.Another factor can comprise the ratio of relevant with the document new or quantity of unique page in a time cycle and the total page number relevant with that document.Another factor can be included in the quantity (n% of the content visible of for example document can change with cycle t (for example m month recently)) of upgrading document in one or more time cycles, and it can be mean value.Another factor can be included in (for example, in x days recently) in one or more time cycles, the quantity that document (or webpage) changes.
According to an exemplary realization, UA can be defined as the function of the different weights part of document content.For example, in the time determining UA, think if upgrade/change unessential content, such as java script, annotation, advertisement, navigation elements, model data or date/time label, give relatively little weight or ignore even completely.On the other hand, when determining when UA, for example think, if very important content is upgraded/changed to (often, more closely, more extensive etc.), such as the title relevant with forward link or anchor text, give to change higher weight than other guide.
UF and UA can affect the score value of distributing to document by other modes.For example, the change rate in the current time cycle and the change rate in another (for example, front) time cycle can be compared, determine to exist and accelerate or deceleration trend.The document that change rate increases can be more stable than change rate the score of those documents higher, quite high even if that changes variability.Change amount can be also the factor in this score.For example, in the time that change amount is greater than some threshold values, can score or change amount stable higher than change rate of the document that change rate increases is less than those documents of threshold value.
In some cases, in the time monitoring the content changing of document, data storage resource may be not enough to store those documents.In this case, search engine 125 can be stored the expression of document and monitor the variation of these expressions.For example, search engine 125 can be stored " signature " of document, replaces (whole) document itself to detect the change of document content.In this case, search engine 125 can be stored for the term vector of document (or webpage) and monitor the change that it is relatively large.Realize according to another, search engine 125 can be stored with supervision and is defined as important or the relative fraction (for example several terms) of document of (except " stopping word ") the most frequently occurs.
Realize according to another, search engine 125 can be stored general introduction or other expressions of document and monitor the variation of this information.Realize according to another, search engine 125 can generate the similarity hash (can be used for detecting more closely copying of document) for described document and monitor its variation.The variation of similarity hash can be regarded as representing the relatively large variation in its relevant documentation.In other are realized, can monitor by other technologies the variation of document.In the situation that there is enough data storage resources, can store and use whole document to determine variation, instead of some expressions of document.
To some inquiries, the document with nearest unaltered content can be more favourable than the document with the nearest content changing.Therefore can the score value based on adjusting document with the difference on average change date of result set may be, favourable.In other words, search engine 125 can be determined the last date changing of the content of each document in result set, determine the average change date of described document, and change date based on document and on average change the difference between the date, the score value (plus or minus) of document revised.
Generally speaking, search engine 125 is the relevant information of the mode based on changing in time with the content of document at least partly, generates (or amendment) score value relevant with document.For the very large document that comprises the content that belongs to multiple individual or companies, score value can be corresponding to each subdocument (, belonging to single individual or company or the content by its renewal).
Query analysis
According to the realization conforming to principle of the present invention, can generate (or change) score value relevant with document by one or more factors based on inquiry.For example, when document be included in Search Results concentrate time, one based on inquiry factor relate to the degree of selecting in time the document.In this case, search engine 125 can make user relatively often/score that day by day increases the document of selecting is higher than other documents.
Another factor based on inquiry can relate to the appearance in time of some search terms of occurring in inquiry.Cycle incrementally appears in inquiry specific search term collection in time.For example, " hot topic " title or the relevant term of division media event that with just become/ become popular may occur continually on the time cycle.In this case, the score that search engine 125 can make the document relevant with these search termses (or inquiry) is higher than the relevant document with these terms not.
Another factor based on inquiry can relate to the change in time of Search Results number by similar query generation.For example can represent popular title or division news by the remarkable increase of the Search Results number of similar query generation, and search engine 125 is increased and these scores of inquiring about relevant document.
Another factor based on inquiry can relate to the inquiry that keeps in time relatively constant but can cause the result changing in time.For example, the inquiry relevant with " world's MLB Slam championship " causes changing in time Search Results (for example relevant document control Search Results within year or year with particular team).This change can be monitored and be used for correspondingly to score document.
Another factor based on inquiry can relate to " expired " of the document returning as Search Results.Document is expired can, based on following factor, be increased etc. such as document creation date, anchor growth, the traffic, content change, forward direction/backward link.For some inquiries, document extremely important (if for example search frequently asked question (FAQ) file will be wished recent release very much) recently.Search engine 125 can be selected which document in Search Results by analysis user, learns which inquiry and changes recently most important.More particularly, search engine 125 can consider that user often likes grade lower than the early up-to-date document of document in Search Results more than having.In addition, if passage in time, particular document is for example included in the inquiry (for example " world series ") of paying close attention to most, to more specifically inquiring about in (" New York American "), so, factor that should be based on inquiry-by self or by as mentioned herein other-can be used for reducing seeming the score value of expired document.
In some cases, can more preferably consider expired document than upgrading document.Therefore,, in the time generating the score value that is used for described document, search engine 125 can consider to select in time the degree of the document.For example, if to given query, user tends to select than more inferior grade, relatively expired document of more high-grade renewal document in time, and this is adjusted the instruction of the score value of expired document by search engine 125 use.
Another factor based on inquiry can relate to document and appear at the degree in different Query Results.In other words, can monitor the inquiry entropy for one or more documents, and with the basis that acts on score.For example, if particular document as occurring for hitting of inconsistent query set, this can (although not necessarily) regard the signal that described document is spam as, in this case, search engine 125 is the lowland described document of scoring relatively more.
Generally speaking, search engine 125 can, at least partly based on one or more factors based on inquiry, generate (or amendment) score value relevant with document.
Based on the standard of link
According to the realization conforming to principle of the present invention, can generate (or amendment) score value relevant with document by one or more factors based on link.In one implementation, the factor based on link can relate to new url and comes across the date that document and existing link disappear.The appearance date of link can be that search engine 125 finds the first date of link or document package to contain the date (for example, finding date or its date of recent renewal of document by link) linking.The disappearance date of link can be that the document that comprises this link is deleted this link or the first date disappearing own.
These dates can by search engine 125 take off or index upgrade operating period determine.By this date as a reference, then, the time that search engine 125 can monitor the link of document changes behavior, such as in the time that link occurs or disappear, link occurs in time or the speed that disappears, at the appointed time how many links occur or disappear, exist tendency to occur that the existing link of new url or document disappears etc. during the cycle.
Use and/or change behavior from the time of the link of document, search engine 125 document of can correspondingly scoring.For example, new url quantity or speed downtrending in time (for example based on the nearest time cycle to the quantity of new url or the comparison of speed in the early time cycle) can inform that search engine 125 documents are expired by signal, in this case, search engine 125 can reduce the score value of document.On the contrary, according to particular case and realization, uptrending can signal be informed can be regarded as more relevant " up-to-date " document (for example up-to-date establishment or upgrade the document of its content).
The quantity that is increased in time/reduced by the backward link of analytical documentation (or page) or the variation of speed, search engine 125 can be derived document how new signal of interest.For example, if the curve reflection of gliding gradually for this analysis, this can signal and inform that document is expired (for example no longer renewal, importance reduce, replaced etc. by another document).
Realize according to one, analysis can be depended on the quantity of the new url of document.For example, the quantity that search engine 125 can monitor new url since finding document is first than the quantity of the new url of document in nearest n days.In addition, search engine 125 can be determined compared with the first life-span linking of finding, the life-span the earliest of up-to-date y% link.
For example object, suppose y=10 and before 100 days, found first two documents (being website) in this example.For the first website, find that 10% link was less than before 10 days, and for the second website, the link of discovery 0% was less than before 10 days (in other words, earlier finds them).In this case, measure and cause website A to be 0.1 and to be 0 to website B.Can suitably amplify tolerance.In another exemplary realization, can link analyzing and revise tolerance relatively in more detail of date distribution by execution.For example, can build model, whether prediction specific distribution represents the website website of (for example no longer upgrade, popular increase or minimizing, replacement etc.) of particular type.
Realize according to another, analysis can be depended on the weight of distributing to link.In this case, each link can carry out weighting by the function increasing with the freshness of link.Can be by the date of the appearance/change linking, link the date of the appearance/change of relevant anchor text, the appearance/change date that comprises this document linking with this and determine the freshness that links.If still relevant and good based on link, the constant theory of good link in the time that document upgrades, appearance/change date of the document that comprises link can be the better instruction of the freshness of link.For can't help document trickle uncorrelated part small editor and upgrade the freshness of each link, can test the marked change (changes of the variation of the greater part of for example document or many different pieces of document) of each renewal document, and correspondingly upgrade the freshness of (or not upgrading) link.
Can carry out weighting link by other modes.For example, can carry out weighting link by the document (for example governmental documents can give higher trust) based on there being many trusts to comprise link.Link also can have how many authorities (for example, to be similar at U.S. patent No.6, the mode described in 285,999 is determined authoritative document) to carry out weighting by the document based on comprising link.Link also can be used some other features of determining freshness, and the freshness of the document based on comprising this link is carried out weighting (document (for example Yahoo homepage) of for example frequent updating is deleted suddenly the link of document).
According to another technology, analysis can depend on that the life-span relevant with pointing to linking of document distributes.In other words, can determine the date of the link that is created to document and be input in the function of determining life-span distribution.The life-span that can suppose expired document distributes and will be different from very much the life-span distribution of new document.Therefore, search engine 125 can distribute the document of scoring in the life-span of part based on relevant with document.
The date that link occurs also can be used to detect " spam ", and wherein, the owner of document or their colleague are created to the link of themselves document for improving the object of the score value being distributed by search engine.Typically " rationally " document attracts backward link lentamente.The large peak value of backward number of links can inform that (for example CDC website is after breaking out such as SARS for concern phenomenon by signal, many links can develop by leaps and bounds), or link, buy link or obtain the link from document by exchange, and do not have the editor about generating link to judge, signal is attempted sending spam (to obtain higher level, thereby obtaining the more good position in Search Results) to search engine.The example that provides link and do not edit the document of judgement comprises visiting book, with reference to daily record with allow anyone " freely " that increases document links page.
Realize according to another, analysis can be depended on the date that link disappears.Many links disappear and can represent that these link document pointed expired (for example no longer upgrade or substituted by another document).For example, search engine 125 can monitor date that the one or more links of document disappear, the link number that disappears in window at the appointed time, or change and reduce to some other times of the link number of the document link/renewal of the document that comprises these links (or to), identify and can be regarded as expired document.Once determined that document is expired, in the time determining the score value of the document being pointed to by link, the link being included in that document can be ignored or be ignored by search engine 125.
Realize according to another, the life-span of the link of document can be not only depended in analysis, and can depend on the mobilism of link.So, search engine 125 can weighting be different from (for example reducing) and upgrades all the time and be linked to all the time the document of the different characteristic link of the document of intended target document except having very new link, having every day.In an exemplary realization, search engine 125 can be based in time window, for all release documentations, has to the score value of each document of the link of a document, generates the score value for the document.This another version can be based on document main update time, by minimizing/decay factor be included in integrated in.
Generally speaking, search engine 125 can, partly based on one or more factors based on link, generate (or amendment) score value relevant with document.
Anchor text
According to the realization conforming to principle of the present invention, the relevant information of mode changing in time with anchor text can be used for generating (or amendment) score value relevant with document.For example, can by with upgrade or expression that even focus changes as having had in document to the change in time of the relevant anchor text of linking of document.
In addition, if the content changing of document is different from significantly with it it and backwardly links relevant anchor text, so the territory relevant with document significantly (completely) change from predecessor.In the time that expire in territory and Tongfang is not bought this territory, this can occur.Because anchor text is considered to be a part for its peer link document pointed conventionally, territory can no longer manifest for the Search Results of inquiring about on title.This is less desirable result.
The method addressing this problem is the date that estimation domain changes its focus.This can be by determining that the date that the text of document significantly changes or the text of anchor text significantly changes completes.Then can ignore or ignore all-links and/or anchor text before that date.
The freshness of anchor text also can be used as the scoring factor of document.Appearance/change date that can be by for example anchor text, with appearance/change date of appearance/change date linking of anchor text dependent and/or peer link document pointed, determine the freshness of anchor text.If still relevant and good based on anchor text, the constant theory of good anchor text in the time that document upgrades, appearance/change date of the document being pointed to by link can be the good indicator of the freshness of anchor text.For can't help document trickle uncorrelated part trickle editor and upgrade the freshness of anchor text, can test the marked change (major part of for example document changes or the changes of many different pieces of document) of each renewal document and correspondingly upgrade the freshness of (or not upgrading) anchor text.
Generally speaking, search engine 125 is the relevant information of the mode based on changing in time with anchor text at least partly, generates (or amendment) score value relevant with document.
The traffic
According to the realization conforming to principle of the present invention, about the traffic relevant with document information in time can be used for generating (or amendment) score value relevant with document.For example, search engine 125 can monitor that one or more users are to the traffic of document or the time behavior of other " purposes ".The large reduction of the traffic can represent that document is expired (for example no longer upgrades or may be substituted by another document).
In one implementation, average traffic and the document of search engine 125 can more nearest j days (for example wherein j=30) document receive maximum traffics, alternatively, during the moon of adjusting by seasonal variations, or nearest for example, average traffic during k days (k=365).Alternatively, search engine 125 can be identified repeated communications amount pattern or traffic pattern over time.Can find to exist the more or less cycle of popular (for example thering is the more or less traffic) of document, such as during the moon in summer, weekend or during some other time cycles in season.By the variation of identification repeated communications amount pattern or traffic pattern, during search engine 125 can suitably be adjusted at these cycles or outside the score of document.
In addition, or, search engine 125 can monitor with for " the advertisement traffic " of particular document relevant time behavior.For example, search engine 125 can monitor one or more combinations of following factor: (1) in time, is presented or upgraded degree or the frequency of advertisement by specified documents; (2) gray quality (for example its advertisement with reference to/be linked to search engine 125 and know the document in time with relative high traffic and trust, can be provided those documents that point to low traffic/unreliable document than its advertisement such as the document of amazon.com, such as porn site higher weight relatively); And (3) advertisement is generated to the degree (for example their clicking rate) of the user traffic of their related documents.Search engine 125 can be with these time behaviors relevant with advertisement traffic document of scoring.
Generally speaking, search engine 125 is the information in time of the traffic based on about relevant with document at least partly, generates (or amendment) score value relevant with document.
User behavior
According to the realization conforming to principle of the present invention, can use the information corresponding to relevant with document in time individual or collective's user behavior, generate (or amendment) score value relevant with document.For example search engine 125 can monitor from Search Results and concentrate and select number of times and/or one or more user of a document to access the time quantum that described document spends.Then, search engine 125 can be at least partly based on this information described document of scoring.
If a certain inquiry is returned to document, and given identical or similar inquiry, in time or at the appointed time, in window, user is the average cost time more or less on the document, and this can be used as respectively the new or old expression of the document so.The have title document of " the Riverview plan of swimming " is returned in for example supposition inquiry " the Riverview plan of swimming ".Further supposition user cost is in the past accessed it in 30 seconds, visits it but select now each user of described document only to spend several seconds.Search engine 125 can determine that described document is old (comprising out-of-date swimming plan) the described document of correspondingly scoring by this information.
Generally speaking, search engine 125 can be at least partly based on relevant individual or the corresponding information of collective's user behavior with document in time, generate (or amendment) score value relevant with document.
Territory relevant information
According to the realization conforming to principle of the present invention, the information that relates to the territory relevant with document can be used for generating (or amendment) score value relevant with described document.For example, search engine 125 can monitor and for example, how to deposit the relevant information of document in computer network (internet, Intranet or other networks or document database), and with this information document of scoring.
The individual who attempts deception (transmission spam) search engine throwaways or " doorway (doorway) " territory conventionally, and attempts obtaining the traffic as much as possible before being booked.In the time of the score document relevant with these territories, can be used by search engine 125 about the information of the legitimacy in territory.
Can distinguish non-legal order and legitimate domains with some signal.For example territory can continue the cycle that reaches 10 years.Useful (legal) territory pays several years conventionally in advance, and territory, doorway (illegally) was only used more than 1 year.Therefore, the date in the time expiring in following territory can be used as predicting the legitimacy in territory, thus the prediction factor of the legitimacy of relevant document with it.
Equally, or, can be monitored to predict that for name server (DNS) record in territory whether territory is legal.Whom DNS record comprises and registered the details of the address of territory, administration and technology address and name server (server by domain name mapping as IP address).Be used for these data in time in territory by analysis, can identify non-legal order.For example, search engine 125 can monitor on the time cycle, and whether the correct address information of physics exists, and whether the contact details in territory change relatively continually, between different names server and host company, whether have quite a large amount of variation etc.In one implementation, can identify, store the inventory of known bad contact details, name server and/or IP address, and for predicting the legitimacy in territory, thereby the legitimacy of associated document predicted.
Equally, in addition, can be used for predicting the legitimacy in territory about life-span or other information of the name server relevant with territory." well " name server can have from the mixing of the not same area of different Registers and have the history in these territories of host, and " bad " name server can main host plants pornographic or territory, doorway, there is the territory (the general designator of spam) of business vocabulary or may be mainly maybe brand-new from the scattered territory of single Register.The freshness of name server can be non-automaticly for determining the negative factor of legitimacy of the domain of dependence, and can be in conjunction with other factors, all as described herein.
Generally speaking, search engine 125 information of the legitimacy in the territory based on about relevant with document at least partly, generates (or amendment) score value relevant with document.
Grade history
According to the realization conforming to principle of the present invention, can generate by the information relevant with the previous grade of document (or amendment) score value relevant with document.For example, search engine 125 can respond the search inquiry that offers search engine 125, monitors the time variation grades of document.Search engine 125 can determine that the document that grade is jumped in many inquiries may be subject document, or it may be to signal to attempt to send spam to search engine 125.
Therefore, can affect in the quantity moving aspect grade or speed the following score value of distributing to that document with document on the time cycle.In one implementation, for each set of Search Results, can carry out weighting document in the position in top n Search Results according to it.To N=30, an example function can be [((N+1)-SLOT/N)]
4.In this case, the first result can obtain 1.0 score value, to N result, drops to the score value that approaches 0.
Can repeat query set (for example business inquiry), and can obtain the document of grade more than M% by mark, or the percentage increase of grade is used as the signal of the score value that is identified for described document.For example, if above average (medium) score value of result relatively high and above result there is month by month sizable variation, search engine 125 can determine that inquiry is likely business.Search engine 125 also can monitor the instruction of inflow and outflow (churn) as business inquiry.To business inquiry, the possibility of spam is higher, and therefore, search engine 125 can correspondingly be processed relevant with it document.
Except being used to specify the history of position (or grade) of document of inquiry, search engine 125 can monitor (on the page, main frame, document and/or basis, territory) one or more other factors, such as in time document is chosen as seek the inquiry number of result and speed (increase/reduce), seasonal, sudden and in time document be selected as other patterns of Search Results and/or inquire about rightly for URL, score value is over time.
In addition, or search engine 125 can monitor in time, for example, with irrelevant document (URL) quantity of standard based on inquiry.For example, search engine 125 can monitor the mean scores in the top result set generating in response to given query or query set, and adjusts the result set that generates in response to given query or query set and/or the score value of other results.In addition, search engine 125 can monitor in time, is the number of results of ad hoc inquiry or query set generation.For example, if search engine 125 determines that number of results increases or rate of growth changes (expression that this increase can be " topical subject " or other phenomenons), search engine 125 can make those results higher in score in future.
In addition, or search engine 125 can monitor that level of documentation in time detects the unexpected peak value in level of documentation.Peak value can represent theme phenomenon (for example topical subject) or attempt by for example conclude the business or buy link and send spam to search engine 125.Search engine 125 can lag behind to allow with a certain rate increase grade by utilization, adopts the measure that prevents that spam from attempting.In another is realized, the grade of specified documents can be allowed to a certain max-thresholds increasing on schedule time window.As the further measure that the document relevant with theme phenomenon and spam document are distinguished, search engine 125 can, based on for example will not mentioning the theory of spam document in news, be considered the record of document in news article, discussion group etc.Can reduce spam with any one of these technology or combination attempts.
In addition, or search engine 125 can be considered as the remarkable decline of level of documentation that these documents " are not liked " or expired instruction.For example, if the grade of document declines in time significantly, search engine 125 can be considered as described document the expired and described document of correspondingly scoring so.
Generally speaking, search engine 125 information based on relevant with the previous grade of document at least partly, generates (or amendment) score value relevant with document.
The data that user safeguards/generates
According to the realization conforming to principle of the present invention, the data of can user safeguarding or generating generate (or amendment) score value relevant with document.For example, search engine 125 can monitor the data of being safeguarded or being generated by user, maybe can provide user to like or the data of the other types of some instructions of interested document such as " bookmark ", " hobby ".Search engine 125 directly (for example auxiliary through browser) or indirect (for example, through browser) obtains this data.Then, search engine 125 in time analytical documentation with it relevant multiple bookmark/hobbies determine the importance of document.
In another is realized, can represent that user increases the interest of particular document in time or the user data of the other types of minimizing can be made for the document of scoring by search engine 125.For example, " temporarily " relevant with user or buffer culture can be monitored by search engine 125, increase or reduce to identify the document adding in time.Similarly, the cookie data block relevant with particular document also can be monitored and be determined upwards still downward trend of the interest existence of document by search engine 125.
Generally speaking, the data that search engine 125 can be safeguarded or generate based on user at least partly, generate (or amendment) score value relevant with document.
Unique word in anchor text, two-dimensional grammar (bigram), phrase
According to the realization conforming to principle of the present invention, can use about the information of the unique word in anchor text, two-dimensional grammar, phrase and generate (or amendment) score value relevant with document.For example search engine 125 can monitor website (or link) figure and their behavior in time, and this information is used for to score, spam detection or other objects.Naturally the web graph of exploitation comprises independently judgement conventionally.The web graph of the synthetic generation of ordinary representation spam intention is based on coordinating judgement, causes the growth chart point relatively of anchor word/two-dimensional grammar/phrase.
A kind of reason of this spike can be a large amount of identical anchor having increased from many documents.Another possibility is the intentional different anchor having increased from multiple documents.Search engine 125 can monitor anchor the factor using them as their peer link of score document pointed.For example, search engine 125 can improve the impact of suspicious anchor on relevant documentation score value.In addition, search engine 125 can and be derived multiplication factor and convert for the score value of described document with the continuous conversion of the synthetic likelihood score generating.
Generally speaking, search engine 125 can be at least partly based on about with one or more information that link unique word, two-dimensional grammar and phrase in relevant anchor text of pointing to document, generate (or amendment) score value relevant with document.
The connection of independent peer-to-peer (peer)
According to the realization conforming to principle of the present invention, for example can use, about the information of the connection of independent peer-to-peer (irrelevant document) and generate (or amendment) score value relevant with document.
Have to the obvious independent peer-to-peer-input of a large amount of links and/or the unexpected growth of output quantity of each document and can represent potential fake site figure, it is the designator of attempting to send spam.If increase corresponding to being conventionally concerned with or inconsistent anchor text, can strengthen this instruction.When in the time using together with score technology based on linking, can be with the demote impact of these links of this information, for example, as scale-of-two judgement (fixed amount of score value being demoted) or a multiplication factor.
Generally speaking, search engine 125 information of the connection based on about independent peer-to-peer at least partly, generates (or amendment) score value relevant with document.
Document subject matter
According to the realization conforming to principle of the present invention, can generate by the information about document subject matter (or amendment) score value relevant with document.For example, search engine 125 can carry out theme extract (for example by sectional lists, URL analysis, content analysis, troop, the theme of summary, unique low-frequency word collection or some other types extracts).Then, search engine 125 can monitor the theme of document in time and by this information for the object of scoring.
The theme collection relevant with document marked change in time can represent that document has changed the owner and previous document designator, no longer reliable such as score value, anchor text etc.Similarly, the peak value in theme number can represent spam.For example, if particular document is with can be considered as one or more theme collection on " stable " time cycle relevant, then in the theme number relevant with described document, occur (suddenly) peak value, this can be the instruction that document is substituted by " doorway " document.Another instruction can comprise the disappearance of the initial theme relevant with document.If one or more these situations detected, so, search engine 125 can reduce the relative score value of these documents and/or link, anchor text or other data relevant with described document.
Generally speaking, search engine 125 variation of the one or more themes based on relevant with described document at least partly, generates (or amendment) score value relevant with document.
Exemplary process
Fig. 4 is according to the realization that conforms to principle of the present invention, for the process flow diagram of the exemplary process of the document of scoring.Processing can be from server 120 be identified document (action 410).Document can comprise for example relevant with search inquiry one or more documents, such as being identified as the document relevant with search inquiry.In addition, document can comprise the one or more documents (for example identifying and be stored in the document in storehouse by taking off network) in document information storehouse or the storehouse irrelevant with any search inquiry.
Then, search engine 125 can be at least partly based on historical data the identified document (action 430) of scoring.In the time that identified document is relevant with search inquiry, search engine 125 can for example have heterogeneous pass based on them and search inquiry, generates the relevance score for described document.Then, search engine 125 can combine to obtain the total score value for described document by historical score value and relevance score.Replace combination score value, search engine 125 can be revised the relevance score for described document based on historical data, thereby improves or reduce score value, or in some cases, makes score value identical.In addition, search engine 125 can be based on the historical data document of scoring, and does not generate relevance score.In either case, search engine 125 can be with of historical data type or the combination document of scoring.
In the time that identified document is relevant with search inquiry, search engine 125 also can form Search Results by score document.For example, search engine 125 can carry out ranking documents by the score value based on them.Then, search engine 125 can form the reference to these documents, wherein, for example, with reference to comprising the title (can comprise in the time selecting, user is directed to the hyperlink of this authentic document) of document and the segment (text extract) from document.In other are realized, can form differently reference.Search engine 125 can for example, present to by the reference corresponding to multiple high score documents (predetermined multiple documents, have the document that exceeds threshold scores, all documents etc.) user who submits search inquiry to.
Conclusion
The system and method conforming to principle of the present invention can be with score document form high-quality Search Results of historical data.
The foregoing description of the preferred embodiments of the present invention provides example and description, but does not intend get rid of or limit the invention to disclosed concrete form.Instruct in view of above-mentioned, amendment and improvement are possible, or can obtain from implementing the present invention.For example, although described a series of actions with reference to figure 4, in other that conform to principle of the present invention are realized, can revise sequence of movement.Meanwhile, can the uncorrelated action of executed in parallel.
In addition, conventionally describe server 120 and carry out the major part action of describing with reference to the processing of figure 4, if not all.In another conforming to principle of the present invention realized, can be by another entity, such as another server 130 and/or 140 or client computer 110 carry out one or more or everything.
To those skilled in the art, aspect of the present invention as above can be apparent with the many multi-form realization of the software in realization shown in the figure, firmware and hardware.The real software code or the special control hardware that are used for realizing the aspect that conforms to principle of the present invention are not restrictions of the present invention.Therefore, not with reference to specific software code in the situation that, describe operation and the behavior of these aspects, it will be appreciated that the explanation that the ordinary skill of this area can be based at this, design realizes software and the control hardware of these aspects.
Claims (44)
1. the score method of document, comprising:
Identification document;
Obtain the multiple historical data relevant with described document, described multiple historical data at least comprises:
About with the data of the origination date of described document associations, wherein said origination date is at least based on one of following:
Search engine learn first or index described in date of document,
Search engine is found to the date of the link of described document first, or
In another document first with reference to date of described document,
And wherein register date of described document and described document by territory and at least comprise that at least one in date of threshold number page can be used as described origination date;
The data that change in time about document content, the wherein said data that change in time about document content based on:
Renewal frequency, how long its content based on described document within cycle a period of time changes, and
Renewal amount, its content changing based on described document within cycle a period of time is how many; And
At least another kind of data, it is one of following that wherein said at least another kind of data at least comprise:
About the query analysis data of one or more formerly search inquiries, wherein, for described one or more formerly search inquiries, described document is identified as Search Results;
About to or from the standard based on link of the behavior of the link of described document;
About with to the data that link associated anchor text of described document;
About with the data of the time behavior of the advertisement traffic of described document associations;
About the user behavior data of described document;
About with the territory related data of the legitimacy in the territory of described document associations;
About the data of the grade history of described document;
The data of safeguarding or generating with the user of described document associations, wherein, the data that user safeguards or generates with following at least one about: with a user or relevant favorites list, bookmark, temporary file and the buffer culture of multiple user;
About with the data of unique word, two-dimensional grammar or phrase in the anchor text being associated to linking of described document;
About the data of the connection of independent peer-to-peer, or
About with the data of the time dependent Document Title of described document associations; And
At least partly data based on about described origination date, the data that change in time about document content and generate the score value for described document with the described at least another kind of data of described document associations, wherein, generate score value and comprise:
Determine whether the data that described user safeguards or generates represent that user is interested in described document; And
Whether the data of safeguarding or generating based on user at least partly represent that user is interested in described document, the described document of scoring.
2. the method for claim 1, wherein described document comprises multiple documents; And
Wherein, the described document of scoring comprises:
Based on the origination date corresponding to document, determine the life-span of each document,
Based on the life-span of document, determine the mean lifetime of document; And
Difference between life-span and mean lifetime based on document at least partly, the described document of scoring.
3. the method for claim 1, wherein generate for the score value of described document and comprise: that measures based on the origination date from corresponding to described document at least partly passes the time, the described document of scoring.
4. the method for claim 1, wherein, how long changing of described document content is based at least one in following: the change frequency in averaging time, cycle a period of time between variation or the comparison of the rate of change in the current time cycle and the rate of change in the previous time cycle.
5. the method for claim 1, wherein, how much the change of described document content is based at least one in following: the number percent of the ratio of the new number of pages relevant with described document, the new number of pages relevant with described document and the total page number relevant with described document or the document content that changed during cycle a period of time within cycle a period of time.
6. the method for claim 1, wherein described renewal amount is determined based on following content:
The tolerance of the importance based on each several part, the differently different piece of document content described in weighting; And
The variable quantity of described document content is defined as to the function of the different weights part of described content.
7. the method for claim 1, wherein generating score value comprises:
At least partly based on described renewal amount, the described document of scoring.
8. the method for claim 1, wherein described multiple historical data comprises described query analysis data; And
Wherein, generating score value comprises:
In the time that described document is included in a Search Results and concentrates, determine the selecteed degree of described document in time; And
The selecteed degree of described document in time when being included in described Search Results when described document and concentrating at least partly, the described document of scoring.
9. method as claimed in claim 8, wherein, the described document of scoring comprises: in the time that document is more often selected described in other documents of concentrating than described Search Results on cycle a period of time, distribute more high score to described document.
10. the method for claim 1, wherein described multiple historical data comprises described query analysis data; And
Wherein, generating score value comprises:
Determine that whether described document is relevant with the search terms occurring with the frequency increasing along with the time in search inquiry; And
Whether relevant with search terms based on described document at least partly, the described document of scoring.
11. the method for claim 1, wherein described multiple historical data comprise described query analysis data; And
Wherein, generating score value comprises:
Determine described document whether with roughly remain unchanged in time but cause the inquiry of the result changing in time relevant; And
Whether relevant with the inquiry of the result that causes changing in time based on described document at least partly, the described document of scoring.
12. the method for claim 1, wherein described multiple historical data comprise described query analysis data; And
Wherein, generating score value comprises:
Determine that whether described document is expired; And
Whether expired based on described document at least partly, the described document of scoring.
13. methods as claimed in claim 12, wherein, the described document of scoring comprises:
In the time that definite described document is expired, determine whether to think that this expired document is conducive to search inquiry; And
At least partly based on whether think that this expired document is conducive to search inquiry, the described document of scoring in the time that definite described document is expired.
14. methods as claimed in claim 13, wherein, determine whether to think that expired document is conducive to search inquiry and is based, at least in part, on the time for search inquiry, how often to select expired documents recently on document.
15. the method for claim 1, wherein described multiple historical data comprise about the data of standard based on link; And
Wherein, generating score value comprises:
Determine the link behavior relevant with described document; And
The behavior that links based on relevant with described document at least partly, the described document of scoring.
16. methods as claimed in claim 15, wherein, link behavior is with to point at least one of one or more appearance that link of described document or disappearance relevant.
17. methods as claimed in claim 16, wherein, the appearance of one or more links with following at least one about: occur to date of the new url of described document, one or morely link the speed occurring in time or the one or more quantity that link that occur during cycle a period of time; And the disappearance of one or more links with following at least one about: link date of disappearing, one or morely link the speed disappearing in time or the one or more quantity that link that disappear during cycle a period of time to described document existing.
18. methods as claimed in claim 15, wherein, determine that the behavior that link relevant with described document comprises at least one supervision in following: the time that links relevant with described document change behavior, during cycle a period of time, occur or disappear how many relevant with described document link or the existing disappearance that link relevant with described document compared with whether there is the tendency appearance new url relevant with described document.
19. the method for claim 1, wherein described multiple historical data comprise about the data of standard based on link;
Wherein, generating score value comprises:
Determine the tolerance of the freshness that link relevant with described document;
Based on the tolerance of determined freshness, assign weight to link; And
At least part of based on distributing to the weight that link relevant with described document, the described document of scoring.
20. methods as claimed in claim 19, wherein, the tolerance of the freshness that link relevant with described document is based at least one in following: link date of occurring, link date of changing, with this link the appearance date of relevant anchor text, with this date that links the date of relevant anchor text variation, the date that comprises this chaiming file appearance linking or comprise this chaiming file variation linking.
21. methods as claimed in claim 19, wherein, the weight of distributing to link is based at least one in following: with the tolerance of the tolerance of the trust that comprises this document associations linking, the authoritative tolerance that comprises this document linking or the freshness that comprises this document linking.
22. methods as claimed in claim 19, wherein, score document comprises:
Determine the life-span of each link of pointing to described document;
Based on the life-span of link, determine and link relevant life-span distribution; And
At least part of based on distributing with linking the relevant life-span, the document of scoring.
23. the method for claim 1, wherein described multiple historical data comprise the data about anchor text; And
Wherein, generating score value comprises:
Identification with in the relevant anchor text of linking of described document over time; And
At least partly based on the variation that links relevant anchor text to described document, the described document of scoring.
24. the method for claim 1, wherein described multiple historical data comprise the data about anchor text; And
Wherein, generating score value comprises:
Determining whether document content changes is different from described content and links relevant anchor text to described document one or more; And
Whether at least part of content based on described document changes one or more relevant anchor text, the described documents of scoring of linking that described content are different from and arrive described document.
25. the method for claim 1, wherein described multiple historical data comprise the data about anchor text; And
Wherein, generating score value comprises:
Determine the tolerance with the one or more freshnesss that link relevant anchor text to described document; And
At least partly based on the tolerance of the one or more freshnesss that link relevant anchor text to described document, the described document of scoring.
26. methods as claimed in claim 25, wherein, be based at least one in following with the tolerance of the freshness to the relevant anchor text of linking of described document: the appearance date of anchor text, the change date of anchor text, with the appearance date linking of anchor text dependent, with the appearance date of the change date linking of anchor text dependent, described document or the change date of described document.
27. the method for claim 1, wherein described multiple historical data further comprise the data about the time behavior of the document traffic; And
Wherein, generating score value comprises:
Determine the characteristic of the traffic relevant with document; And
The characteristic of the traffic based on relevant with described document at least partly, the described document of scoring.
28. methods as claimed in claim 27, wherein, determine that the characteristic of the traffic relevant with described document comprises: analyze the traffic pattern relevant with described document so as identification communication amount pattern over time.
29. the method for claim 1, wherein described multiple historical data comprise user behavior data; And
Wherein, generating score value comprises:
Determine the user behavior relevant with document; And
User behavior based on relevant with document at least partly, the described document of scoring.
30. methods as claimed in claim 29, wherein, it is relevant that user behavior and the selecteed number of times of document in search result set and one or more user access at least one in the time quantum that described document spends.
31. the method for claim 1, wherein described multiple historical data comprise territory related data; And
Wherein, generating score value comprises:
Analyze corresponding to the territory relevant with document territory relevant information in time; And
At least partly based on analysis result, the described document of scoring.
32. methods as claimed in claim 31, wherein, the described document of scoring comprises:
Determine that whether the territory relevant with described document be legal; And
Whether the territory based on relevant with described document is legal at least partly, the described document of scoring.
33. methods as claimed in claim 31, wherein, territory relevant information with following at least one about: the expiration date in territory, the name server relevant with territory record or with territory relevant name server.
34. the method for claim 1, wherein described multiple historical data comprise the data about grade history; And
Wherein, generating score value comprises:
Determine the previous grade history of described document; And
Previous grade history based on described document at least partly, the described document of scoring.
35. methods as claimed in claim 34, wherein, the described document of scoring comprises:
Determine at the above document of cycle a period of time in the quantity moving aspect grade or speed; And
At least partly based on described document in the quantity moving aspect grade or speed, the described document of scoring.
36. methods as claimed in claim 34, wherein, previously grade history was based at least one in following: described document is selected as the inquiry quantity of Search Results in time, described document is selected as speed, the seasonality, sudden or right to URL inquiry of Search Results in time, and score value over time.
37. methods as claimed in claim 34, wherein, the previous grade history of determining document comprises and monitors the grade peak value of level of documentation in time.
The 38. described documents of the method for claim 1, wherein scoring comprise:
Analyze the data that user in time safeguards or generates, identify at least one in following: increase or shift out the trend of document, described document are increased to that user safeguards or the data that generate or the speed therefrom shifting out or described document are increased to data that user safeguards or generate, safeguard or generate from user data are deleted or by user safeguard or generated data accessed; And
At least partly based on analysis result, the described document of scoring.
39. the method for claim 1, wherein described multiple historical data comprise the data about anchor text; And
Wherein, generating score value comprises:
Determine and the one or more growth charts that link relevant anchor text that arrive described document; And
At least partly based on one or more growth charts that link relevant anchor text to described document, the described document of scoring.
40. the method for claim 1, wherein described multiple historical data comprise the data relevant with the connection of independent peer-to-peer; And
Wherein, generating score value comprises:
Determine the quantity growth of the independent peer-to-peer of the link that is included in described document; And
Quantity based on independent peer-to-peer at least partly, the described document of scoring.
41. the method for claim 1, wherein described multiple historical data comprise the data about document subject matter; And
Wherein, generating score value comprises:
Carrying out the theme relevant with described document extracts;
Monitor document subject matter over time; And
Variation based on document subject matter at least partly, the described document of scoring.
42. the method for claim 1, further comprise:
Obtaining search inquiry, wherein, is relevant with this search inquiry by identified document recognition; And
There is heterogeneous pass based on described document and search inquiry, generate the relevance score for described document; And
Wherein, generate the score value that is used for described document at least partly based on described multiple historical data and relevance score.
43. 1 kinds of systems for the document of scoring, comprising:
For identifying the device of document;
For obtaining the device of the multiple historical data relevant with described document, described multiple historical data at least comprises:
About with the data of the origination date of described document associations, wherein said origination date is at least based on one of following:
Search engine learn first or index described in date of document,
Search engine is found to the date of the link of described document first, or
In another document first with reference to date of described document,
And wherein register date of described document and described document by territory and at least comprise that at least one in date of threshold number page can be used as described origination date;
The data that change in time about document content, the wherein said data that change in time about document content based on:
Renewal frequency, how long its content based on described document within cycle a period of time changes, and
Renewal amount, its content changing based on described document within cycle a period of time is how many; And
At least another kind of data, it is one of following that wherein said at least another kind of data at least comprise:
About the query analysis data of one or more formerly search inquiries, wherein, for described one or more formerly search inquiries, described document is identified as Search Results;
About to or from the standard based on link of the behavior of the link of described document;
About with to the data that link associated anchor text of described document;
About with the data of the time behavior of the advertisement traffic of described document associations;
About the user behavior data of described document;
About with the territory related data of the legitimacy in the territory of described document associations;
About the data of the grade history of described document;
The data of safeguarding or generating with the user of described document associations, wherein, the data that user safeguards or generates with following at least one about: with a user or relevant favorites list, bookmark, temporary file and the buffer culture of multiple user;
About with the data of unique word, two-dimensional grammar or phrase in the anchor text being associated to linking of described document;
About the data of the connection of independent peer-to-peer, or
About with the data of the time dependent Document Title of described document associations; And
At least partly data, data that change in time about document content based on about origination date and generate the device for the score value of described document with the described at least another kind of data of described document associations, comprise for the device that generates score value:
For determining whether the data that described user safeguards or generates represent that user is to the interested device of described document, and
Whether represent that user is interested in described document, the device of the described document of scoring for the data of safeguarding or generating based on user at least partly.
44. 1 kinds of systems for the document of scoring, comprising:
Historical parts, are configured to obtain the multiple historical data relevant with document, and described multiple historical data comprises:
About with the wherein said origination date of data of the origination date of described document associations at least based on one of following:
Search engine learn first or index described in date of document,
Search engine is found to the date of the link of described document first, or
In another document first with reference to date of described document,
And wherein register date of described document and described document by territory and at least comprise that at least one in date of threshold number page can be used as described origination date;
The data that change in time about document content, the wherein said data that change in time about document content based on:
Renewal frequency, how long its content based on described document within cycle a period of time changes, and
Renewal amount, its content changing based on described document within cycle a period of time is how many; And
At least another kind of data, it is one of following that wherein said at least another kind of data at least comprise:
About the query analysis data of one or more formerly search inquiries, wherein, for described one or more formerly search inquiries, described document is identified as Search Results;
About to or from the standard based on link of the behavior of the link of described document;
About with to the data that link associated anchor text of described document;
About with the data of the time behavior of the advertisement traffic of described document associations;
About the user behavior data of described document;
About with the territory related data of the legitimacy in the territory of described document associations;
About the data of the grade history of described document;
The data of safeguarding or generating with the user of described document associations, wherein, the data that user safeguards or generates with following at least one about: with a user or relevant favorites list, bookmark, temporary file and the buffer culture of multiple user;
About with the data of unique word, two-dimensional grammar or phrase in the anchor text being associated to linking of described document;
About the data of the connection of independent peer-to-peer, or
About with the data of the time dependent Document Title of described document associations; And
Grade parts, are configured to:
At least partly data, data that change in time about document content based on about origination date and generate the score value for described document with the described at least another kind of data of described document associations, wherein, generate score value and comprise:
Determine whether the data that described user safeguards or generates represent that user is interested in described document, and
Whether the data of safeguarding or generating based on user at least partly represent that user is interested in described document, the described document of scoring.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US50761703P | 2003-09-30 | 2003-09-30 | |
US60/507,617 | 2003-09-30 | ||
US10/748,664 US7346839B2 (en) | 2003-09-30 | 2003-12-31 | Information retrieval based on historical data |
US10/748,664 | 2003-12-31 | ||
PCT/US2004/030000 WO2005033978A1 (en) | 2003-09-30 | 2004-09-15 | Information retrieval based on historical data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1879107A CN1879107A (en) | 2006-12-13 |
CN1879107B true CN1879107B (en) | 2014-10-15 |
Family
ID=34381362
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200480033254.8A Expired - Lifetime CN1879107B (en) | 2003-09-30 | 2004-09-15 | Information retrieval based on historical data |
Country Status (8)
Country | Link |
---|---|
US (19) | US7346839B2 (en) |
EP (5) | EP2416263A3 (en) |
JP (3) | JP2007507798A (en) |
CN (1) | CN1879107B (en) |
AU (1) | AU2004277678C1 (en) |
CA (2) | CA2757550A1 (en) |
DE (2) | DE202004021886U1 (en) |
WO (1) | WO2005033978A1 (en) |
Families Citing this family (546)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6883135B1 (en) | 2000-01-28 | 2005-04-19 | Microsoft Corporation | Proxy server using a statistical model |
US7398271B1 (en) * | 2001-04-16 | 2008-07-08 | Yahoo! Inc. | Using network traffic logs for search enhancement |
US7249034B2 (en) | 2002-01-14 | 2007-07-24 | International Business Machines Corporation | System and method for publishing a person's affinities |
US8590013B2 (en) | 2002-02-25 | 2013-11-19 | C. S. Lee Crawford | Method of managing and communicating data pertaining to software applications for processor-based devices comprising wireless communication circuitry |
US7693830B2 (en) * | 2005-08-10 | 2010-04-06 | Google Inc. | Programmable search engine |
US20070038614A1 (en) * | 2005-08-10 | 2007-02-15 | Guha Ramanathan V | Generating and presenting advertisements based on context data for programmable search engines |
US7716199B2 (en) * | 2005-08-10 | 2010-05-11 | Google Inc. | Aggregating context data for programmable search engines |
US7743045B2 (en) * | 2005-08-10 | 2010-06-22 | Google Inc. | Detecting spam related and biased contexts for programmable search engines |
US7130844B2 (en) * | 2002-10-31 | 2006-10-31 | International Business Machines Corporation | System and method for examining, calculating the age of an document collection as a measure of time since creation, visualizing, identifying selectively reference those document collections representing current activity |
US8042112B1 (en) | 2003-07-03 | 2011-10-18 | Google Inc. | Scheduler for search engine crawler |
US7725452B1 (en) * | 2003-07-03 | 2010-05-25 | Google Inc. | Scheduler for search engine crawler |
US8548995B1 (en) * | 2003-09-10 | 2013-10-01 | Google Inc. | Ranking of documents based on analysis of related documents |
US7505964B2 (en) | 2003-09-12 | 2009-03-17 | Google Inc. | Methods and systems for improving a search ranking using related queries |
US7346839B2 (en) | 2003-09-30 | 2008-03-18 | Google Inc. | Information retrieval based on historical data |
US7797316B2 (en) * | 2003-09-30 | 2010-09-14 | Google Inc. | Systems and methods for determining document freshness |
US7693827B2 (en) * | 2003-09-30 | 2010-04-06 | Google Inc. | Personalization of placed content ordering in search results |
US7231399B1 (en) | 2003-11-14 | 2007-06-12 | Google Inc. | Ranking documents based on large data sets |
US8521725B1 (en) | 2003-12-03 | 2013-08-27 | Google Inc. | Systems and methods for improved searching |
US8676790B1 (en) | 2003-12-05 | 2014-03-18 | Google Inc. | Methods and systems for improving search rankings using advertising data |
US7548968B1 (en) | 2003-12-10 | 2009-06-16 | Markmonitor Inc. | Policing internet domains |
US7302645B1 (en) | 2003-12-10 | 2007-11-27 | Google Inc. | Methods and systems for identifying manipulated articles |
US20050149388A1 (en) * | 2003-12-30 | 2005-07-07 | Scholl Nathaniel B. | Method and system for placing advertisements based on selection of links that are not prominently displayed |
US8655727B2 (en) | 2003-12-30 | 2014-02-18 | Amazon Technologies, Inc. | Method and system for generating and placing keyword-targeted advertisements |
US7676553B1 (en) * | 2003-12-31 | 2010-03-09 | Microsoft Corporation | Incremental web crawler using chunks |
US7461089B2 (en) * | 2004-01-08 | 2008-12-02 | International Business Machines Corporation | Method and system for creating profiling indices |
US8010459B2 (en) * | 2004-01-21 | 2011-08-30 | Google Inc. | Methods and systems for rating associated members in a social network |
US8577893B1 (en) * | 2004-03-15 | 2013-11-05 | Google Inc. | Ranking based on reference contexts |
US9104689B2 (en) * | 2004-03-17 | 2015-08-11 | International Business Machines Corporation | Method for synchronizing documents for disconnected operation |
US7584221B2 (en) * | 2004-03-18 | 2009-09-01 | Microsoft Corporation | Field weighting in text searching |
US7536382B2 (en) * | 2004-03-31 | 2009-05-19 | Google Inc. | Query rewriting with entity detection |
US7539674B2 (en) * | 2004-04-08 | 2009-05-26 | Yahoo! Inc. | Systems and methods for adaptive scheduling of references to documents |
US20050234877A1 (en) * | 2004-04-08 | 2005-10-20 | Yu Philip S | System and method for searching using a temporal dimension |
US20060010029A1 (en) * | 2004-04-29 | 2006-01-12 | Gross John N | System & method for online advertising |
US20050246391A1 (en) * | 2004-04-29 | 2005-11-03 | Gross John N | System & method for monitoring web pages |
US20050246358A1 (en) * | 2004-04-29 | 2005-11-03 | Gross John N | System & method of identifying and predicting innovation dissemination |
US20050256848A1 (en) * | 2004-05-13 | 2005-11-17 | International Business Machines Corporation | System and method for user rank search |
US7260573B1 (en) * | 2004-05-17 | 2007-08-21 | Google Inc. | Personalizing anchor text scores in a search engine |
US8019875B1 (en) | 2004-06-04 | 2011-09-13 | Google Inc. | Systems and methods for indicating a user state in a social network |
JP4254623B2 (en) * | 2004-06-09 | 2009-04-15 | 日本電気株式会社 | Topic analysis method, apparatus thereof, and program |
US7716225B1 (en) | 2004-06-17 | 2010-05-11 | Google Inc. | Ranking documents based on user behavior and/or feature data |
US7565445B2 (en) * | 2004-06-18 | 2009-07-21 | Fortinet, Inc. | Systems and methods for categorizing network traffic content |
US8832132B1 (en) | 2004-06-22 | 2014-09-09 | Google Inc. | Personalizing search queries based on user membership in social network communities |
US7783639B1 (en) | 2004-06-30 | 2010-08-24 | Google Inc. | Determining quality of linked documents |
US8621215B1 (en) | 2004-06-30 | 2013-12-31 | Google Inc. | Methods and systems for creating monetary accounts for members in a social network |
US8078607B2 (en) * | 2006-03-30 | 2011-12-13 | Google Inc. | Generating website profiles based on queries from webistes and user activities on the search results |
US20060020583A1 (en) * | 2004-07-23 | 2006-01-26 | Baranov Alexey V | System and method for searching and retrieving documents by their descriptions |
US7580921B2 (en) | 2004-07-26 | 2009-08-25 | Google Inc. | Phrase identification in an information retrieval system |
US7702618B1 (en) | 2004-07-26 | 2010-04-20 | Google Inc. | Information retrieval system for archiving multiple document versions |
US7711679B2 (en) | 2004-07-26 | 2010-05-04 | Google Inc. | Phrase-based detection of duplicate documents in an information retrieval system |
US7567959B2 (en) * | 2004-07-26 | 2009-07-28 | Google Inc. | Multiple index based information retrieval system |
US8015019B1 (en) | 2004-08-03 | 2011-09-06 | Google Inc. | Methods and systems for providing a document |
US7752200B2 (en) | 2004-08-09 | 2010-07-06 | Amazon Technologies, Inc. | Method and system for identifying keywords for use in placing keyword-targeted advertisements |
JP2006065395A (en) * | 2004-08-24 | 2006-03-09 | Fujitsu Ltd | Hyperlink generation device, hyperlink generation method, and hyperlink generation program |
US7987172B1 (en) | 2004-08-30 | 2011-07-26 | Google Inc. | Minimizing visibility of stale content in web searching including revising web crawl intervals of documents |
US7606793B2 (en) * | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US8065296B1 (en) * | 2004-09-29 | 2011-11-22 | Google Inc. | Systems and methods for determining a quality of provided items |
US7761448B2 (en) | 2004-09-30 | 2010-07-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US7739277B2 (en) * | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US7827181B2 (en) * | 2004-09-30 | 2010-11-02 | Microsoft Corporation | Click distance determination |
US20060069675A1 (en) * | 2004-09-30 | 2006-03-30 | Ogilvie John W | Search tools and techniques |
US8056128B1 (en) | 2004-09-30 | 2011-11-08 | Google Inc. | Systems and methods for detecting potential communications fraud |
US11283885B2 (en) | 2004-10-19 | 2022-03-22 | Verizon Patent And Licensing Inc. | System and method for location based matching and promotion |
JP2006146873A (en) * | 2004-10-22 | 2006-06-08 | Canon Inc | Data retrieval method, device, and program |
US7533092B2 (en) * | 2004-10-28 | 2009-05-12 | Yahoo! Inc. | Link-based spam detection |
US20060095841A1 (en) * | 2004-10-28 | 2006-05-04 | Microsoft Corporation | Methods and apparatus for document management |
US20060200487A1 (en) * | 2004-10-29 | 2006-09-07 | The Go Daddy Group, Inc. | Domain name related reputation and secure certificates |
US20080022013A1 (en) * | 2004-10-29 | 2008-01-24 | The Go Daddy Group, Inc. | Publishing domain name related reputation in whois records |
US20060095404A1 (en) * | 2004-10-29 | 2006-05-04 | The Go Daddy Group, Inc | Presenting search engine results based on domain name related reputation |
US20080028100A1 (en) * | 2004-10-29 | 2008-01-31 | The Go Daddy Group, Inc. | Tracking domain name related reputation |
US8904040B2 (en) * | 2004-10-29 | 2014-12-02 | Go Daddy Operating Company, LLC | Digital identity validation |
US7970858B2 (en) * | 2004-10-29 | 2011-06-28 | The Go Daddy Group, Inc. | Presenting search engine results based on domain name related reputation |
US7797413B2 (en) * | 2004-10-29 | 2010-09-14 | The Go Daddy Group, Inc. | Digital identity registration |
US20080028443A1 (en) * | 2004-10-29 | 2008-01-31 | The Go Daddy Group, Inc. | Domain name related reputation and secure certificates |
US8117339B2 (en) * | 2004-10-29 | 2012-02-14 | Go Daddy Operating Company, LLC | Tracking domain name related reputation |
US20060095459A1 (en) * | 2004-10-29 | 2006-05-04 | Warren Adelman | Publishing domain name related reputation in whois records |
US9015263B2 (en) | 2004-10-29 | 2015-04-21 | Go Daddy Operating Company, LLC | Domain name searching with reputation rating |
US7716206B2 (en) * | 2004-11-01 | 2010-05-11 | At&T Intellectual Property I, L.P. | Communication networks and methods and computer program products for performing searches thereon while maintaining user privacy |
US7584194B2 (en) * | 2004-11-22 | 2009-09-01 | Truveo, Inc. | Method and apparatus for an application crawler |
EP1831796A4 (en) | 2004-11-22 | 2010-01-27 | Truveo Inc | Method and apparatus for an application crawler |
WO2006055983A2 (en) * | 2004-11-22 | 2006-05-26 | Truveo, Inc. | Method and apparatus for a ranking engine |
US20060112089A1 (en) * | 2004-11-22 | 2006-05-25 | International Business Machines Corporation | Methods and apparatus for assessing web page decay |
US8874570B1 (en) | 2004-11-30 | 2014-10-28 | Google Inc. | Search boost vector based on co-visitation information |
US7801723B2 (en) * | 2004-11-30 | 2010-09-21 | Palo Alto Research Center Incorporated | Systems and methods for user-interest sensitive condensation |
US7827029B2 (en) * | 2004-11-30 | 2010-11-02 | Palo Alto Research Center Incorporated | Systems and methods for user-interest sensitive note-taking |
US20060122957A1 (en) * | 2004-12-03 | 2006-06-08 | Johnny Chen | Method and system to detect e-mail spam using concept categorization of linked content |
US7401077B2 (en) * | 2004-12-21 | 2008-07-15 | Palo Alto Research Center Incorporated | Systems and methods for using and constructing user-interest sensitive indicators of search results |
US7716198B2 (en) * | 2004-12-21 | 2010-05-11 | Microsoft Corporation | Ranking search results using feature extraction |
JP4344339B2 (en) * | 2004-12-24 | 2009-10-14 | 日本電信電話株式会社 | Information evaluation device, content search device, information evaluation method, content search method, program thereof, and recording medium |
US20060149710A1 (en) * | 2004-12-30 | 2006-07-06 | Ross Koningstein | Associating features with entities, such as categories of web page documents, and/or weighting such features |
US8538970B1 (en) * | 2004-12-30 | 2013-09-17 | Google Inc. | Personalizing search results |
US10402457B1 (en) | 2004-12-31 | 2019-09-03 | Google Llc | Methods and systems for correlating connections between users and links between articles |
US8060405B1 (en) | 2004-12-31 | 2011-11-15 | Google Inc. | Methods and systems for correlating connections between users and links between articles |
US8230422B2 (en) * | 2005-01-13 | 2012-07-24 | International Business Machines Corporation | Assist thread for injecting cache memory in a microprocessor |
US20060161520A1 (en) * | 2005-01-14 | 2006-07-20 | Microsoft Corporation | System and method for generating alternative search terms |
US20050125451A1 (en) * | 2005-02-10 | 2005-06-09 | The Go Daddy Group, Inc. | Search engine and domain name search integration |
US7657520B2 (en) * | 2005-03-03 | 2010-02-02 | Google, Inc. | Providing history and transaction volume information of a content source to users |
US20060200460A1 (en) * | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
US7792833B2 (en) * | 2005-03-03 | 2010-09-07 | Microsoft Corporation | Ranking search results using language types |
US8538810B2 (en) * | 2005-03-29 | 2013-09-17 | Google Inc. | Methods and systems for member-created advertisement in a member network |
US8412780B2 (en) | 2005-03-30 | 2013-04-02 | Google Inc. | Methods and systems for providing current email addresses and contact information for members within a social network |
US9256685B2 (en) * | 2005-03-31 | 2016-02-09 | Google Inc. | Systems and methods for modifying search results based on a user's history |
US20060224583A1 (en) * | 2005-03-31 | 2006-10-05 | Google, Inc. | Systems and methods for analyzing a user's web history |
US20060224608A1 (en) * | 2005-03-31 | 2006-10-05 | Google, Inc. | Systems and methods for combining sets of favorites |
US20060235842A1 (en) * | 2005-04-14 | 2006-10-19 | International Business Machines Corporation | Web page ranking for page query across public and private |
BRPI0610286A2 (en) * | 2005-04-18 | 2010-06-08 | Collage Analytics Llc | system and method for efficiently crawling and dating content in very large dynamic document spaces |
US8732175B2 (en) | 2005-04-21 | 2014-05-20 | Yahoo! Inc. | Interestingness ranking of media objects |
US7660792B2 (en) * | 2005-04-29 | 2010-02-09 | Microsoft Corporation | System and method for spam identification |
US7403767B2 (en) * | 2005-04-29 | 2008-07-22 | Siemens Aktiengesellschaft | Cellular telephone network with record keeping for missed calls |
US7765481B2 (en) | 2005-05-03 | 2010-07-27 | Mcafee, Inc. | Indicating website reputations during an electronic commerce transaction |
US8566726B2 (en) | 2005-05-03 | 2013-10-22 | Mcafee, Inc. | Indicating website reputations based on website handling of personal information |
US7562304B2 (en) | 2005-05-03 | 2009-07-14 | Mcafee, Inc. | Indicating website reputations during website manipulation of user information |
US9384345B2 (en) | 2005-05-03 | 2016-07-05 | Mcafee, Inc. | Providing alternative web content based on website reputation assessment |
US7822620B2 (en) | 2005-05-03 | 2010-10-26 | Mcafee, Inc. | Determining website reputations using automatic testing |
US8438499B2 (en) | 2005-05-03 | 2013-05-07 | Mcafee, Inc. | Indicating website reputations during user interactions |
US20060253423A1 (en) * | 2005-05-07 | 2006-11-09 | Mclane Mark | Information retrieval system and method |
US7630976B2 (en) * | 2005-05-10 | 2009-12-08 | Microsoft Corporation | Method and system for adapting search results to personal information needs |
US7962462B1 (en) * | 2005-05-31 | 2011-06-14 | Google Inc. | Deriving and using document and site quality signals from search query streams |
JP2006350954A (en) * | 2005-06-20 | 2006-12-28 | Chugoku Electric Power Co Inc:The | Telephone pole management system |
US7788132B2 (en) * | 2005-06-29 | 2010-08-31 | Google, Inc. | Reviewing the suitability of Websites for participation in an advertising network |
US8244722B1 (en) | 2005-06-30 | 2012-08-14 | Google Inc. | Ranking documents |
US8195654B1 (en) | 2005-07-13 | 2012-06-05 | Google Inc. | Prediction of human ratings or rankings of information retrieval quality |
US20070022385A1 (en) * | 2005-07-20 | 2007-01-25 | Mikhail Denissov | Software module, method and system for managing information items by bookmarking information items through activation of said items |
US7599917B2 (en) * | 2005-08-15 | 2009-10-06 | Microsoft Corporation | Ranking search results using biased click distance |
US7774335B1 (en) * | 2005-08-23 | 2010-08-10 | Amazon Technologies, Inc. | Method and system for determining interest levels of online content navigation paths |
KR100644159B1 (en) | 2005-09-05 | 2006-11-10 | 엔에이치엔(주) | Search controller control method and device |
US8099674B2 (en) * | 2005-09-09 | 2012-01-17 | Tableau Software Llc | Computer systems and methods for automatically viewing multidimensional databases |
US8244720B2 (en) * | 2005-09-13 | 2012-08-14 | Google Inc. | Ranking blog documents |
US20070239724A1 (en) * | 2005-09-14 | 2007-10-11 | Jorey Ramer | Mobile search services related to direct identifiers |
US8238888B2 (en) * | 2006-09-13 | 2012-08-07 | Jumptap, Inc. | Methods and systems for mobile coupon placement |
US20110313853A1 (en) | 2005-09-14 | 2011-12-22 | Jorey Ramer | System for targeting advertising content to a plurality of mobile communication facilities |
US9058406B2 (en) | 2005-09-14 | 2015-06-16 | Millennial Media, Inc. | Management of multiple advertising inventories using a monetization platform |
US20070060114A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Predictive text completion for a mobile communication facility |
US20070100806A1 (en) * | 2005-11-01 | 2007-05-03 | Jorey Ramer | Client libraries for mobile content |
US8156128B2 (en) | 2005-09-14 | 2012-04-10 | Jumptap, Inc. | Contextual mobile content placement on a mobile communication facility |
US20070073719A1 (en) * | 2005-09-14 | 2007-03-29 | Jorey Ramer | Physical navigation of a mobile search application |
US7702318B2 (en) | 2005-09-14 | 2010-04-20 | Jumptap, Inc. | Presentation of sponsored content based on mobile transaction event |
US20070060109A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Managing sponsored content based on user characteristics |
US20070192318A1 (en) * | 2005-09-14 | 2007-08-16 | Jorey Ramer | Creation of a mobile search suggestion dictionary |
US7769764B2 (en) | 2005-09-14 | 2010-08-03 | Jumptap, Inc. | Mobile advertisement syndication |
US8688671B2 (en) | 2005-09-14 | 2014-04-01 | Millennial Media | Managing sponsored content based on geographic region |
US8805339B2 (en) | 2005-09-14 | 2014-08-12 | Millennial Media, Inc. | Categorization of a mobile user profile based on browse and viewing behavior |
US20070100653A1 (en) * | 2005-11-01 | 2007-05-03 | Jorey Ramer | Mobile website analyzer |
US7752209B2 (en) * | 2005-09-14 | 2010-07-06 | Jumptap, Inc. | Presenting sponsored content on a mobile communication facility |
US9471925B2 (en) * | 2005-09-14 | 2016-10-18 | Millennial Media Llc | Increasing mobile interactivity |
US20070288427A1 (en) * | 2005-09-14 | 2007-12-13 | Jorey Ramer | Mobile pay-per-call campaign creation |
US8503995B2 (en) | 2005-09-14 | 2013-08-06 | Jumptap, Inc. | Mobile dynamic advertisement creation and placement |
US7577665B2 (en) | 2005-09-14 | 2009-08-18 | Jumptap, Inc. | User characteristic influenced search results |
US20070100805A1 (en) * | 2005-09-14 | 2007-05-03 | Jorey Ramer | Mobile content cross-inventory yield optimization |
US7603360B2 (en) * | 2005-09-14 | 2009-10-13 | Jumptap, Inc. | Location influenced search results |
US9201979B2 (en) | 2005-09-14 | 2015-12-01 | Millennial Media, Inc. | Syndication of a behavioral profile associated with an availability condition using a monetization platform |
US8302030B2 (en) | 2005-09-14 | 2012-10-30 | Jumptap, Inc. | Management of multiple advertising inventories using a monetization platform |
US7548915B2 (en) | 2005-09-14 | 2009-06-16 | Jorey Ramer | Contextual mobile content placement on a mobile communication facility |
US20070073722A1 (en) * | 2005-09-14 | 2007-03-29 | Jorey Ramer | Calculation and presentation of mobile content expected value |
US20070061246A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Mobile campaign creation |
US8666376B2 (en) * | 2005-09-14 | 2014-03-04 | Millennial Media | Location based mobile shopping affinity program |
US8131271B2 (en) | 2005-11-05 | 2012-03-06 | Jumptap, Inc. | Categorization of a mobile user profile based on browse behavior |
US20080214155A1 (en) * | 2005-11-01 | 2008-09-04 | Jorey Ramer | Integrating subscription content into mobile search results |
US7676394B2 (en) | 2005-09-14 | 2010-03-09 | Jumptap, Inc. | Dynamic bidding and expected value |
US8229914B2 (en) | 2005-09-14 | 2012-07-24 | Jumptap, Inc. | Mobile content spidering and compatibility determination |
US10592930B2 (en) | 2005-09-14 | 2020-03-17 | Millenial Media, LLC | Syndication of a behavioral profile using a monetization platform |
US20070061242A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Implicit searching for mobile content |
US20070073718A1 (en) * | 2005-09-14 | 2007-03-29 | Jorey Ramer | Mobile search service instant activation |
US8819659B2 (en) | 2005-09-14 | 2014-08-26 | Millennial Media, Inc. | Mobile search service instant activation |
US9703892B2 (en) | 2005-09-14 | 2017-07-11 | Millennial Media Llc | Predictive text completion for a mobile communication facility |
US9076175B2 (en) | 2005-09-14 | 2015-07-07 | Millennial Media, Inc. | Mobile comparison shopping |
US20070100650A1 (en) * | 2005-09-14 | 2007-05-03 | Jorey Ramer | Action functionality for mobile content search results |
US8209344B2 (en) | 2005-09-14 | 2012-06-26 | Jumptap, Inc. | Embedding sponsored content in mobile applications |
US8615719B2 (en) | 2005-09-14 | 2013-12-24 | Jumptap, Inc. | Managing sponsored content for delivery to mobile communication facilities |
US8515401B2 (en) | 2005-09-14 | 2013-08-20 | Jumptap, Inc. | System for targeting advertising content to a plurality of mobile communication facilities |
US8364521B2 (en) * | 2005-09-14 | 2013-01-29 | Jumptap, Inc. | Rendering targeted advertisement on mobile communication facilities |
US20080214153A1 (en) * | 2005-09-14 | 2008-09-04 | Jorey Ramer | Mobile User Profile Creation based on User Browse Behaviors |
US20070100651A1 (en) * | 2005-11-01 | 2007-05-03 | Jorey Ramer | Mobile payment facilitation |
US8812526B2 (en) | 2005-09-14 | 2014-08-19 | Millennial Media, Inc. | Mobile content cross-inventory yield optimization |
US7912458B2 (en) | 2005-09-14 | 2011-03-22 | Jumptap, Inc. | Interaction analysis and prioritization of mobile content |
US7660581B2 (en) * | 2005-09-14 | 2010-02-09 | Jumptap, Inc. | Managing sponsored content based on usage history |
US20080214154A1 (en) * | 2005-11-01 | 2008-09-04 | Jorey Ramer | Associating mobile and non mobile web content |
US10038756B2 (en) | 2005-09-14 | 2018-07-31 | Millenial Media LLC | Managing sponsored content based on device characteristics |
US8103545B2 (en) | 2005-09-14 | 2012-01-24 | Jumptap, Inc. | Managing payment for sponsored content presented to mobile communication facilities |
US8660891B2 (en) | 2005-11-01 | 2014-02-25 | Millennial Media | Interactive mobile advertisement banners |
US20080270220A1 (en) * | 2005-11-05 | 2008-10-30 | Jorey Ramer | Embedding a nonsponsored mobile content within a sponsored mobile content |
US20070061245A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Location based presentation of mobile content |
US7860871B2 (en) | 2005-09-14 | 2010-12-28 | Jumptap, Inc. | User history influenced search results |
US8364540B2 (en) | 2005-09-14 | 2013-01-29 | Jumptap, Inc. | Contextual targeting of content using a monetization platform |
US20080214151A1 (en) * | 2005-09-14 | 2008-09-04 | Jorey Ramer | Methods and systems for mobile coupon placement |
US8989718B2 (en) | 2005-09-14 | 2015-03-24 | Millennial Media, Inc. | Idle screen advertising |
US8290810B2 (en) * | 2005-09-14 | 2012-10-16 | Jumptap, Inc. | Realtime surveying within mobile sponsored content |
US20080215623A1 (en) * | 2005-09-14 | 2008-09-04 | Jorey Ramer | Mobile communication facility usage and social network creation |
US20070060173A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Managing sponsored content based on transaction history |
US20070100652A1 (en) * | 2005-11-01 | 2007-05-03 | Jorey Ramer | Mobile pay per call |
US8027879B2 (en) * | 2005-11-05 | 2011-09-27 | Jumptap, Inc. | Exclusivity bidding for mobile sponsored content |
US20070061247A1 (en) * | 2005-09-14 | 2007-03-15 | Jorey Ramer | Expected value and prioritization of mobile content |
US8311888B2 (en) | 2005-09-14 | 2012-11-13 | Jumptap, Inc. | Revenue models associated with syndication of a behavioral profile using a monetization platform |
US8832100B2 (en) * | 2005-09-14 | 2014-09-09 | Millennial Media, Inc. | User transaction history influenced search results |
US20070168354A1 (en) * | 2005-11-01 | 2007-07-19 | Jorey Ramer | Combined algorithmic and editorial-reviewed mobile content search results |
US20090029687A1 (en) * | 2005-09-14 | 2009-01-29 | Jorey Ramer | Combining mobile and transcoded content in a mobile search result |
US20080214204A1 (en) * | 2005-11-01 | 2008-09-04 | Jorey Ramer | Similarity based location mapping of mobile comm facility users |
US8195133B2 (en) | 2005-09-14 | 2012-06-05 | Jumptap, Inc. | Mobile dynamic advertisement creation and placement |
US20080214152A1 (en) * | 2005-09-14 | 2008-09-04 | Jorey Ramer | Methods and systems of mobile dynamic content presentation |
US10911894B2 (en) | 2005-09-14 | 2021-02-02 | Verizon Media Inc. | Use of dynamic content generation parameters based on previous performance of those parameters |
US20070073717A1 (en) * | 2005-09-14 | 2007-03-29 | Jorey Ramer | Mobile comparison shopping |
US7987251B2 (en) * | 2005-09-16 | 2011-07-26 | Microsoft Corporation | Validation of domain name control |
US7925786B2 (en) * | 2005-09-16 | 2011-04-12 | Microsoft Corp. | Hosting of network-based services |
US7499919B2 (en) * | 2005-09-21 | 2009-03-03 | Microsoft Corporation | Ranking functions using document usage statistics |
US20070078939A1 (en) * | 2005-09-26 | 2007-04-05 | Technorati, Inc. | Method and apparatus for identifying and classifying network documents as spam |
JP4241705B2 (en) * | 2005-09-30 | 2009-03-18 | ブラザー工業株式会社 | Information management apparatus and program |
US7933897B2 (en) | 2005-10-12 | 2011-04-26 | Google Inc. | Entity display priority in a distributed geographic information system |
US8095419B1 (en) * | 2005-10-17 | 2012-01-10 | Yahoo! Inc. | Search score for the determination of search quality |
US7613690B2 (en) * | 2005-10-21 | 2009-11-03 | Aol Llc | Real time query trends with multi-document summarization |
US8266162B2 (en) | 2005-10-31 | 2012-09-11 | Lycos, Inc. | Automatic identification of related search keywords |
US7783632B2 (en) * | 2005-11-03 | 2010-08-24 | Microsoft Corporation | Using popularity data for ranking |
US8175585B2 (en) * | 2005-11-05 | 2012-05-08 | Jumptap, Inc. | System for targeting advertising content to a plurality of mobile communication facilities |
US10324899B2 (en) * | 2005-11-07 | 2019-06-18 | Nokia Technologies Oy | Methods for characterizing content item groups |
US20100285818A1 (en) * | 2009-05-08 | 2010-11-11 | Crawford C S Lee | Location based service for directing ads to subscribers |
US8571999B2 (en) | 2005-11-14 | 2013-10-29 | C. S. Lee Crawford | Method of conducting operations for a social network application including activity list generation |
US9135304B2 (en) * | 2005-12-02 | 2015-09-15 | Salesforce.Com, Inc. | Methods and systems for optimizing text searches over structured data in a multi-tenant environment |
US8645376B2 (en) | 2008-05-02 | 2014-02-04 | Salesforce.Com, Inc. | Method and system for managing recent data in a mobile device linked to an on-demand service |
US8095565B2 (en) * | 2005-12-05 | 2012-01-10 | Microsoft Corporation | Metadata driven user interface |
IL172551A0 (en) * | 2005-12-13 | 2006-04-10 | Grois Dan | Method for assigning one or more categorized scores to each document over a data network |
US7971137B2 (en) * | 2005-12-14 | 2011-06-28 | Google Inc. | Detecting and rejecting annoying documents |
US20080010252A1 (en) * | 2006-01-09 | 2008-01-10 | Google, Inc. | Bookmarks and ranking |
US8117196B2 (en) * | 2006-01-23 | 2012-02-14 | Chacha Search, Inc. | Search tool providing optional use of human search guides |
US7962466B2 (en) * | 2006-01-23 | 2011-06-14 | Chacha Search, Inc | Automated tool for human assisted mining and capturing of precise results |
US20070174258A1 (en) * | 2006-01-23 | 2007-07-26 | Jones Scott A | Targeted mobile device advertisements |
US8266130B2 (en) | 2006-01-23 | 2012-09-11 | Chacha Search, Inc. | Search tool providing optional use of human search guides |
US8065286B2 (en) * | 2006-01-23 | 2011-11-22 | Chacha Search, Inc. | Scalable search system using human searchers |
US7814099B2 (en) * | 2006-01-31 | 2010-10-12 | Louis S. Wang | Method for ranking and sorting electronic documents in a search result list based on relevance |
US7584183B2 (en) * | 2006-02-01 | 2009-09-01 | Yahoo! Inc. | Method for node classification and scoring by combining parallel iterative scoring calculation |
US8429177B2 (en) * | 2006-02-08 | 2013-04-23 | Yahoo! Inc. | Using exceptional changes in webgraph snapshots over time for internet entity marking |
US7844603B2 (en) * | 2006-02-17 | 2010-11-30 | Google Inc. | Sharing user distributed search results |
KR100804671B1 (en) * | 2006-02-27 | 2008-02-20 | 엔에이치엔(주) | Local terminal search system and method for removing response delay |
US7493403B2 (en) * | 2006-03-13 | 2009-02-17 | Markmonitor Inc. | Domain name ownership validation |
US8117195B1 (en) | 2006-03-22 | 2012-02-14 | Google Inc. | Providing blog posts relevant to search results |
US7933890B2 (en) | 2006-03-31 | 2011-04-26 | Google Inc. | Propagating useful information among related web pages, such as web pages of a website |
US9135238B2 (en) * | 2006-03-31 | 2015-09-15 | Google Inc. | Disambiguation of named entities |
US7647314B2 (en) * | 2006-04-28 | 2010-01-12 | Yahoo! Inc. | System and method for indexing web content using click-through features |
US7624104B2 (en) * | 2006-06-22 | 2009-11-24 | Yahoo! Inc. | User-sensitive pagerank |
CN100524307C (en) * | 2006-06-27 | 2009-08-05 | 国际商业机器公司 | Method and device for establishing coupled relation between documents |
US7716236B2 (en) * | 2006-07-06 | 2010-05-11 | Aol Inc. | Temporal search query personalization |
US20080016072A1 (en) * | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Enterprise-Based Tag System |
US20080016053A1 (en) * | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Administration Console to Select Rank Factors |
US20080016052A1 (en) * | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Using Connections Between Users and Documents to Rank Documents in an Enterprise Search System |
US7873641B2 (en) * | 2006-07-14 | 2011-01-18 | Bea Systems, Inc. | Using tags in an enterprise search system |
US20080016061A1 (en) * | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Using a Core Data Structure to Calculate Document Ranks |
WO2008010847A2 (en) * | 2006-07-14 | 2008-01-24 | Bea Systems, Inc. | Improved enterprise search system |
US20080016071A1 (en) * | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Using Connections Between Users, Tags and Documents to Rank Documents in an Enterprise Search System |
WO2008010729A1 (en) * | 2006-07-17 | 2008-01-24 | Eurekster, Inc | A method of determining reputation for community search engines |
US8965874B1 (en) * | 2006-08-04 | 2015-02-24 | Google Inc. | Dynamic aggregation of users |
US8606834B2 (en) * | 2006-08-16 | 2013-12-10 | Apple Inc. | Managing supplied data |
US7831472B2 (en) | 2006-08-22 | 2010-11-09 | Yufik Yan M | Methods and system for search engine revenue maximization in internet advertising |
US20080126331A1 (en) * | 2006-08-25 | 2008-05-29 | Xerox Corporation | System and method for ranking reference documents |
US20080071797A1 (en) * | 2006-09-15 | 2008-03-20 | Thornton Nathaniel L | System and method to calculate average link growth on search engines for a keyword |
US9037581B1 (en) * | 2006-09-29 | 2015-05-19 | Google Inc. | Personalized search result ranking |
US8548991B1 (en) | 2006-09-29 | 2013-10-01 | Google Inc. | Personalized browsing activity displays |
US7577643B2 (en) * | 2006-09-29 | 2009-08-18 | Microsoft Corporation | Key phrase extraction from query logs |
US9740778B2 (en) * | 2006-10-10 | 2017-08-22 | Microsoft Technology Licensing, Llc | Ranking domains using domain maturity |
US8745183B2 (en) * | 2006-10-26 | 2014-06-03 | Yahoo! Inc. | System and method for adaptively refreshing a web page |
WO2008052205A2 (en) * | 2006-10-27 | 2008-05-02 | Jumptap, Inc. | Combined algorithmic and editorial-reviewed mobile content search results |
US7937403B2 (en) * | 2006-10-30 | 2011-05-03 | Yahoo! Inc. | Time-based analysis of related keyword searching |
US9110975B1 (en) * | 2006-11-02 | 2015-08-18 | Google Inc. | Search result inputs using variant generalized queries |
US8661029B1 (en) | 2006-11-02 | 2014-02-25 | Google Inc. | Modifying search result ranking based on implicit user feedback |
US20080126430A1 (en) * | 2006-11-28 | 2008-05-29 | Garrett Andrew J | Intermediary document for critical change control |
US8983970B1 (en) | 2006-12-07 | 2015-03-17 | Google Inc. | Ranking content using content and content authors |
US8577866B1 (en) * | 2006-12-07 | 2013-11-05 | Googe Inc. | Classifying content |
JP5137397B2 (en) * | 2006-12-28 | 2013-02-06 | キヤノン株式会社 | Data management apparatus, data processing method, and computer program |
US8280871B2 (en) * | 2006-12-29 | 2012-10-02 | Yahoo! Inc. | Identifying offensive content using user click data |
US8046358B2 (en) * | 2007-02-16 | 2011-10-25 | Ge Healthcare | Context-based information retrieval |
US8938463B1 (en) | 2007-03-12 | 2015-01-20 | Google Inc. | Modifying search result ranking based on implicit user feedback and a model of presentation bias |
US7827170B1 (en) | 2007-03-13 | 2010-11-02 | Google Inc. | Systems and methods for demoting personalized search results based on personal information |
US8694374B1 (en) | 2007-03-14 | 2014-04-08 | Google Inc. | Detecting click spam |
JP4861865B2 (en) * | 2007-03-15 | 2012-01-25 | 富士通株式会社 | Access result feedback program, recording medium, access result feedback method, access result feedback device, and terminal device |
JP4781466B2 (en) * | 2007-03-16 | 2011-09-28 | 富士通株式会社 | Document importance calculation program |
JP4894580B2 (en) * | 2007-03-20 | 2012-03-14 | 日本電気株式会社 | Seasonal analysis system, seasonality analysis method, and seasonality analysis program |
US8176055B1 (en) * | 2007-03-27 | 2012-05-08 | Google Inc. | Content entity management |
US8788320B1 (en) | 2007-03-28 | 2014-07-22 | Amazon Technologies, Inc. | Release advertisement system |
US7693813B1 (en) | 2007-03-30 | 2010-04-06 | Google Inc. | Index server architecture using tiered and sharded phrase posting lists |
US7925655B1 (en) | 2007-03-30 | 2011-04-12 | Google Inc. | Query scheduling using hierarchical tiers of index servers |
US20080244428A1 (en) * | 2007-03-30 | 2008-10-02 | Yahoo! Inc. | Visually Emphasizing Query Results Based on Relevance Feedback |
US8166045B1 (en) | 2007-03-30 | 2012-04-24 | Google Inc. | Phrase extraction using subphrase scoring |
US8086594B1 (en) | 2007-03-30 | 2011-12-27 | Google Inc. | Bifurcated document relevance scoring |
US8166021B1 (en) | 2007-03-30 | 2012-04-24 | Google Inc. | Query phrasification |
US7702614B1 (en) | 2007-03-30 | 2010-04-20 | Google Inc. | Index updating using segment swapping |
US7672937B2 (en) * | 2007-04-11 | 2010-03-02 | Yahoo, Inc. | Temporal targeting of advertisements |
US7676520B2 (en) * | 2007-04-12 | 2010-03-09 | Microsoft Corporation | Calculating importance of documents factoring historical importance |
IL182518A0 (en) * | 2007-04-12 | 2007-09-20 | Grois Dan | Pay per relevance (ppr) advertising method and system |
US9092510B1 (en) | 2007-04-30 | 2015-07-28 | Google Inc. | Modifying search result ranking based on a temporal element of user feedback |
US20080275846A1 (en) * | 2007-05-04 | 2008-11-06 | Sony Ericsson Mobile Communications Ab | Filtering search results using contact lists |
US7788254B2 (en) * | 2007-05-04 | 2010-08-31 | Microsoft Corporation | Web page analysis using multiple graphs |
US20080275877A1 (en) * | 2007-05-04 | 2008-11-06 | International Business Machines Corporation | Method and system for variable keyword processing based on content dates on a web page |
US8706696B2 (en) | 2007-05-04 | 2014-04-22 | Salesforce.Com, Inc. | Method and system for on-demand communities |
US20090248623A1 (en) * | 2007-05-09 | 2009-10-01 | The Go Daddy Group, Inc. | Accessing digital identity related reputation data |
US8359309B1 (en) | 2007-05-23 | 2013-01-22 | Google Inc. | Modifying search result ranking based on corpus search statistics |
US8046372B1 (en) | 2007-05-25 | 2011-10-25 | Amazon Technologies, Inc. | Duplicate entry detection system and method |
US7814107B1 (en) | 2007-05-25 | 2010-10-12 | Amazon Technologies, Inc. | Generating similarity scores for matching non-identical data strings |
US7908279B1 (en) | 2007-05-25 | 2011-03-15 | Amazon Technologies, Inc. | Filtering invalid tokens from a document using high IDF token filtering |
US7644075B2 (en) * | 2007-06-01 | 2010-01-05 | Microsoft Corporation | Keyword usage score based on frequency impulse and frequency weight |
US8244737B2 (en) * | 2007-06-18 | 2012-08-14 | Microsoft Corporation | Ranking documents based on a series of document graphs |
US20090006358A1 (en) * | 2007-06-27 | 2009-01-01 | Microsoft Corporation | Search results |
US8290986B2 (en) * | 2007-06-27 | 2012-10-16 | Yahoo! Inc. | Determining quality measures for web objects based on searcher behavior |
US20090006341A1 (en) * | 2007-06-28 | 2009-01-01 | Bruce Chapman | Method of website ranking promotion using creation of mass blog posting links |
US20090013068A1 (en) * | 2007-07-02 | 2009-01-08 | Eaglestone Robert J | Systems and processes for evaluating webpages |
US20090013033A1 (en) * | 2007-07-06 | 2009-01-08 | Yahoo! Inc. | Identifying excessively reciprocal links among web entities |
US7991790B2 (en) * | 2007-07-20 | 2011-08-02 | Salesforce.Com, Inc. | System and method for storing documents accessed by multiple users in an on-demand service |
US7966341B2 (en) * | 2007-08-06 | 2011-06-21 | Yahoo! Inc. | Estimating the date relevance of a query from query logs |
CN102016825A (en) | 2007-08-17 | 2011-04-13 | 谷歌公司 | Ranking social network objects |
US20110022621A1 (en) * | 2007-08-17 | 2011-01-27 | Google Inc. | Dynamically naming communities within online social networks |
US20110010384A1 (en) * | 2007-08-17 | 2011-01-13 | Google Inc. | Multi-community content sharing in online social networks |
US8694511B1 (en) | 2007-08-20 | 2014-04-08 | Google Inc. | Modifying search result ranking based on populations |
EP2193457A1 (en) * | 2007-09-03 | 2010-06-09 | IQser IP AG | Detecting correlations between data representing information |
US8117223B2 (en) * | 2007-09-07 | 2012-02-14 | Google Inc. | Integrating external related phrase information into a phrase-based indexing information retrieval system |
CN100514337C (en) * | 2007-09-10 | 2009-07-15 | 腾讯科技(深圳)有限公司 | Association information generating system of key words and generation method thereof |
JP2009070156A (en) * | 2007-09-13 | 2009-04-02 | Ntt Docomo Inc | Information retrieval system and information retrieval method |
KR20090030966A (en) * | 2007-09-21 | 2009-03-25 | 삼성전자주식회사 | Method and device for organizing menu list ranking in portable terminal |
US20090089311A1 (en) * | 2007-09-28 | 2009-04-02 | Yahoo! Inc. | System and method for inclusion of history in a search results page |
US8909655B1 (en) | 2007-10-11 | 2014-12-09 | Google Inc. | Time based ranking |
US9348912B2 (en) * | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US7840569B2 (en) * | 2007-10-18 | 2010-11-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090106221A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US8078613B2 (en) * | 2007-11-28 | 2011-12-13 | Red Hat, Inc. | Method for removing network effects from search engine results |
US9946722B2 (en) * | 2007-11-30 | 2018-04-17 | Red Hat, Inc. | Generating file usage information |
US7895225B1 (en) * | 2007-12-06 | 2011-02-22 | Amazon Technologies, Inc. | Identifying potential duplicates of a document in a document corpus |
JP2009145953A (en) * | 2007-12-11 | 2009-07-02 | Sharp Corp | Data retrieving apparatus, data retrieving method, computer program, and recording medium |
US8176017B2 (en) * | 2007-12-14 | 2012-05-08 | Microsoft Corporation | Live volume access |
US9239882B2 (en) * | 2007-12-17 | 2016-01-19 | Iac Search & Media, Inc. | System and method for categorizing answers such as URLs |
US9501453B2 (en) | 2007-12-23 | 2016-11-22 | Salesforce.Com Inc. | Method and system for a flexible-data column user interface |
JP2009157422A (en) * | 2007-12-25 | 2009-07-16 | Fuji Xerox Co Ltd | Handling restriction information management system and program |
US8578260B2 (en) * | 2007-12-28 | 2013-11-05 | Business Objects Software Limited | Apparatus and method for reformatting a report for access by a user in a network appliance |
US20090182614A1 (en) * | 2008-01-11 | 2009-07-16 | Yahoo! Inc. | System And Method For Serving Advertisements According To Network Traffic |
US8752184B1 (en) * | 2008-01-17 | 2014-06-10 | Google Inc. | Spam detection for user-generated multimedia items based on keyword stuffing |
US8745056B1 (en) * | 2008-03-31 | 2014-06-03 | Google Inc. | Spam detection for user-generated multimedia items based on concept clustering |
US7860755B2 (en) * | 2008-02-19 | 2010-12-28 | The Go Daddy Group, Inc. | Rating e-commerce transactions |
US7653577B2 (en) * | 2008-02-19 | 2010-01-26 | The Go Daddy Group, Inc. | Validating e-commerce transactions |
US9489495B2 (en) | 2008-02-25 | 2016-11-08 | Georgetown University | System and method for detecting, collecting, analyzing, and communicating event-related information |
US8881040B2 (en) | 2008-08-28 | 2014-11-04 | Georgetown University | System and method for detecting, collecting, analyzing, and communicating event-related information |
US9746985B1 (en) | 2008-02-25 | 2017-08-29 | Georgetown University | System and method for detecting, collecting, analyzing, and communicating event-related information |
US9529974B2 (en) | 2008-02-25 | 2016-12-27 | Georgetown University | System and method for detecting, collecting, analyzing, and communicating event-related information |
US8224832B2 (en) * | 2008-02-29 | 2012-07-17 | Kemp Richard Douglas | Computerized document examination for changes |
US8171020B1 (en) | 2008-03-31 | 2012-05-01 | Google Inc. | Spam detection for user-generated multimedia items based on appearance in popular queries |
US8812493B2 (en) * | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US9128945B1 (en) * | 2008-05-16 | 2015-09-08 | Google Inc. | Query augmentation |
EP2304660A4 (en) * | 2008-06-19 | 2013-11-27 | Wize Technologies Inc | System and method for aggregating and summarizing product/topic sentiment |
US20100010982A1 (en) * | 2008-07-09 | 2010-01-14 | Broder Andrei Z | Web content characterization based on semantic folksonomies associated with user generated content |
US8538942B2 (en) | 2008-09-12 | 2013-09-17 | Salesforce.Com, Inc. | Method and system for sharing documents between on-demand services |
US20100082649A1 (en) * | 2008-09-22 | 2010-04-01 | Microsoft Corporation | Automatic search suggestions from server-side user history |
US8370329B2 (en) * | 2008-09-22 | 2013-02-05 | Microsoft Corporation | Automatic search query suggestions with search result suggestions from user history |
KR101086530B1 (en) * | 2008-10-02 | 2011-11-23 | 엔에이치엔(주) | Method and system for determining web document origin, method and system for providing web document history information therefor |
US20100169492A1 (en) * | 2008-12-04 | 2010-07-01 | The Go Daddy Group, Inc. | Generating domain names relevant to social website trending topics |
US8396865B1 (en) | 2008-12-10 | 2013-03-12 | Google Inc. | Sharing search engine relevance data between corpora |
US9037999B2 (en) * | 2008-12-31 | 2015-05-19 | Tivo Inc. | Adaptive search result user interface |
US9152300B2 (en) | 2008-12-31 | 2015-10-06 | Tivo Inc. | Methods and techniques for adaptive search |
US8239397B2 (en) * | 2009-01-27 | 2012-08-07 | Palo Alto Research Center Incorporated | System and method for managing user attention by detecting hot and cold topics in social indexes |
JP2010176354A (en) * | 2009-01-29 | 2010-08-12 | Fuji Xerox Co Ltd | Information processor and information processing program |
US8001462B1 (en) | 2009-01-30 | 2011-08-16 | Google Inc. | Updating search engine document index based on calculated age of changed portions in a document |
US9836538B2 (en) * | 2009-03-03 | 2017-12-05 | Microsoft Technology Licensing, Llc | Domain-based ranking in document search |
CN101499098B (en) * | 2009-03-04 | 2012-07-11 | 阿里巴巴集团控股有限公司 | Web page assessed value confirming and employing method and system |
US8224839B2 (en) * | 2009-04-07 | 2012-07-17 | Microsoft Corporation | Search query extension |
US9009146B1 (en) | 2009-04-08 | 2015-04-14 | Google Inc. | Ranking search results based on similar queries |
CN101887437B (en) * | 2009-05-12 | 2016-03-30 | 阿里巴巴集团控股有限公司 | A kind of Search Results generation method and information search system |
US8719298B2 (en) * | 2009-05-21 | 2014-05-06 | Microsoft Corporation | Click-through prediction for news queries |
US10353967B2 (en) * | 2009-06-22 | 2019-07-16 | Microsoft Technology Licensing, Llc | Assigning relevance weights based on temporal dynamics |
US20100332550A1 (en) * | 2009-06-26 | 2010-12-30 | Microsoft Corporation | Platform For Configurable Logging Instrumentation |
US20100332531A1 (en) * | 2009-06-26 | 2010-12-30 | Microsoft Corporation | Batched Transfer of Arbitrarily Distributed Data |
US9870572B2 (en) * | 2009-06-29 | 2018-01-16 | Google Llc | System and method of providing information based on street address |
US20150261858A1 (en) * | 2009-06-29 | 2015-09-17 | Google Inc. | System and method of providing information based on street address |
US8447760B1 (en) | 2009-07-20 | 2013-05-21 | Google Inc. | Generating a related set of documents for an initial set of documents |
AU2009350126A1 (en) * | 2009-07-22 | 2012-02-23 | Foundationip, Llc | Method, system, and apparatus for delivering query results from an electronic document collection |
US20110029516A1 (en) * | 2009-07-30 | 2011-02-03 | Microsoft Corporation | Web-Used Pattern Insight Platform |
US8082247B2 (en) * | 2009-07-30 | 2011-12-20 | Microsoft Corporation | Best-bet recommendations |
GB2472250A (en) * | 2009-07-31 | 2011-02-02 | Stephen Timothy Morris | Method for determining document relevance |
JP5014386B2 (en) * | 2009-08-12 | 2012-08-29 | ヤフー株式会社 | Content search device |
US8498974B1 (en) | 2009-08-31 | 2013-07-30 | Google Inc. | Refining search results |
JP5002631B2 (en) * | 2009-09-04 | 2012-08-15 | ヤフー株式会社 | Word information collection device, word information collection method, and word information collection program |
US8595194B2 (en) * | 2009-09-15 | 2013-11-26 | At&T Intellectual Property I, L.P. | Forward decay temporal data analysis |
US20110078017A1 (en) * | 2009-09-29 | 2011-03-31 | Selina Lam | Systems and methods for rating an originator of an online publication |
US8972391B1 (en) * | 2009-10-02 | 2015-03-03 | Google Inc. | Recent interest based relevance scoring |
US8874555B1 (en) | 2009-11-20 | 2014-10-28 | Google Inc. | Modifying scoring data based on historical changes |
US8515975B1 (en) | 2009-12-07 | 2013-08-20 | Google Inc. | Search entity transition matrix and applications of the transition matrix |
US9043319B1 (en) * | 2009-12-07 | 2015-05-26 | Google Inc. | Generating real-time search results |
US20110145822A1 (en) * | 2009-12-10 | 2011-06-16 | The Go Daddy Group, Inc. | Generating and recommending task solutions |
US20110145823A1 (en) * | 2009-12-10 | 2011-06-16 | The Go Daddy Group, Inc. | Task management engine |
US8311792B1 (en) * | 2009-12-23 | 2012-11-13 | Intuit Inc. | System and method for ranking a posting |
KR101361328B1 (en) * | 2009-12-28 | 2014-02-10 | 라쿠텐 인코포레이티드 | Information search device, number-of-items determination method, information search system and recording medium |
US20110178868A1 (en) * | 2010-01-21 | 2011-07-21 | Priyank Shanker Garg | Enhancing search result pages using content licensed from content providers |
US8615514B1 (en) | 2010-02-03 | 2013-12-24 | Google Inc. | Evaluating website properties by partitioning user feedback |
EP2533163A4 (en) * | 2010-02-04 | 2015-04-15 | Ebay Inc | List display on the basis of list activities and related applications |
US8924379B1 (en) | 2010-03-05 | 2014-12-30 | Google Inc. | Temporal-based score adjustments |
US8959093B1 (en) | 2010-03-15 | 2015-02-17 | Google Inc. | Ranking search results based on anchors |
US8700642B2 (en) * | 2010-03-22 | 2014-04-15 | Microsoft Corporation | Software agent for monitoring content relevance |
US8650195B2 (en) * | 2010-03-26 | 2014-02-11 | Palle M Pedersen | Region based information retrieval system |
US8260789B2 (en) * | 2010-04-01 | 2012-09-04 | Microsoft Corporation | System and method for authority value obtained by defining ranking functions related to weight and confidence value |
CN101883180A (en) * | 2010-05-11 | 2010-11-10 | 中兴通讯股份有限公司 | Method and system for shielding information in wireless network accessed by mobile terminal and mobile terminal |
US9116990B2 (en) * | 2010-05-27 | 2015-08-25 | Microsoft Technology Licensing, Llc | Enhancing freshness of search results |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US8738377B2 (en) * | 2010-06-07 | 2014-05-27 | Google Inc. | Predicting and learning carrier phrases for speech input |
US8595207B2 (en) * | 2010-06-14 | 2013-11-26 | Salesforce.Com | Methods and systems for dynamically suggesting answers to questions submitted to a portal of an online service |
US9623119B1 (en) | 2010-06-29 | 2017-04-18 | Google Inc. | Accentuating search results |
AU2010202901B2 (en) * | 2010-07-08 | 2016-04-14 | Patent Analytics Holding Pty Ltd | A system, method and computer program for preparing data for analysis |
US8832083B1 (en) | 2010-07-23 | 2014-09-09 | Google Inc. | Combining user feedback |
US9020922B2 (en) * | 2010-08-10 | 2015-04-28 | Brightedge Technologies, Inc. | Search engine optimization at scale |
US20120047044A1 (en) * | 2010-08-19 | 2012-02-23 | Stephen James Lazuka | Method to Develop Search Engine Optimized Content Through a Web-Based Software Platform |
US8332408B1 (en) | 2010-08-23 | 2012-12-11 | Google Inc. | Date-based web page annotation |
US8762326B1 (en) | 2010-09-23 | 2014-06-24 | Google Inc. | Personalized hot topics |
US8346792B1 (en) | 2010-11-09 | 2013-01-01 | Google Inc. | Query generation using structural similarity between documents |
US8861896B2 (en) * | 2010-11-29 | 2014-10-14 | Sap Se | Method and system for image-based identification |
US9348925B2 (en) * | 2010-12-01 | 2016-05-24 | Google Inc. | Locally significant search queries |
US8688706B2 (en) | 2010-12-01 | 2014-04-01 | Google Inc. | Topic based user profiles |
JP5673051B2 (en) * | 2010-12-09 | 2015-02-18 | 日本電気株式会社 | Document feature amount calculation apparatus, document feature amount calculation method, and document feature amount calculation program |
US8793706B2 (en) | 2010-12-16 | 2014-07-29 | Microsoft Corporation | Metadata-based eventing supporting operations on data |
US9002867B1 (en) | 2010-12-30 | 2015-04-07 | Google Inc. | Modifying ranking data based on document changes |
US8370365B1 (en) | 2011-01-31 | 2013-02-05 | Go Daddy Operating Company, LLC | Tools for predicting improvement in website search engine rankings based upon website linking relationships |
US8972412B1 (en) | 2011-01-31 | 2015-03-03 | Go Daddy Operating Company, LLC | Predicting improvement in website search engine rankings based upon website linking relationships |
US10162892B2 (en) * | 2011-02-28 | 2018-12-25 | International Business Machines Corporation | Identifying information assets within an enterprise using a semantic graph created using feedback re-enforced search and navigation |
US9646110B2 (en) | 2011-02-28 | 2017-05-09 | International Business Machines Corporation | Managing information assets using feedback re-enforced search and navigation |
WO2012129102A2 (en) * | 2011-03-22 | 2012-09-27 | Brightedge Technologies, Inc. | Detection and analysis of backlink activity |
US8732151B2 (en) | 2011-04-01 | 2014-05-20 | Microsoft Corporation | Enhanced query rewriting through statistical machine translation |
EP2700025A4 (en) * | 2011-04-19 | 2014-10-22 | Nokia Corp | METHOD AND APPARATUS FOR SOFT DIVERSIFICATION OF RECOMMENDATION RESULTS |
US8775431B2 (en) * | 2011-04-25 | 2014-07-08 | Disney Enterprises, Inc. | Systems and methods for hot topic identification and metadata |
US8819000B1 (en) * | 2011-05-03 | 2014-08-26 | Google Inc. | Query modification |
US20120304072A1 (en) * | 2011-05-23 | 2012-11-29 | Microsoft Corporation | Sentiment-based content aggregation and presentation |
US10068022B2 (en) * | 2011-06-03 | 2018-09-04 | Google Llc | Identifying topical entities |
US10223451B2 (en) * | 2011-06-14 | 2019-03-05 | International Business Machines Corporation | Ranking search results based upon content creation trends |
CA2832911C (en) * | 2011-06-22 | 2016-12-13 | Rogers Communications Inc. | System and method for filtering documents |
US9286334B2 (en) | 2011-07-15 | 2016-03-15 | International Business Machines Corporation | Versioning of metadata, including presentation of provenance and lineage for versioned metadata |
US9384193B2 (en) | 2011-07-15 | 2016-07-05 | International Business Machines Corporation | Use and enforcement of provenance and lineage constraints |
US8510285B1 (en) * | 2011-08-18 | 2013-08-13 | Google Inc. | Using pre-search triggers |
JP5506104B2 (en) * | 2011-09-30 | 2014-05-28 | 楽天株式会社 | Information processing apparatus, information processing method, and information processing program |
KR101510647B1 (en) * | 2011-10-07 | 2015-04-10 | 한국전자통신연구원 | Method and apparatus for providing web trend analysis based on issue template extraction |
US10776431B2 (en) * | 2011-10-26 | 2020-09-15 | Oath Inc. | System and method for recommending content based on search history and trending topics |
US8694507B2 (en) * | 2011-11-02 | 2014-04-08 | Microsoft Corporation | Tenantization of search result ranking |
US9104769B2 (en) * | 2011-11-10 | 2015-08-11 | Room 77, Inc. | Metasearch infrastructure with incremental updates |
US9436758B1 (en) | 2011-12-27 | 2016-09-06 | Google Inc. | Methods and systems for partitioning documents having customer feedback and support content |
US8868536B1 (en) * | 2012-01-04 | 2014-10-21 | Google Inc. | Real time map spam detection |
US9201964B2 (en) | 2012-01-23 | 2015-12-01 | Microsoft Technology Licensing, Llc | Identifying related entities |
US9418065B2 (en) | 2012-01-26 | 2016-08-16 | International Business Machines Corporation | Tracking changes related to a collection of documents |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
JP5929356B2 (en) * | 2012-03-15 | 2016-06-01 | 富士ゼロックス株式会社 | Information processing apparatus and information processing program |
US9189526B1 (en) * | 2012-03-21 | 2015-11-17 | Google Inc. | Freshness based ranking |
CN103049511B (en) * | 2012-03-28 | 2016-02-03 | 温州大学 | The display packing of a kind of microblogging concern list, content of microblog and client thereof |
US9081831B2 (en) | 2012-03-30 | 2015-07-14 | Google Inc. | Methods and systems for presenting document-specific snippets |
CN103377191B (en) * | 2012-04-12 | 2017-04-12 | 阿里巴巴集团控股有限公司 | Method and device for providing relevant information of images |
US20130282707A1 (en) * | 2012-04-24 | 2013-10-24 | Discovery Engine Corporation | Two-step combiner for search result scores |
US9916396B2 (en) | 2012-05-11 | 2018-03-13 | Google Llc | Methods and systems for content-based search |
US8924375B1 (en) * | 2012-05-31 | 2014-12-30 | Symantec Corporation | Item attention tracking system and method |
US8954438B1 (en) | 2012-05-31 | 2015-02-10 | Google Inc. | Structured metadata extraction |
US8984012B2 (en) * | 2012-06-20 | 2015-03-17 | Microsoft Technology Licensing, Llc | Self-tuning alterations framework |
US9471606B1 (en) | 2012-06-25 | 2016-10-18 | Google Inc. | Obtaining information to provide to users |
US9195717B2 (en) * | 2012-06-26 | 2015-11-24 | Google Inc. | Image result provisioning based on document classification |
US9436687B2 (en) * | 2012-07-09 | 2016-09-06 | Facebook, Inc. | Acquiring structured user data using composer interface having input fields corresponding to acquired structured data |
US9110852B1 (en) | 2012-07-20 | 2015-08-18 | Google Inc. | Methods and systems for extracting information from text |
US8793258B2 (en) * | 2012-07-31 | 2014-07-29 | Hewlett-Packard Development Company, L.P. | Predicting sharing on a social network |
US9390174B2 (en) | 2012-08-08 | 2016-07-12 | Google Inc. | Search result ranking and presentation |
US8938438B2 (en) | 2012-10-11 | 2015-01-20 | Go Daddy Operating Company, LLC | Optimizing search engine ranking by recommending content including frequently searched questions |
US8898113B2 (en) | 2012-11-21 | 2014-11-25 | International Business Machines Corporation | Managing replicated data |
US9558233B1 (en) | 2012-11-30 | 2017-01-31 | Google Inc. | Determining a quality measure for a resource |
US9256682B1 (en) | 2012-12-05 | 2016-02-09 | Google Inc. | Providing search results based on sorted properties |
US8949228B2 (en) * | 2013-01-15 | 2015-02-03 | Google Inc. | Identification of new sources for topics |
US20140236964A1 (en) * | 2013-02-19 | 2014-08-21 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems And Methods For Ranking A Plurality Of Documents Based On User Activity |
US9218819B1 (en) | 2013-03-01 | 2015-12-22 | Google Inc. | Customizing actions based on contextual data and voice-based inputs |
US11429651B2 (en) * | 2013-03-14 | 2022-08-30 | International Business Machines Corporation | Document provenance scoring based on changes between document versions |
US10055462B2 (en) | 2013-03-15 | 2018-08-21 | Google Llc | Providing search results using augmented search queries |
US9501506B1 (en) | 2013-03-15 | 2016-11-22 | Google Inc. | Indexing system |
US10108700B2 (en) | 2013-03-15 | 2018-10-23 | Google Llc | Question answering to populate knowledge base |
US9477759B2 (en) | 2013-03-15 | 2016-10-25 | Google Inc. | Question answering using entity references in unstructured data |
ES2518015B1 (en) * | 2013-04-01 | 2015-08-12 | Crambo, S.A. | METHOD, MOBILE DEVICE, SYSTEM AND COMPUTER PRODUCT FOR THE DETECTION AND MEASUREMENT OF A USER'S CARE LEVEL |
US9183499B1 (en) | 2013-04-19 | 2015-11-10 | Google Inc. | Evaluating quality based on neighbor features |
US9251146B2 (en) | 2013-05-10 | 2016-02-02 | International Business Machines Corporation | Altering relevancy of a document and/or a search query |
CN105247481B (en) * | 2013-05-29 | 2019-05-07 | 惠普发展公司,有限责任合伙企业 | The computing system, method and machine readable non-transitory storage medium of selection are exported for webpage |
US9483568B1 (en) | 2013-06-05 | 2016-11-01 | Google Inc. | Indexing system |
RU2592390C2 (en) * | 2013-07-15 | 2016-07-20 | Общество С Ограниченной Ответственностью "Яндекс" | System, method and device for evaluation of browsing sessions |
US20150046219A1 (en) * | 2013-08-08 | 2015-02-12 | Mark J. Shavlik | Avatar-based automated lead scoring system |
US9946804B2 (en) | 2013-08-19 | 2018-04-17 | Business Objects Software Ltd | Displaying historical data associated with data values within business intelligence reports |
US20150058073A1 (en) * | 2013-08-20 | 2015-02-26 | Dmitrii Gorbunov | Crowdsourced innovation exchange |
US9723053B1 (en) | 2013-08-30 | 2017-08-01 | Amazon Technologies, Inc. | Pre-fetching a cacheable network resource based on a time-to-live value |
US10079737B2 (en) * | 2013-09-13 | 2018-09-18 | Clicktale Ltd. | Method and system for generating comparable visual maps for browsing activity analysis |
US10902004B2 (en) * | 2013-10-16 | 2021-01-26 | Salesforce.Com, Inc. | Processing user-submitted updates based on user reliability scores |
US11017426B1 (en) * | 2013-12-20 | 2021-05-25 | BloomReach Inc. | Content performance analytics |
CN104753805B (en) * | 2013-12-31 | 2018-07-24 | 腾讯科技(深圳)有限公司 | Distributed flow control method, server and system |
US20150186463A1 (en) * | 2013-12-31 | 2015-07-02 | International Business Machines Corporation | Identifying changes to query results system and method |
US9984165B2 (en) * | 2014-02-13 | 2018-05-29 | Amadeus S.A.S. | Increasing search result validity |
US9582536B2 (en) | 2014-02-19 | 2017-02-28 | Amadeus S.A.S. | Long-term validity of pre-computed request results |
US9471689B2 (en) | 2014-05-29 | 2016-10-18 | International Business Machines Corporation | Managing documents in question answering systems |
US9875242B2 (en) * | 2014-06-03 | 2018-01-23 | Google Llc | Dynamic current results for second device |
US9934319B2 (en) | 2014-07-04 | 2018-04-03 | Yandex Europe Ag | Method of and system for determining creation time of a web resource |
US9692804B2 (en) | 2014-07-04 | 2017-06-27 | Yandex Europe Ag | Method of and system for determining creation time of a web resource |
US10592539B1 (en) * | 2014-07-11 | 2020-03-17 | Twitter, Inc. | Trends in a messaging platform |
AU2015287694A1 (en) * | 2014-07-11 | 2017-02-02 | Celgene Corporation | Combination therapy for cancer |
US10601749B1 (en) | 2014-07-11 | 2020-03-24 | Twitter, Inc. | Trends in a messaging platform |
EP3178021A1 (en) * | 2014-08-05 | 2017-06-14 | Piksel, Inc. | Content source driven recommendation for given context of content delivery and display system |
US9703840B2 (en) | 2014-08-13 | 2017-07-11 | International Business Machines Corporation | Handling information source ingestion in a question answering system |
US10572925B1 (en) | 2014-08-15 | 2020-02-25 | Groupon, Inc. | Universal relevance service framework |
US10459927B1 (en) | 2014-08-15 | 2019-10-29 | Groupon, Inc. | Enforcing diversity in ranked relevance results returned from a universal relevance service framework |
US11216843B1 (en) | 2014-08-15 | 2022-01-04 | Groupon, Inc. | Ranked relevance results using multi-feature scoring returned from a universal relevance service framework |
US10210214B2 (en) * | 2014-08-27 | 2019-02-19 | International Business Machines Corporation | Scalable trend detection in a personalized search context |
US9501851B2 (en) | 2014-10-03 | 2016-11-22 | Palantir Technologies Inc. | Time-series analysis system |
US9767172B2 (en) * | 2014-10-03 | 2017-09-19 | Palantir Technologies Inc. | Data aggregation and analysis system |
US9690862B2 (en) * | 2014-10-18 | 2017-06-27 | International Business Machines Corporation | Realtime ingestion via multi-corpus knowledge base with weighting |
US10042514B2 (en) * | 2014-10-30 | 2018-08-07 | Microsoft Technology Licensing, Llc | Typeahead features |
RU2580432C1 (en) | 2014-10-31 | 2016-04-10 | Общество С Ограниченной Ответственностью "Яндекс" | Method for processing a request from a potential unauthorised user to access resource and server used therein |
RU2610280C2 (en) | 2014-10-31 | 2017-02-08 | Общество С Ограниченной Ответственностью "Яндекс" | Method for user authorization in a network and server used therein |
US9785304B2 (en) | 2014-10-31 | 2017-10-10 | Bank Of America Corporation | Linking customer profiles with household profiles |
US9940409B2 (en) * | 2014-10-31 | 2018-04-10 | Bank Of America Corporation | Contextual search tool |
US9922117B2 (en) | 2014-10-31 | 2018-03-20 | Bank Of America Corporation | Contextual search input from advisors |
US9160680B1 (en) | 2014-11-18 | 2015-10-13 | Kaspersky Lab Zao | System and method for dynamic network resource categorization re-assignment |
CN104778202B (en) * | 2015-02-05 | 2018-08-14 | 北京航空航天大学 | The analysis method and system of event evolutionary process based on keyword |
US9836435B2 (en) | 2015-03-19 | 2017-12-05 | International Business Machines Corporation | Embedded content suitability scoring |
CN104731914A (en) * | 2015-03-24 | 2015-06-24 | 浪潮集团有限公司 | Method for detecting user abnormal behavior based on behavior similarity |
US9984330B2 (en) | 2015-04-10 | 2018-05-29 | Microsoft Technology Licensing, Llc. | Predictive trending of digital entities |
US11803918B2 (en) | 2015-07-07 | 2023-10-31 | Oracle International Corporation | System and method for identifying experts on arbitrary topics in an enterprise social network |
RU2632131C2 (en) * | 2015-08-28 | 2017-10-02 | Общество С Ограниченной Ответственностью "Яндекс" | Method and device for creating recommended list of content |
RU2629638C2 (en) | 2015-09-28 | 2017-08-30 | Общество С Ограниченной Ответственностью "Яндекс" | Method and server of creating recommended set of elements for user |
RU2632100C2 (en) * | 2015-09-28 | 2017-10-02 | Общество С Ограниченной Ответственностью "Яндекс" | Method and server of recommended set of elements creation |
US11442945B1 (en) * | 2015-12-31 | 2022-09-13 | Groupon, Inc. | Dynamic freshness for relevance rankings |
CA3014072A1 (en) | 2016-02-08 | 2017-08-17 | Acxiom Corporation | Change fingerprinting for database tables, text files, and data feeds |
RU2632144C1 (en) | 2016-05-12 | 2017-10-02 | Общество С Ограниченной Ответственностью "Яндекс" | Computer method for creating content recommendation interface |
RU2632132C1 (en) | 2016-07-07 | 2017-10-02 | Общество С Ограниченной Ответственностью "Яндекс" | Method and device for creating contents recommendations in recommendations system |
RU2636702C1 (en) | 2016-07-07 | 2017-11-27 | Общество С Ограниченной Ответственностью "Яндекс" | Method and device for selecting network resource as source of content in recommendations system |
US10769156B2 (en) * | 2016-08-26 | 2020-09-08 | Microsoft Technology Licensing, Llc | Rank query results for relevance utilizing external context |
USD882600S1 (en) | 2017-01-13 | 2020-04-28 | Yandex Europe Ag | Display screen with graphical user interface |
CN107622090B (en) * | 2017-08-22 | 2020-10-16 | 上海艾融软件股份有限公司 | Object acquisition method, device and system |
US10853375B1 (en) | 2017-08-25 | 2020-12-01 | Roblox Corporation | Leveraging historical data to improve the relevancy of search results |
CN109446402B (en) * | 2017-08-29 | 2022-04-01 | 阿里巴巴集团控股有限公司 | Searching method and device |
US11163759B2 (en) * | 2017-12-21 | 2021-11-02 | Salesforce.Com, Inc. | Predicting entities for database query results |
CN110569335B (en) * | 2018-03-23 | 2022-05-27 | 百度在线网络技术(北京)有限公司 | Triple verification method and device based on artificial intelligence and storage medium |
US11514095B2 (en) | 2018-05-04 | 2022-11-29 | International Business Machines Corporation | Tiered retrieval of secured documents |
US10796022B2 (en) | 2018-05-16 | 2020-10-06 | Ebay Inc. | Weighted source data secured on blockchains |
US10671371B2 (en) | 2018-06-12 | 2020-06-02 | International Business Machines Corporation | Alerting an offline user of a predicted computer file update |
US11017221B2 (en) * | 2018-07-01 | 2021-05-25 | International Business Machines Corporation | Classifying digital documents in multi-document transactions based on embedded dates |
US10885081B2 (en) | 2018-07-02 | 2021-01-05 | Optum Technology, Inc. | Systems and methods for contextual ranking of search results |
GB201811003D0 (en) | 2018-07-04 | 2018-08-15 | Bp Plc | Multiple cooling circuit systems and methods for using them |
RU2720952C2 (en) | 2018-09-14 | 2020-05-15 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for generating digital content recommendation |
RU2714594C1 (en) | 2018-09-14 | 2020-02-18 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for determining parameter relevance for content items |
RU2720899C2 (en) | 2018-09-14 | 2020-05-14 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for determining user-specific content proportions for recommendation |
US11294974B1 (en) * | 2018-10-04 | 2022-04-05 | Apple Inc. | Golden embeddings |
RU2725659C2 (en) | 2018-10-08 | 2020-07-03 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for evaluating data on user-element interactions |
RU2731335C2 (en) | 2018-10-09 | 2020-09-01 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for generating recommendations of digital content |
US11003889B2 (en) | 2018-10-22 | 2021-05-11 | International Business Machines Corporation | Classifying digital documents in multi-document transactions based on signatory role analysis |
US11677703B2 (en) | 2019-08-15 | 2023-06-13 | Rovi Guides, Inc. | Systems and methods for automatically identifying spam in social media comments based on context |
US11258741B2 (en) * | 2019-08-15 | 2022-02-22 | Rovi Guides, Inc. | Systems and methods for automatically identifying spam in social media comments |
RU2757406C1 (en) | 2019-09-09 | 2021-10-15 | Общество С Ограниченной Ответственностью «Яндекс» | Method and system for providing a level of service when advertising content element |
KR102426056B1 (en) * | 2019-10-30 | 2022-07-27 | 네이버 주식회사 | Method, system, and computer program for dedecting multimodal abusing pattern to select document |
JP2021077256A (en) * | 2019-11-13 | 2021-05-20 | 株式会社Fronteo | Document processing device, document review system, document processing device control method, document review service providing method, and control program |
WO2022097197A1 (en) | 2020-11-04 | 2022-05-12 | データ・サイエンティスト株式会社 | Search needs evaluation program, search needs evaluation device, search needs evaluation method, evaluation program, evaluation device, and evaluation method |
CN112783837B (en) * | 2021-01-12 | 2024-01-30 | 北京首汽智行科技有限公司 | API document searching method |
US12126728B2 (en) * | 2021-06-15 | 2024-10-22 | Whitestar Communications, Inc. | Anti-replay protection based on hashing encrypted temporal key in a secure peer-to-peer data network |
CN114090935B (en) * | 2021-11-25 | 2024-10-29 | 马上消费金融股份有限公司 | Data acquisition method and device |
US11914906B2 (en) * | 2022-05-17 | 2024-02-27 | Kyocera Document Solutions Inc. | Pre-processing print jobs |
US12197483B1 (en) * | 2023-11-01 | 2025-01-14 | Varonis Systems, Inc. | Enterprise-level classification of data-items in an enterprise repository and prevention of leakage of personally identifiable information (PII) |
Family Cites Families (145)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5594897A (en) * | 1993-09-01 | 1997-01-14 | Gwg Associates | Method for retrieving high relevance, high quality objects from an overall source |
US5465353A (en) * | 1994-04-01 | 1995-11-07 | Ricoh Company, Ltd. | Image matching and retrieval by multi-access redundant hashing |
GB9408894D0 (en) * | 1994-05-05 | 1994-06-22 | Secr Defence | Electronic circuit |
US5758257A (en) * | 1994-11-29 | 1998-05-26 | Herz; Frederick | System and method for scheduling broadcast of and access to video programs and other data using customer profiles |
JP2914226B2 (en) | 1995-06-16 | 1999-06-28 | 日本電気株式会社 | Transformation encoding of digital signal enabling reversible transformation |
US5873076A (en) * | 1995-09-15 | 1999-02-16 | Infonautics Corporation | Architecture for processing search queries, retrieving documents identified thereby, and method for using same |
US5742816A (en) * | 1995-09-15 | 1998-04-21 | Infonautics Corporation | Method and apparatus for identifying textual documents and multi-mediafiles corresponding to a search topic |
AU1566597A (en) * | 1995-12-27 | 1997-08-11 | Gary B. Robinson | Automated collaborative filtering in world wide web advertising |
US6457004B1 (en) * | 1997-07-03 | 2002-09-24 | Hitachi, Ltd. | Document retrieval assisting method, system and service using closely displayed areas for titles and topics |
US6092091A (en) * | 1996-09-13 | 2000-07-18 | Kabushiki Kaisha Toshiba | Device and method for filtering information, device and method for monitoring updated document information and information storage medium used in same devices |
US6285999B1 (en) * | 1997-01-10 | 2001-09-04 | The Board Of Trustees Of The Leland Stanford Junior University | Method for node ranking in a linked database |
JPH10247201A (en) * | 1997-03-05 | 1998-09-14 | Nippon Telegr & Teleph Corp <Ntt> | Information guidance system with information evaluating value |
US7636732B1 (en) * | 1997-05-30 | 2009-12-22 | Sun Microsystems, Inc. | Adaptive meta-tagging of websites |
US5893111A (en) * | 1997-06-13 | 1999-04-06 | Sharon, Jr.; Paul A. | Ad taking pagination information system |
US6078916A (en) * | 1997-08-01 | 2000-06-20 | Culliss; Gary | Method for organizing information |
US6182068B1 (en) * | 1997-08-01 | 2001-01-30 | Ask Jeeves, Inc. | Personalized search methods |
US6014665A (en) * | 1997-08-01 | 2000-01-11 | Culliss; Gary | Method for organizing information |
US5956722A (en) * | 1997-09-23 | 1999-09-21 | At&T Corp. | Method for effective indexing of partially dynamic documents |
US6389436B1 (en) * | 1997-12-15 | 2002-05-14 | International Business Machines Corporation | Enhanced hypertext categorization using hyperlinks |
US6067565A (en) * | 1998-01-15 | 2000-05-23 | Microsoft Corporation | Technique for prefetching a web page of potential future interest in lieu of continuing a current information download |
US6182133B1 (en) * | 1998-02-06 | 2001-01-30 | Microsoft Corporation | Method and apparatus for display of information prefetching and cache status having variable visual indication based on a period of time since prefetching |
US6163778A (en) * | 1998-02-06 | 2000-12-19 | Sun Microsystems, Inc. | Probabilistic web link viability marker and web page ratings |
AU3292699A (en) * | 1998-02-13 | 1999-08-30 | Yahoo! Inc. | Search engine using sales and revenue to weight search results |
US6185558B1 (en) * | 1998-03-03 | 2001-02-06 | Amazon.Com, Inc. | Identifying the items most relevant to a current query based on items selected in connection with similar queries |
US6421675B1 (en) * | 1998-03-16 | 2002-07-16 | S. L. I. Systems, Inc. | Search engine |
US6457028B1 (en) * | 1998-03-18 | 2002-09-24 | Xerox Corporation | Method and apparatus for finding related collections of linked documents using co-citation analysis |
US6990437B1 (en) * | 1999-07-02 | 2006-01-24 | Abu El Ata Nabil A | Systems and method for determining performance metrics for constructing information systems |
US6638314B1 (en) * | 1998-06-26 | 2003-10-28 | Microsoft Corporation | Method of web crawling utilizing crawl numbers |
US6421375B1 (en) * | 1998-07-28 | 2002-07-16 | Conexant Systems, Inc | Method and apparatus for transmitting control signals in a data communication system having a fully digital communication channel |
US7765179B2 (en) * | 1998-12-01 | 2010-07-27 | Alcatel-Lucent Usa Inc. | Method and apparatus for resolving domain names of persistent web resources |
US6615242B1 (en) * | 1998-12-28 | 2003-09-02 | At&T Corp. | Automatic uniform resource locator-based message filter |
US6598054B2 (en) * | 1999-01-26 | 2003-07-22 | Xerox Corporation | System and method for clustering data objects in a collection |
JP3347088B2 (en) * | 1999-02-12 | 2002-11-20 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Related information search method and system |
US6510406B1 (en) * | 1999-03-23 | 2003-01-21 | Mathsoft, Inc. | Inverse inference engine for high performance web search |
US6907566B1 (en) * | 1999-04-02 | 2005-06-14 | Overture Services, Inc. | Method and system for optimum placement of advertisements on a webpage |
US7752251B1 (en) * | 2000-04-14 | 2010-07-06 | Brian Mark Shuster | Method, apparatus and system for hosting information exchange groups on a wide area network |
CA2372867A1 (en) * | 1999-05-07 | 2000-11-16 | Carlos Cardona | System and method for database retrieval, indexing and statistical analysis |
US6350271B1 (en) | 1999-05-17 | 2002-02-26 | Micrus Corporation | Clot retrieval device |
JP2000339316A (en) | 1999-05-25 | 2000-12-08 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for collecting retrieval link type information and recording medium with its method stored therein |
US7110993B2 (en) * | 1999-05-28 | 2006-09-19 | Overture Services, Inc. | System and method for influencing a position on a search result list generated by a computer network search engine |
US6269361B1 (en) | 1999-05-28 | 2001-07-31 | Goto.Com | System and method for influencing a position on a search result list generated by a computer network search engine |
JP2001005705A (en) * | 1999-06-22 | 2001-01-12 | Hitachi Ltd | Document information management system |
US6665665B1 (en) * | 1999-07-30 | 2003-12-16 | Verizon Laboratories Inc. | Compressed document surrogates |
US6321228B1 (en) * | 1999-08-31 | 2001-11-20 | Powercast Media, Inc. | Internet search system for retrieving selected results from a previous search |
US6839680B1 (en) * | 1999-09-30 | 2005-01-04 | Fujitsu Limited | Internet profiling |
AU1797401A (en) * | 1999-11-22 | 2001-06-04 | Avenue, A, Inc. | Targeting electronic advertising placement in accordance with an analysis of user inclination and affinity |
US6751612B1 (en) * | 1999-11-29 | 2004-06-15 | Xerox Corporation | User query generate search results that rank set of servers where ranking is based on comparing content on each server with user query, frequency at which content on each server is altered using web crawler in a search engine |
EP1107128A1 (en) * | 1999-12-03 | 2001-06-13 | Hyundai Electronics Industries Co., Ltd. | Apparatus and method for checking the validity of links in a computer network |
AUPQ475799A0 (en) * | 1999-12-20 | 2000-01-20 | Youramigo Pty Ltd | An internet indexing system and method |
US8661111B1 (en) * | 2000-01-12 | 2014-02-25 | The Nielsen Company (Us), Llc | System and method for estimating prevalence of digital content on the world-wide-web |
US6546388B1 (en) * | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6883135B1 (en) * | 2000-01-28 | 2005-04-19 | Microsoft Corporation | Proxy server using a statistical model |
EP3367268A1 (en) * | 2000-02-22 | 2018-08-29 | Nokia Technologies Oy | Spatially coding and displaying information |
US7567958B1 (en) * | 2000-04-04 | 2009-07-28 | Aol, Llc | Filtering system for providing personalized information in the absence of negative data |
US20010030773A1 (en) * | 2000-04-17 | 2001-10-18 | Satoshi Matsuura | Digital photograph system |
JP3562572B2 (en) * | 2000-05-02 | 2004-09-08 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Detect and track new items and new classes in database documents |
US6789076B1 (en) * | 2000-05-11 | 2004-09-07 | International Business Machines Corp. | System, method and program for augmenting information retrieval in a client/server network using client-side searching |
JP2001326635A (en) * | 2000-05-16 | 2001-11-22 | Matsushita Electric Ind Co Ltd | Charging system for the internet |
AU2001264928A1 (en) * | 2000-05-25 | 2001-12-03 | Kanisa Inc. | System and method for automatically classifying text |
JP2002007801A (en) | 2000-06-21 | 2002-01-11 | Nec Corp | On-line shopping system, method and server for providing credit information, and recording medium of program therefor |
US20020022999A1 (en) * | 2000-06-23 | 2002-02-21 | Shuster Brian Mark | Method and apparatus for providing audio advertisements in a computer network |
US7003513B2 (en) | 2000-07-04 | 2006-02-21 | International Business Machines Corporation | Method and system of weighted context feedback for result improvement in information retrieval |
JP2002024065A (en) | 2000-07-07 | 2002-01-25 | Ricoh Co Ltd | Document management system, document managing method and recording medium in which program to execute its method is recorded |
US7080073B1 (en) * | 2000-08-18 | 2006-07-18 | Firstrain, Inc. | Method and apparatus for focused crawling |
US7146416B1 (en) * | 2000-09-01 | 2006-12-05 | Yahoo! Inc. | Web site activity monitoring system with tracking by categories and terms |
NO313399B1 (en) * | 2000-09-14 | 2002-09-23 | Fast Search & Transfer Asa | Procedure for searching and analyzing information in computer networks |
AUPR033800A0 (en) * | 2000-09-25 | 2000-10-19 | Telstra R & D Management Pty Ltd | A document categorisation system |
US6684205B1 (en) * | 2000-10-18 | 2004-01-27 | International Business Machines Corporation | Clustering hypertext with applications to web searching |
JP3934325B2 (en) * | 2000-10-31 | 2007-06-20 | 株式会社日立製作所 | Document search method, document search apparatus, and storage medium for document search program |
FR2816734B1 (en) * | 2000-11-15 | 2003-03-14 | Linkkit | METHOD FOR SEARCHING, SELECTING AND MAPPING WEB PAGES |
US8862656B2 (en) * | 2000-11-21 | 2014-10-14 | Chironet, Llc | Performance outcomes benchmarking |
US7130889B2 (en) | 2000-11-29 | 2006-10-31 | Ncr Corporation | Method of printing information by a network kiosk |
US20020078045A1 (en) * | 2000-12-14 | 2002-06-20 | Rabindranath Dutta | System, method, and program for ranking search results using user category weighting |
JP2002183216A (en) * | 2000-12-18 | 2002-06-28 | Fuji Electric Co Ltd | Time series information storage / reproduction device |
US7356530B2 (en) * | 2001-01-10 | 2008-04-08 | Looksmart, Ltd. | Systems and methods of retrieving relevant information |
US7359944B2 (en) * | 2001-02-07 | 2008-04-15 | Lg Electronics Inc. | Method of providing digital electronic book |
JP2002245070A (en) * | 2001-02-20 | 2002-08-30 | Hitachi Ltd | Data display method and apparatus, and medium storing processing program therefor |
US8001118B2 (en) * | 2001-03-02 | 2011-08-16 | Google Inc. | Methods and apparatus for employing usage statistics in document retrieval |
US20030018659A1 (en) * | 2001-03-14 | 2003-01-23 | Lingomotors, Inc. | Category-based selections in an information access environment |
US20020188635A1 (en) * | 2001-03-20 | 2002-12-12 | Larson Stephen C. | System and method for incorporation of print-ready advertisement in digital newspaper editions |
US20020161838A1 (en) * | 2001-04-27 | 2002-10-31 | Pickover Cilfford A. | Method and apparatus for targeting information |
US7194483B1 (en) * | 2001-05-07 | 2007-03-20 | Intelligenxia, Inc. | Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information |
US7299219B2 (en) * | 2001-05-08 | 2007-11-20 | The Johns Hopkins University | High refresh-rate retrieval of freshly published content using distributed crawling |
JP4489994B2 (en) * | 2001-05-11 | 2010-06-23 | 富士通株式会社 | Topic extraction apparatus, method, program, and recording medium for recording the program |
JP4025517B2 (en) * | 2001-05-31 | 2007-12-19 | 株式会社日立製作所 | Document search system and server |
US7035772B2 (en) * | 2001-05-31 | 2006-04-25 | International Business Machines Corporation | Method and apparatus for calculating data integrity metrics for web server activity log analysis |
US7058624B2 (en) * | 2001-06-20 | 2006-06-06 | Hewlett-Packard Development Company, L.P. | System and method for optimizing search results |
US7299270B2 (en) * | 2001-07-10 | 2007-11-20 | Lycos, Inc. | Inferring relations between internet objects that are not connected directly |
US7146409B1 (en) * | 2001-07-24 | 2006-12-05 | Brightplanet Corporation | System and method for efficient control and capture of dynamic database content |
JP2003046764A (en) | 2001-08-03 | 2003-02-14 | Matsushita Graphic Communication Systems Inc | Page space transmission system and method therefor |
US7076483B2 (en) * | 2001-08-27 | 2006-07-11 | Xyleme Sa | Ranking nodes in a graph |
US20040205454A1 (en) * | 2001-08-28 | 2004-10-14 | Simon Gansky | System, method and computer program product for creating a description for a document of a remote network data source for later identification of the document and identifying the document utilizing a description |
US20030046098A1 (en) * | 2001-09-06 | 2003-03-06 | Seong-Gon Kim | Apparatus and method that modifies the ranking of the search results by the number of votes cast by end-users and advertisers |
JP4283466B2 (en) * | 2001-10-12 | 2009-06-24 | 富士通株式会社 | Document arrangement method based on link relationship |
JP2003122699A (en) | 2001-10-15 | 2003-04-25 | Toshiba Corp | Information processing system and its peripheral equipment |
US6944609B2 (en) * | 2001-10-18 | 2005-09-13 | Lycos, Inc. | Search results using editor feedback |
US20030101126A1 (en) * | 2001-11-13 | 2003-05-29 | Cheung Dominic Dough-Ming | Position bidding in a pay for placement database search system |
US20030101166A1 (en) * | 2001-11-26 | 2003-05-29 | Fujitsu Limited | Information analyzing method and system |
US6763362B2 (en) * | 2001-11-30 | 2004-07-13 | Micron Technology, Inc. | Method and system for updating a search engine |
US7249034B2 (en) * | 2002-01-14 | 2007-07-24 | International Business Machines Corporation | System and method for publishing a person's affinities |
US7565367B2 (en) * | 2002-01-15 | 2009-07-21 | Iac Search & Media, Inc. | Enhanced popularity ranking |
US20110066510A1 (en) * | 2002-01-16 | 2011-03-17 | Galip Talegon | Methods for valuing and placing advertising |
US20030135460A1 (en) * | 2002-01-16 | 2003-07-17 | Galip Talegon | Methods for valuing and placing advertising |
JP4003468B2 (en) * | 2002-02-05 | 2007-11-07 | 株式会社日立製作所 | Method and apparatus for retrieving similar data by relevance feedback |
US20040205569A1 (en) * | 2002-02-06 | 2004-10-14 | Mccarty Jon S. | Method and system to manage outdated web page links in a computing system |
US7343365B2 (en) * | 2002-02-20 | 2008-03-11 | Microsoft Corporation | Computer system architecture for automatic context associations |
US7188107B2 (en) * | 2002-03-06 | 2007-03-06 | Infoglide Software Corporation | System and method for classification of documents |
US7203909B1 (en) * | 2002-04-04 | 2007-04-10 | Microsoft Corporation | System and methods for constructing personalized context-sensitive portal pages or views by analyzing patterns of users' information access activities |
US7085832B2 (en) * | 2002-04-30 | 2006-08-01 | International Business Machines Corporation | Method and apparatus for enabling an internet web server to keep an accurate count of page hits |
US6993586B2 (en) | 2002-05-09 | 2006-01-31 | Microsoft Corporation | User intention modeling for web navigation |
US7599911B2 (en) * | 2002-08-05 | 2009-10-06 | Yahoo! Inc. | Method and apparatus for search ranking using human input and automated ranking |
US8375286B2 (en) * | 2002-09-19 | 2013-02-12 | Ancestry.com Operations, Inc. | Systems and methods for displaying statistical information on a web page |
US20040059625A1 (en) * | 2002-09-20 | 2004-03-25 | Ncr Corporation | Method for providing feedback to advertising on interactive channels |
US7568148B1 (en) * | 2002-09-20 | 2009-07-28 | Google Inc. | Methods and apparatus for clustering news content |
US7158983B2 (en) * | 2002-09-23 | 2007-01-02 | Battelle Memorial Institute | Text analysis technique |
US20040064447A1 (en) * | 2002-09-27 | 2004-04-01 | Simske Steven J. | System and method for management of synonymic searching |
US6886010B2 (en) * | 2002-09-30 | 2005-04-26 | The United States Of America As Represented By The Secretary Of The Navy | Method for data and text mining and literature-based discovery |
US7130844B2 (en) * | 2002-10-31 | 2006-10-31 | International Business Machines Corporation | System and method for examining, calculating the age of an document collection as a measure of time since creation, visualizing, identifying selectively reference those document collections representing current activity |
US20040098405A1 (en) * | 2002-11-16 | 2004-05-20 | Michael Zrubek | System and Method for Automated Link Analysis |
US7792827B2 (en) * | 2002-12-31 | 2010-09-07 | International Business Machines Corporation | Temporal link analysis of linked entities |
US7016889B2 (en) * | 2003-01-30 | 2006-03-21 | Hewlett-Packard Development Company, Lp. | System and method for identifying useful content in a knowledge repository |
US20040193698A1 (en) * | 2003-03-24 | 2004-09-30 | Sadasivuni Lakshminarayana | Method for finding convergence of ranking of web page |
US20040225644A1 (en) * | 2003-05-09 | 2004-11-11 | International Business Machines Corporation | Method and apparatus for search engine World Wide Web crawling |
US7283997B1 (en) * | 2003-05-14 | 2007-10-16 | Apple Inc. | System and method for ranking the relevance of documents retrieved by a query |
US20040249871A1 (en) * | 2003-05-22 | 2004-12-09 | Mehdi Bazoon | System and method for automatically removing documents from a knowledge repository |
US7146361B2 (en) * | 2003-05-30 | 2006-12-05 | International Business Machines Corporation | System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a Weighted AND (WAND) |
US7685117B2 (en) * | 2003-06-05 | 2010-03-23 | Hayley Logistics Llc | Method for implementing search engine |
US7308643B1 (en) * | 2003-07-03 | 2007-12-11 | Google Inc. | Anchor tag indexing in a web crawler system |
US20050060290A1 (en) * | 2003-09-15 | 2005-03-17 | International Business Machines Corporation | Automatic query routing and rank configuration for search queries in an information retrieval system |
US7739281B2 (en) * | 2003-09-16 | 2010-06-15 | Microsoft Corporation | Systems and methods for ranking documents based upon structurally interrelated information |
US7685296B2 (en) * | 2003-09-25 | 2010-03-23 | Microsoft Corporation | Systems and methods for client-based web crawling |
US7346839B2 (en) | 2003-09-30 | 2008-03-18 | Google Inc. | Information retrieval based on historical data |
US7797316B2 (en) * | 2003-09-30 | 2010-09-14 | Google Inc. | Systems and methods for determining document freshness |
US20050102282A1 (en) | 2003-11-07 | 2005-05-12 | Greg Linden | Method for personalized search |
US8631001B2 (en) | 2004-03-31 | 2014-01-14 | Google Inc. | Systems and methods for weighting a search query result |
US20050234877A1 (en) | 2004-04-08 | 2005-10-20 | Yu Philip S | System and method for searching using a temporal dimension |
US7519586B2 (en) * | 2004-04-30 | 2009-04-14 | International Business Machines Corporation | Method of searching |
US20050256848A1 (en) | 2004-05-13 | 2005-11-17 | International Business Machines Corporation | System and method for user rank search |
US7562068B2 (en) | 2004-06-30 | 2009-07-14 | Microsoft Corporation | System and method for ranking search results based on tracked user preferences |
US20060047643A1 (en) | 2004-08-31 | 2006-03-02 | Chirag Chaman | Method and system for a personalized search engine |
US20060195443A1 (en) | 2005-02-11 | 2006-08-31 | Franklin Gary L | Information prioritisation system and method |
US20060248055A1 (en) | 2005-04-28 | 2006-11-02 | Microsoft Corporation | Analysis and comparison of portfolios by classification |
US8438142B2 (en) | 2005-05-04 | 2013-05-07 | Google Inc. | Suggesting and refining user input based on original user input |
US7853485B2 (en) * | 2005-11-22 | 2010-12-14 | Nec Laboratories America, Inc. | Methods and systems for utilizing content, dynamic patterns, and/or relational information for data analysis |
US9177124B2 (en) * | 2006-03-01 | 2015-11-03 | Oracle International Corporation | Flexible authentication framework |
-
2003
- 2003-12-31 US US10/748,664 patent/US7346839B2/en active Active
-
2004
- 2004-09-15 DE DE200420021886 patent/DE202004021886U1/en not_active Expired - Lifetime
- 2004-09-15 WO PCT/US2004/030000 patent/WO2005033978A1/en active Application Filing
- 2004-09-15 CA CA2757550A patent/CA2757550A1/en not_active Abandoned
- 2004-09-15 EP EP11186365.0A patent/EP2416263A3/en not_active Withdrawn
- 2004-09-15 JP JP2006533916A patent/JP2007507798A/en not_active Withdrawn
- 2004-09-15 CN CN200480033254.8A patent/CN1879107B/en not_active Expired - Lifetime
- 2004-09-15 AU AU2004277678A patent/AU2004277678C1/en not_active Expired
- 2004-09-15 EP EP11186372.6A patent/EP2416265A3/en not_active Withdrawn
- 2004-09-15 DE DE200420021885 patent/DE202004021885U1/en not_active Expired - Lifetime
- 2004-09-15 EP EP04784004A patent/EP1668551A1/en not_active Ceased
- 2004-09-15 EP EP11186370.0A patent/EP2416264A3/en not_active Withdrawn
- 2004-09-15 EP EP11186363.5A patent/EP2416262A3/en not_active Withdrawn
- 2004-09-15 CA CA2540573A patent/CA2540573C/en not_active Expired - Lifetime
-
2006
- 2006-11-20 US US11/561,625 patent/US7840572B2/en not_active Expired - Fee Related
- 2006-11-21 US US11/562,285 patent/US8112426B2/en not_active Expired - Lifetime
- 2006-11-22 US US11/562,617 patent/US8051071B2/en active Active
- 2006-11-30 US US11/565,004 patent/US20070094255A1/en not_active Abandoned
- 2006-11-30 US US11/565,026 patent/US8316029B2/en not_active Expired - Fee Related
-
2007
- 2007-01-09 JP JP2007001794A patent/JP4603556B2/en not_active Expired - Fee Related
-
2010
- 2010-10-01 US US12/896,744 patent/US8407231B2/en not_active Expired - Lifetime
- 2010-10-12 US US12/902,966 patent/US8521749B2/en not_active Expired - Lifetime
-
2011
- 2011-02-10 JP JP2011027886A patent/JP5312498B2/en not_active Expired - Lifetime
- 2011-06-30 US US13/174,243 patent/US8234273B2/en not_active Expired - Fee Related
- 2011-06-30 US US13/174,304 patent/US8527524B2/en not_active Expired - Lifetime
- 2011-09-14 US US13/232,599 patent/US8549014B2/en not_active Expired - Lifetime
- 2011-09-26 US US13/244,841 patent/US8224827B2/en not_active Expired - Fee Related
- 2011-09-26 US US13/244,867 patent/US8266143B2/en not_active Expired - Fee Related
- 2011-09-26 US US13/244,863 patent/US8185522B2/en not_active Expired - Fee Related
- 2011-09-26 US US13/244,848 patent/US8239378B2/en not_active Expired - Fee Related
- 2011-09-26 US US13/244,853 patent/US8244723B2/en not_active Expired - Fee Related
- 2011-09-30 US US13/250,703 patent/US8577901B2/en not_active Expired - Lifetime
-
2012
- 2012-04-24 US US13/454,424 patent/US8639690B2/en not_active Expired - Lifetime
- 2012-09-14 US US13/615,730 patent/US9767478B2/en active Active
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1879107B (en) | Information retrieval based on historical data | |
US7409402B1 (en) | Systems and methods for presenting advertising content based on publisher-selected labels | |
EP1775665A2 (en) | Document scoring based on link-based criteria |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: American California Patentee after: Google Inc. Address before: American California Patentee before: GOOGLE Inc. |
|
CX01 | Expiry of patent term | ||
CX01 | Expiry of patent term |
Granted publication date: 20141015 |