US8812493B2 - Search results ranking using editing distance and document information - Google Patents
Search results ranking using editing distance and document information Download PDFInfo
- Publication number
- US8812493B2 US8812493B2 US12/101,951 US10195108A US8812493B2 US 8812493 B2 US8812493 B2 US 8812493B2 US 10195108 A US10195108 A US 10195108A US 8812493 B2 US8812493 B2 US 8812493B2
- Authority
- US
- United States
- Prior art keywords
- string
- document
- term
- edit distance
- terms
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
- 238000013528 artificial neural network Methods 0.000 claims abstract description 22
- 150000001875 compounds Chemical class 0.000 claims abstract description 15
- 238000001914 filtration Methods 0.000 claims abstract description 9
- 238000000034 method Methods 0.000 claims description 53
- 238000003780 insertion Methods 0.000 claims description 39
- 238000012217 deletion Methods 0.000 claims description 38
- 230000037430 deletion Effects 0.000 claims description 36
- 230000037431 insertion Effects 0.000 claims description 36
- 238000012545 processing Methods 0.000 claims description 30
- 230000008569 process Effects 0.000 claims description 21
- 230000006870 function Effects 0.000 claims description 15
- 238000004891 communication Methods 0.000 description 13
- 238000010586 diagram Methods 0.000 description 8
- 239000011159 matrix material Substances 0.000 description 8
- 230000003287 optical effect Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 5
- 230000006855 networking Effects 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000003066 decision tree Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
Definitions
- a user can enter a query by selecting the topmost relevant documents out of an indexed collection of URLs (universal resource locators) that match the query.
- the search engine utilizes one or more methods (e.g., an inverted index data structure) that map keywords to documents.
- a first step performed by the engine can be to identify the set of candidate documents that contain the keywords specified by the user query. These keywords can be located in the document body or the metadata, or additional metadata about this document that is actually stored in other documents or datastores (such as anchor text).
- the search engine performs a second step of ranking of the candidate documents with respect to relevance.
- the search engine utilizes a ranking function to predict the degree of relevance of a document to a particular query.
- the ranking function takes multiple features from the document as inputs and computes a number that allows the search engine to sort the documents by predicted relevance.
- the quality of the ranking function with respect as to how accurately the function predicts relevance of a document is ultimately determined by the user satisfaction with the search results or how many times on average the user finds the answer to the question posed.
- the overall user satisfaction with the system can be approximated by a single number (or metric), because the number can be optimized by varying the ranking function.
- the metrics are computed over a representative set of queries that are selected up front by random sampling of the query logs, and involve assigning relevance labels to each result returned by the engine for each of the evaluation queries.
- these processes for document ranking and relevance are still inefficient in providing the desired results.
- the architecture provides a mechanism for extracting document information from documents received as search results based on a query string and computing an edit distance between a data string and the query string.
- the data string can be a short and accurate description of the document obtained from document information such as TAUC (title, anchor text, URL (uniform resource locator), and clicks), for example.
- the edit distance is employed in determining relevance of the document as part of result ranking.
- the mechanism improves the relevance of search results ranking be employing a set of proximity-related features to detect near-matches of a whole query or part of the query.
- the edit distance is processed to evaluate how close the query string is to a given data stream that includes the document information.
- the architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results.
- a neural network e.g., 2-layer
- FIG. 1 illustrates a computer-implemented relevance system.
- FIG. 2 illustrates a flow chart of an exemplary the matching algorithm for computing edit distance.
- FIG. 3 illustrates processing and generating edit distance values based on a query string and data string using the modified edit distance and matching algorithm.
- FIG. 4 illustrates another example of processing and generating edit distance values based on a query string and data string using the modified edit distance and matching algorithm.
- FIG. 5 illustrates a computer-implemented relevance system that employs a neural network to assist in generating a relevance score for the document.
- FIG. 6 illustrates the types of data that can be employed in the document information for determining the edit distance between the query string and the data string.
- FIG. 7 illustrates an index-time processing data flow.
- FIG. 8 illustrates a block diagram showing inputs to the neural network from the index process of FIG. 7 for result ranking.
- FIG. 9 illustrates an exemplary system implementation of a neural network, edit distance inputs and raw feature inputs for computing generating search results.
- FIG. 10 illustrates a method of determining document relevance of a document result set.
- FIG. 11 illustrates a method of computing relevance of a document.
- FIG. 12 illustrates a block diagram of a computing system operable to execute edit distance processing for search result ranking using TAUC features in accordance with the disclosed architecture.
- the disclosed architecture improves the relevance of search results ranking by implementing a set of proximity-related features to detect near-matches of a whole query or matches with accurate metadata about the document, such as titles, anchors, URLs, or clicks. For example, consider a query “company store”, a document title “company store online” of a first document and a document title “new NEC LCD monitors in company store” of a second document. Assuming other properties is same for both the first and second documents, the architecture assigns a score for a document based on how much editing effort is devoted to make a chosen stream match the query. In this example, the document title is selected for evaluation.
- first document requires only one delete operation (delete the term “online”) to make a full match, while the title of second document requires five deletes (delete the terms “new”, “NEC”, “LCD”, “monitors” and “in”). Thus, the first document is computed to be more relevant.
- the title is one element of TAUC (title, anchor, URL, and clicks) document information for which processing can be applied to some streams of data (e.g., a URL) so that query terms can be found from compound terms.
- TAUC title, anchor, URL, and clicks
- a URL some streams of data
- FIG. 1 illustrates a computer-implemented relevance system 100 .
- the system 100 includes a processing component 102 for extracting document information 104 from a document 106 received as search results 108 based on a query string 110 .
- the system 100 can also include a proximity component 112 for computing the edit distance 114 between a data string 116 derived from the document information 104 and the query string 110 .
- the edit distance 114 is employed in determining relevance of the document 106 as part of the search results 108 .
- the document information 104 employed to generate the data string 116 can include title information (or characters), link information (e.g., URL characters), click stream information, and/or anchor text (or characters), for example.
- the processing component 102 splits compound terms of the document information 104 at index time to compute the edit distance 114 .
- the processing component 102 also filters document information such as anchor text at index time to compute a top-ranked set of anchor text.
- the computing of the edit distance 114 is based on insertion and deletion of terms to increase proximity (bring closer) between the data string 116 and the query string 110 .
- the computing of the edit distance 114 can also be based on costs associated with insertion and deletion of terms to increase the proximity (bring closer) between the data string 116 and the query string 110 .
- This term processing can be performed according to four operations: insert a non-query word into the query string 110 ; insert a query term into the query string 110 ; delete a TAUC term from the query string 110 ; and/or, delete a non-TAUC term from the query string 110 .
- the edit distance 114 is based on the insertion and deletion operations, but not substitution.
- a word can be inserted into the query string 110 , which exists in the original query string 110 , then the cost is defined as one; otherwise, the cost is defined as w 1 ( ⁇ 1).
- w 1 is a weighting parameter that is tuned. For example, if the query string 110 is AB, then the cost of generating the data string of ABC is higher than that of the data string ABA.
- TAUC a weighting parameter that is tuned.
- cost there can be two types of cost for deletion. Again, consider a scenario of generating the data string 116 from the query string 110 . When deleting a term in the query string 110 , which term exists in the original data string 116 , then the cost is defined as one; otherwise, the cost is defined as w 2 ( ⁇ 1).
- Another type of cost is a position cost. If a deletion or insertion occurs at the first position of the data string 116 , then there is an additional cost (+w 3 ). The intuition is that a matching at the beginning of the two strings (query string 110 and data string 116 ) is given greater importance than matches later in the strings.
- the query string 110 is “cnn”
- FIG. 2 illustrates a flow chart of an exemplary the modified matching algorithm 200 for computing edit distance. While, for purposes of simplicity of explanation, the one or more methodologies shown herein, for example, in the form of a flow chart or flow diagram, are shown and described as a series of acts, it is to be understood and appreciated that the methodologies are not limited by the order of acts, as some acts may, in accordance therewith, occur in a different order and/or concurrently with other acts from that shown and described herein. For example, those skilled in the art will understand and appreciate that a methodology could alternatively be represented as a series of interrelated states or events, such as in a state diagram. Moreover, not all acts illustrated in a methodology may be required for a novel implementation.
- elements of the query string and the data (or target) string are enumerated. This is accomplished by setting n to be the length of the query string (where each term in query string is s[i]), and setting m to be the length of the target (or data) string (where each term in target string is denoted t[j]).
- a matrix is constructed that contains 0 . . . m rows and 0 . . . n columns (where each term in the matrix is denoted as d[j,i]).
- the first row is initialized with a value that depends on the different cost of deletion and the first column is initialized with a value that depends on the different cost of insertion.
- each character of the query string is examined (i from 1 to n).
- each character of the target data string is examined (j from 1 to m).
- d[j,i] contains the edit distance between strings s[0 . . . i] and t[0 . . . j].
- d[0,0] 0 by definition (no edits needed to make an empty string equal to empty string).
- d[0, y] d[0,y ⁇ 1]+(w 2 or w 4 ). If it is known how many edits are used to make the string d[0,y ⁇ 1], then d[0,y] can be calculated as d[0, y ⁇ 1]+cost of deleting current character from the target string, which cost can be w 2 or w 4 . The cost w 2 is used if the current character is present in both s[0 . .
- d[x, 0] d[x ⁇ 1,0]+(w 1 or w 3 ). If it is known how many edits are used to make the string d[x ⁇ 1,0] then d[x,0] can be calculated as d[x ⁇ 1,0]+cost of insertion of the current character from s to t, which cost can be w 1 or w 3 . The cost w 1 is used if the current character is present in both s[0 . . . n], t[0 . . . m]; and w 3 , otherwise.
- FIG. 3 illustrates processing and generating edit distance values based on a query string and data string using the modified edit distance and matching algorithm.
- the process involves one or more of left-right, top-down, and diagonal computations.
- a query string of terms “A B C” is processed against with a target data string of terms “C B A X” (where X denotes a term not in the query string).
- the process for computing an edit distance can be performed in different ways; however, the specific details for performing a modified version of an edit distance is different as computed according to the disclosed architecture.
- the query string 302 is placed along the horizontal axis and the target data string 304 is along the vertical axis of the matrix 300 .
- the description will use the matrix 300 denoted with four columns (0-3) and five rows (0-4).
- the intersecting cell d[0,0] receives “0” since the compare of the empty cell of the query string ABC to the empty cell of the target data string CBAX does not cause insertion or deletion of a term to make the query string the same as the target data string.
- the “terms” are the same so the edit distance is zero.
- the empty cell of the query string row is compared to the first term C of the target data string 304 .
- One deletion is used to make the strings the same, with an edit distance of “1” in d[1,0].
- the compare is made between the A term of the query string 302 and the C term of the target data string 304 .
- a deletion and insertion is used to make the strings alike, thus, a value of “2” is inserted into cell d[1,1]. Skipping to the last cell d[1,3], the matching process for matching ABC to C results in using two deletions for an edit distance of “2” in the cell d[1,3].
- matching terms ABC to terms CBAX results in an edit distance of “8” in cell d[4,3] using insertion/deletion in the first term C of the target string for a value of “2”, a value of “0” for the match between the B terms, an insertion/deletion for the match of the third terms C and A for a value of “2”, an insertion of the term X for a value of “1” and a value of “3” for position cost, resulting in a final edit distance value of “8” in cell d[4,3].
- FIG. 4 illustrates another example of processing and generating edit distance values based on a query string and target data string using the modified edit distance and matching algorithm.
- working row 0 from left to right matching term A of the query string 402 to the empty cell before the target string 404 results in one insertion in the target string 404 of the term A for a value of “1” cell d[0,1].
- Matching terms AB of the query string 402 to the empty cell before the target string 404 results in two insertions in the target string 404 of the terms AB for a value of “2” cell d[0,2], and matching terms ABC of the query string 402 to the empty cell before the target string 404 results in the two insertions in the target string 404 of the terms AB value plus value w 4 26 for the term C for a value of “28” in cell d[0,3], since the term C is not in both strings.
- Matching terms AB of the query string 402 to the term A of the target string 404 results in one insertion in the target string 404 for the term B for a minimum value of “1” cell d[1,2].
- matching term A of the query string 402 to the terms AB of the target string 404 results in a deletion in the target string 404 for a value of “1” in cell d[2,1].
- FIG. 5 illustrates a computer-implemented relevance system 500 that employs a neural network 502 to assist in generating a relevance score 504 for the document 106 .
- the system 500 includes the processing component 102 for extracting document information 104 from the document 106 received as the search results 108 based on the query string 110 , and the proximity component 112 for computing the edit distance 114 between the data string 116 derived from the document information 104 and the query string 110 .
- the edit distance 114 is employed in determining relevance of the document 106 as part of the search results 108 .
- the neural network 502 can be employed to receive the document information 104 as an input for computing a relevance score for the document 106 . Based solely or in part on the relevance scores for some or all of the search results 108 , the documents in the search results 108 can be ranked.
- the system 500 employs the neural network 502 and codebase to generate the relevance score for ranking of the associated document in the search results 108 .
- anchor text There can be multiple instances of anchor text for a document, as well as URLs and clicks (where a click is a previously executed query for which this document was clicked on). The idea is that this document is more relevant for similar queries.
- the N anchor texts having the highest frequencies are selected.
- the ED score is calculated for each selected anchor.
- TAUC(Anchor) is used as a neural network input after applying a transform function.
- URL strings are split into parts using a set of characters as separators. Then terms are found in each part from a dictionary of title and anchor terms. Each occurrence of a term from dictionary is stored in an index with the position measured in characters from the beginning of the URL string.
- the result of ED processing is a neural network input, after application of a transform function.
- Another property that can be processed is the number of “clicks” the user enters for a given document content. Each time a user clicks on the document, a stream is entered into a database and associated with the document. This process can also be applied to stream data in the document information text such as short streams of data.
- the index-time URL processing algorithm splits the entire URL into parts using a set of characters as separators.
- the split function also sets urlpart.startpos to a position of part in the source string.
- the split function performs filtering of insignificant parts of the URL.
- query-time processing before ED at query time the occurrences of the query terms are read, a string of query terms constructed in the order of appearance in the source URL string, and space between the terms filled in with “non-query” word marks. For example, consider a query string of “company policy” and a resulting string of “company” “non-query term” “non-query term”.
- a parts_separator, query term positions, and stream length are determined to know how many parts are in the original URL string and what part contains a given query. Each part without terms is deemed to contain a “non-query term”. If a part does not start with a query term, a “non-query term” is inserted before the term. All spaces between query terms are filled with “non-query terms”.
- FIG. 6 illustrates the types of data that can be employed in the document information 104 for determining the edit distance between the query string and the data string.
- the document information 104 can include TAUC data 602 , such as title text 604 , anchor text 606 , URL 608 text or characters, and click information 610 , for example, for processing by the processing component 102 and generation of the data (or target) string 116 .
- the document information 104 can also include click information 610 related to the number of times a user clicks on document content, the type of content the user selects (via the click), the number of clicks on the content, the document in general, etc.
- FIG. 7 illustrates an index-time processing data flow 700 .
- document information in the form of the title 604 , document anchors 606 , click information 610 , etc. are received based on document analysis and extraction.
- the title 604 is processed through a term-splitting algorithm 704 and then to a dictionary 706 .
- the dictionary 706 is a temporary storage of different terms found in the title 604 , anchors 606 , click information 610 , etc.
- the dictionary 706 is used to split the URL 608 via a URL splitting algorithm 708 .
- the output of the URL splitting algorithm 708 is sent to an indexing process 710 for relevance and ranking processing.
- the document anchors 606 can also be processed through a filter 712 for the top N anchors.
- the click information 610 can be processed directly via the indexing process 710 .
- Other document information can be processed accordingly (e.g., term splitting, filtering, etc.).
- FIG. 8 illustrates a block diagram 800 showing inputs to the neural network from the index process 710 of FIG. 7 for result ranking.
- the indexing process 710 can be used for computing a URL edit distance (ED) 802 relative to the query string 110 , a top-N-anchors ED 804 relative to the query string 110 , a title ED 806 relative to the query string 110 , a click ED 808 relative to the query string 110 , as well as other features 810 not related to edit distance, some or all of which (URL ED 802 , top-N-anchors ED 804 , title ED 806 , click ED 808 , and other features 810 ) can be employed as inputs to the neural network 502 , ultimately to find the relevance score for the associated document, and then ranking of the document among other document search results.
- the neural network 502 can be a 2-layer model that receives at least the TAUC features as raw input features that contribute to identifying relevance of the document. The neural network determines how
- neural network 502 is just one example of mathematical or computational models that can be employed for the relevance and ranking processing.
- Other forms of statistical regression can be employed such as naive Bayes, Bayesian networks, decision trees, fuzzy logic models, and other statistical classification models representing different patterns of independence can be employed, where classification is inclusive of methods used to assign rank and/or priority.
- FIG. 9 illustrates an exemplary system 900 implementation of the neural network 502 , edit distance inputs and raw feature inputs for computing generating search results.
- the set of raw ranking features 810 on the input(s) of the neural network 502 can include a BM25 function 902 (e.g., BM25F), click distance 904 , URL depth 906 , file types 908 , and language match 910 .
- the BM25 components can include body, title, author, anchor text, URL display name, and extracted title, for example.
- FIG. 10 illustrates a method of determining relevance.
- a query string is received as part of a search process.
- document information is extracted from a document returned during the search process.
- a data string is generated from the document information.
- the edit distance is computed between the data string and the query string.
- a relevance score is calculated based on the edit distance.
- Other aspects of the method can include employing term insertion as part of computing the edit distance and assessing an insertion cost for insertion of a term in the query string to generate the data string, the cost represented as a weighting parameter.
- the method can further comprise employing term deletion as part of computing the edit distance and assessing a deletion cost for deletion of a term in the query string to generate the data string, the cost represented as a weighting parameter.
- a position cost can be computed as part of computing the edit distance, the position cost associated with term insertion and/or term deletion of a term position in the data string. Additionally, a matching process is performed between characters of the data string and characters of the query string to compute an overall cost of computing the edit distance.
- the splitting compound terms of a URL of the data string can occur at index time.
- the method can further comprise the filtering of anchor text of the data string to find a top-ranked set of anchor text based on frequency of occurrence in the document and computing an edit distance score for anchor text in the set.
- the edit distance score derived from computing the edit distance, can be input into a two-layer neural network after application of a transform function, the score generated based on calculating the edit distance associated with at least one of title information, anchor information, click information, or URL information.
- FIG. 11 illustrates a method of computing relevance of a document.
- a query string is processed as part of a search process to return a result set of documents.
- a data string is generated based on the document information extracted from a document of the result set, the document information includes one or more of title information, anchor text information, click information, and URL information from the document.
- the edit distance is computed between the data string and the query string based on term insertion, term deletion, and term position.
- a relevance score is calculated based on the edit distance, the relevance score used to rank the document in the result set.
- the method can further comprise computing a cost associated with each of the term insertion, term deletion and term position, and factoring the cost into computation of the relevance score, and splitting compound terms of the URL information at index time and filtering the anchor text information at index time to find a top-ranked set of anchor text based on frequency of occurrence of the anchor text in the document.
- the reading of occurrences of the terms of the query string can be performed to construct a string of query terms in order of appearance in a source URL string and filling space between the terms with word marks.
- a component can be, but is not limited to being, a process running on a processor, a processor, a hard disk drive, multiple storage drives (of optical and/or magnetic storage medium), an object, an executable, a thread of execution, a program, and/or a computer.
- a component can be, but is not limited to being, a process running on a processor, a processor, a hard disk drive, multiple storage drives (of optical and/or magnetic storage medium), an object, an executable, a thread of execution, a program, and/or a computer.
- an application running on a server and the server can be a component.
- One or more components can reside within a process and/or thread of execution, and a component can be localized on one computer and/or distributed between two or more computers.
- FIG. 12 there is illustrated a block diagram of a computing system 1200 operable to execute edit distance processing for search result ranking using TAUC features in accordance with the disclosed architecture.
- FIG. 12 and the following discussion are intended to provide a brief, general description of a suitable computing system 1200 in which the various aspects can be implemented. While the description above is in the general context of computer-executable instructions that may run on one or more computers, those skilled in the art will recognize that a novel embodiment also can be implemented in combination with other program modules and/or as a combination of hardware and software.
- program modules include routines, programs, components, data structures, etc., that perform particular tasks or implement particular abstract data types.
- inventive methods can be practiced with other computer system configurations, including single-processor or multiprocessor computer systems, minicomputers, mainframe computers, as well as personal computers, hand-held computing devices, microprocessor-based or programmable consumer electronics, and the like, each of which can be operatively coupled to one or more associated devices.
- the illustrated aspects can also be practiced in distributed computing environments where certain tasks are performed by remote processing devices that are linked through a communications network.
- program modules can be located in both local and remote memory storage devices.
- Computer-readable media can be any available media that can be accessed by the computer and includes volatile and non-volatile media, removable and non-removable media.
- Computer-readable media can comprise computer storage media and communication media.
- Computer storage media includes volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules or other data.
- Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital video disk (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by the computer.
- the exemplary computing system 1200 for implementing various aspects includes a computer 1202 having a processing unit 1204 , a system memory 1206 and a system bus 1208 .
- the system bus 1208 provides an interface for system components including, but not limited to, the system memory 1206 to the processing unit 1204 .
- the processing unit 1204 can be any of various commercially available processors. Dual microprocessors and other multi-processor architectures may also be employed as the processing unit 1204 .
- the system bus 1208 can be any of several types of bus structure that may further interconnect to a memory bus (with or without a memory controller), a peripheral bus, and a local bus using any of a variety of commercially available bus architectures.
- the system memory 1206 can include non-volatile memory (NON-VOL) 1210 and/or volatile memory 1212 (e.g., random access memory (RAM)).
- NON-VOL non-volatile memory
- volatile memory 1212 e.g., random access memory (RAM)
- a basic input/output system (BIOS) can be stored in the non-volatile memory 1210 (e.g., ROM, EPROM, EEPROM, etc.), which BIOS are the basic routines that help to transfer information between elements within the computer 1202 , such as during start-up.
- the volatile memory 1212 can also include a high-speed RAM such as static RAM for caching data.
- the computer 1202 further includes an internal hard disk drive (HDD) 1214 (e.g., EIDE, SATA), which internal HDD 1214 may also be configured for external use in a suitable chassis, a magnetic floppy disk drive (FDD) 1216 , (e.g., to read from or write to a removable diskette 1218 ) and an optical disk drive 1220 , (e.g., reading a CD-ROM disk 1222 or, to read from or write to other high capacity optical media such as a DVD).
- the HDD 1214 , FDD 1216 and optical disk drive 1220 can be connected to the system bus 1208 by a HDD interface 1224 , an FDD interface 1226 and an optical drive interface 1228 , respectively.
- the HDD interface 1224 for external drive implementations can include at least one or both of Universal Serial Bus (USB) and IEEE 1394 interface technologies.
- the drives and associated computer-readable media provide nonvolatile storage of data, data structures, computer-executable instructions, and so forth.
- the drives and media accommodate the storage of any data in a suitable digital format.
- computer-readable media refers to a HDD, a removable magnetic diskette (e.g., FDD), and a removable optical media such as a CD or DVD, it should be appreciated by those skilled in the art that other types of media which are readable by a computer, such as zip drives, magnetic cassettes, flash memory cards, cartridges, and the like, may also be used in the exemplary operating environment, and further, that any such media may contain computer-executable instructions for performing novel methods of the disclosed architecture.
- a number of program modules can be stored in the drives and volatile memory 1212 , including an operating system 1230 , one or more application programs 1232 , other program modules 1234 , and program data 1236 .
- the one or more application programs 1232 , other program modules 1234 , and program data 1236 can include the system 100 and associated blocks, the system 500 and associated blocks, the document information 104 , TAUC data 602 , click information 610 , the data flow 700 (and algorithms), and block diagram 800 (and associated blocks).
- All or portions of the operating system, applications, modules, and/or data can also be cached in the volatile memory 1212 . It is to be appreciated that the disclosed architecture can be implemented with various commercially available operating systems or combinations of operating systems.
- a user can enter commands and information into the computer 1202 through one or more wire/wireless input devices, for example, a keyboard 1238 and a pointing device, such as a mouse 1240 .
- Other input devices may include a microphone, an IR remote control, a joystick, a game pad, a stylus pen, touch screen, or the like.
- These and other input devices are often connected to the processing unit 1204 through an input device interface 1242 that is coupled to the system bus 1208 , but can be connected by other interfaces such as a parallel port, IEEE 1394 serial port, a game port, a USB port, an IR interface, etc.
- a monitor 1244 or other type of display device is also connected to the system bus 1208 via an interface, such as a video adaptor 1246 .
- a computer typically includes other peripheral output devices (not shown), such as speakers, printers, etc.
- the computer 1202 may operate in a networked environment using logical connections via wire and/or wireless communications to one or more remote computers, such as a remote computer(s) 1248 .
- the remote computer(s) 1248 can be a workstation, a server computer, a router, a personal computer, portable computer, microprocessor-based entertainment appliance, a peer device or other common network node, and typically includes many or all of the elements described relative to the computer 1202 , although, for purposes of brevity, only a memory/storage device 1250 is illustrated.
- the logical connections depicted include wire/wireless connectivity to a local area network (LAN) 1252 and/or larger networks, for example, a wide area network (WAN) 1254 .
- LAN and WAN networking environments are commonplace in offices and companies, and facilitate enterprise-wide computer networks, such as intranets, all of which may connect to a global communications network, for example, the Internet.
- the computer 1202 When used in a LAN networking environment, the computer 1202 is connected to the LAN 1252 through a wire and/or wireless communication network interface or adaptor 1256 .
- the adaptor 1256 can facilitate wire and/or wireless communications to the LAN 1252 , which may also include a wireless access point disposed thereon for communicating with the wireless functionality of the adaptor 1256 .
- the computer 1202 can include a modem 1258 , or is connected to a communications server on the WAN 1254 , or has other means for establishing communications over the WAN 1254 , such as by way of the Internet.
- the modem 1258 which can be internal or external and a wire and/or wireless device, is connected to the system bus 1208 via the input device interface 1242 .
- program modules depicted relative to the computer 1202 can be stored in the remote memory/storage device 1250 . It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers can be used.
- the computer 1202 is operable to communicate with wire and wireless devices or entities using the IEEE 802 family of standards, such as wireless devices operatively disposed in wireless communication (e.g., IEEE 802.11 over-the-air modulation techniques) with, for example, a printer, scanner, desktop and/or portable computer, personal digital assistant (PDA), communications satellite, any piece of equipment or location associated with a wirelessly detectable tag (e.g., a kiosk, news stand, restroom), and telephone.
- PDA personal digital assistant
- the communication can be a predefined structure as with a conventional network or simply an ad hoc communication between at least two devices.
- Wi-Fi networks use radio technologies called IEEE 802.11x (a, b, g, etc.) to provide secure, reliable, fast wireless connectivity.
- IEEE 802.11x a, b, g, etc.
- a Wi-Fi network can be used to connect computers to each other, to the Internet, and to wire networks (which use IEEE 802.3-related media and functions).
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
d[j,i] = min( | ||
d[j−1,i−1] if s[i]=t[j]; | ||
d[j−1,i] + (w1, if s[j] is present in both strings; else, w3); | ||
d[j,i−1] + (w2, if t[i] is present in both strings; else, w4) | ||
) | ||
TAUC(Title)=ED(Title)
where TAUC(Title) is later used as an input to the neural network after application of a transform function, and ED(Title) is the edit distance of the title.
TAUC(Anchor)=Min{ED(Anchori)} i: top N anchors;
The intuition is that if a good match exists with one of the anchors, then it is sufficient. TAUC(Anchor) is used as a neural network input after applying a transform function.
Startpos: 0 |
Urlparts = split(url, dictionary) |
// find terms in different url parts. |
For each (term in dictionary) |
{ |
Int pos = 0; |
For each(urlpart in urlparts) |
{ |
pos = urlpart.Find(term, pos); |
while (pos >= 0) |
{ |
// parts_separator is used to distinguish different parts |
at query time |
storeOccurrence(term, pos + |
urlpart.startpos*parts_separator); |
pos = url.Find(term, pos + term.length); |
} |
} |
setIndexStreamLength(parts_separator * urlparts.Count); |
} |
Claims (20)
Priority Applications (12)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/101,951 US8812493B2 (en) | 2008-04-11 | 2008-04-11 | Search results ranking using editing distance and document information |
TW098106721A TWI486800B (en) | 2008-04-11 | 2009-03-02 | System and method for search results ranking using editing distance and document information |
JP2011504031A JP5492187B2 (en) | 2008-04-11 | 2009-03-10 | Search result ranking using edit distance and document information |
KR1020107022177A KR101557294B1 (en) | 2008-04-11 | 2009-03-10 | Search results ranking using editing distance and document information |
BRPI0909092-4A BRPI0909092A2 (en) | 2008-04-11 | 2009-03-10 | sorting search results using editing distance and document information |
PCT/US2009/036597 WO2009126394A1 (en) | 2008-04-11 | 2009-03-10 | Search results ranking using editing distance and document information |
EP20090730808 EP2289007B1 (en) | 2008-04-11 | 2009-03-10 | Search results ranking using editing distance and document information |
AU2009234120A AU2009234120B2 (en) | 2008-04-11 | 2009-03-10 | Search results ranking using editing distance and document information |
CN200980112928.6A CN101990670B (en) | 2008-04-11 | 2009-03-10 | Search results ranking using editing distance and document information |
RU2010141559/08A RU2501078C2 (en) | 2008-04-11 | 2009-03-10 | Ranking search results using edit distance and document information |
IL207830A IL207830A (en) | 2008-04-11 | 2010-08-26 | Search results ranking using editing distance and document information |
ZA2010/06093A ZA201006093B (en) | 2008-04-11 | 2010-08-26 | Search results ranking using editing distance and document information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/101,951 US8812493B2 (en) | 2008-04-11 | 2008-04-11 | Search results ranking using editing distance and document information |
Publications (2)
Publication Number | Publication Date |
---|---|
US20090259651A1 US20090259651A1 (en) | 2009-10-15 |
US8812493B2 true US8812493B2 (en) | 2014-08-19 |
Family
ID=41162189
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/101,951 Active 2029-01-23 US8812493B2 (en) | 2008-04-11 | 2008-04-11 | Search results ranking using editing distance and document information |
Country Status (12)
Country | Link |
---|---|
US (1) | US8812493B2 (en) |
EP (1) | EP2289007B1 (en) |
JP (1) | JP5492187B2 (en) |
KR (1) | KR101557294B1 (en) |
CN (1) | CN101990670B (en) |
AU (1) | AU2009234120B2 (en) |
BR (1) | BRPI0909092A2 (en) |
IL (1) | IL207830A (en) |
RU (1) | RU2501078C2 (en) |
TW (1) | TWI486800B (en) |
WO (1) | WO2009126394A1 (en) |
ZA (1) | ZA201006093B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130262983A1 (en) * | 2012-03-30 | 2013-10-03 | Bmenu As | System, method, software arrangement and computer-accessible medium for a generator that automatically identifies regions of interest in electronic documents for transcoding |
US10650191B1 (en) | 2018-06-14 | 2020-05-12 | Elementary IP LLC | Document term extraction based on multiple metrics |
US20220159130A1 (en) * | 2020-11-18 | 2022-05-19 | Canon Kabushiki Kaisha | Information processing apparatus, information processing method, and non-transitory storage medium |
RU2821294C2 (en) * | 2021-10-18 | 2024-06-19 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for ranking set of documents from search result |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7606793B2 (en) | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US8065310B2 (en) * | 2008-06-25 | 2011-11-22 | Microsoft Corporation | Topics in relevance ranking model for web search |
US20100312793A1 (en) * | 2009-06-08 | 2010-12-09 | International Business Machines Corporation | Displaying relevancy of results from multi-dimensional searches using heatmaps |
KR101141498B1 (en) * | 2010-01-14 | 2012-05-04 | 주식회사 와이즈넛 | Informational retrieval method using a proximity language model and recording medium threrof |
US10140339B2 (en) * | 2010-01-26 | 2018-11-27 | Paypal, Inc. | Methods and systems for simulating a search to generate an optimized scoring function |
TWI486797B (en) * | 2010-03-09 | 2015-06-01 | Alibaba Group Holding Ltd | Methods and devices for sorting search results |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US9189549B2 (en) * | 2010-11-08 | 2015-11-17 | Microsoft Technology Licensing, Llc | Presenting actions and providers associated with entities |
CA2747153A1 (en) * | 2011-07-19 | 2013-01-19 | Suleman Kaheer | Natural language processing dialog system for obtaining goods, services or information |
US8788436B2 (en) * | 2011-07-27 | 2014-07-22 | Microsoft Corporation | Utilization of features extracted from structured documents to improve search relevance |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US9235654B1 (en) * | 2012-02-06 | 2016-01-12 | Google Inc. | Query rewrites for generating auto-complete suggestions |
CN103077163B (en) * | 2012-12-24 | 2015-07-08 | 华为技术有限公司 | Data preprocessing method, device and system |
JP5981386B2 (en) * | 2013-04-18 | 2016-08-31 | 日本電信電話株式会社 | Representative page selection device and representative page selection program |
KR101322123B1 (en) * | 2013-06-14 | 2013-10-28 | 인하대학교 산학협력단 | Method for parallel computation of extended edit distance including swap operation |
CN104424279B (en) * | 2013-08-30 | 2018-11-20 | 腾讯科技(深圳)有限公司 | A kind of correlation calculations method and apparatus of text |
US9519859B2 (en) | 2013-09-06 | 2016-12-13 | Microsoft Technology Licensing, Llc | Deep structured semantic model produced using click-through data |
US9477654B2 (en) | 2014-04-01 | 2016-10-25 | Microsoft Corporation | Convolutional latent semantic models and their applications |
US9535960B2 (en) | 2014-04-14 | 2017-01-03 | Microsoft Corporation | Context-sensitive search using a deep learning model |
US10089580B2 (en) | 2014-08-11 | 2018-10-02 | Microsoft Technology Licensing, Llc | Generating and using a knowledge-enhanced model |
CN104572825B (en) * | 2014-12-04 | 2019-03-12 | 百度在线网络技术(北京)有限公司 | The recommended method and device of information |
US10489463B2 (en) * | 2015-02-12 | 2019-11-26 | Microsoft Technology Licensing, Llc | Finding documents describing solutions to computing issues |
CA2979579C (en) * | 2015-03-20 | 2020-02-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Relevance score assignment for artificial neural networks |
US11281639B2 (en) * | 2015-06-23 | 2022-03-22 | Microsoft Technology Licensing, Llc | Match fix-up to remove matching documents |
US11392568B2 (en) | 2015-06-23 | 2022-07-19 | Microsoft Technology Licensing, Llc | Reducing matching documents for a search query |
CN106815196B (en) * | 2015-11-27 | 2020-07-31 | 北京国双科技有限公司 | Method and device for counting the number of press releases |
CN105446957B (en) | 2015-12-03 | 2018-07-20 | 小米科技有限责任公司 | Similitude determines method, apparatus and terminal |
CN107203567A (en) * | 2016-03-18 | 2017-09-26 | 伊姆西公司 | Method and apparatus for searching for word string |
US10909450B2 (en) | 2016-03-29 | 2021-02-02 | Microsoft Technology Licensing, Llc | Multiple-action computational model training and operation |
CN106547871B (en) * | 2016-10-31 | 2020-04-07 | 北京百度网讯科技有限公司 | Neural network-based search result recall method and device |
CN107229701B (en) * | 2017-05-25 | 2018-07-03 | 腾讯科技(深圳)有限公司 | Ranking update method, device and computer equipment |
US20190251422A1 (en) * | 2018-02-09 | 2019-08-15 | Microsoft Technology Licensing, Llc | Deep neural network architecture for search |
CN109960757A (en) * | 2019-02-27 | 2019-07-02 | 北京搜狗科技发展有限公司 | Web search method and device |
RU2757174C2 (en) | 2019-09-05 | 2021-10-11 | Общество С Ограниченной Ответственностью «Яндекс» | Method and system for ranking digital objects based on target characteristic related to them |
CN110941743B (en) * | 2019-10-14 | 2023-09-15 | 广西壮族自治区科学技术情报研究所 | Scientific and technological project duplicate checking method for automatically realizing field weight distribution based on deep learning algorithm |
US10761839B1 (en) * | 2019-10-17 | 2020-09-01 | Globant España S.A. | Natural language search engine with a predictive writing tool for coding |
JP6840293B1 (en) * | 2019-11-28 | 2021-03-10 | 三菱電機株式会社 | Information processing equipment, information processing methods, and information processing programs |
CN111352549B (en) * | 2020-02-25 | 2022-01-07 | 腾讯科技(深圳)有限公司 | Data object display method, device, equipment and storage medium |
CN113360178B (en) * | 2021-05-31 | 2023-05-05 | 东风商用车有限公司 | Method, device and equipment for generating unique software identification code and readable storage medium |
US11409800B1 (en) | 2021-07-23 | 2022-08-09 | Bank Of America Corporation | Generating search queries for database searching |
US20230394100A1 (en) * | 2022-06-01 | 2023-12-07 | Ellipsis Marketing LTD | Webpage Title Generator |
KR20240161612A (en) | 2023-05-04 | 2024-11-12 | (주)테크디엔에이 | A System and Method for Retrieving Electronic Documents |
Citations (357)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5222236A (en) | 1988-04-29 | 1993-06-22 | Overdrive Systems, Inc. | Multiple integrated document assembly data processing system |
US5257577A (en) | 1991-04-01 | 1993-11-02 | Clark Melvin D | Apparatus for assist in recycling of refuse |
US5321833A (en) | 1990-08-29 | 1994-06-14 | Gte Laboratories Incorporated | Adaptive ranking system for information retrieval |
US5369778A (en) | 1987-08-21 | 1994-11-29 | Wang Laboratories, Inc. | Data processor that customizes program behavior by using a resource retrieval capability |
US5544360A (en) | 1992-11-23 | 1996-08-06 | Paragon Concepts, Inc. | Method for accessing computer files and data, using linked categories assigned to each data file record on entry of the data file record |
US5594660A (en) | 1994-09-30 | 1997-01-14 | Cirrus Logic, Inc. | Programmable audio-video synchronization method and apparatus for multimedia systems |
US5606609A (en) | 1994-09-19 | 1997-02-25 | Scientific-Atlanta | Electronic document verification system and method |
US5634124A (en) | 1987-08-21 | 1997-05-27 | Wang Laboratories, Inc. | Data integration by object management |
US5729730A (en) | 1995-03-28 | 1998-03-17 | Dex Information Systems, Inc. | Method and apparatus for improved information storage and retrieval system |
US5765150A (en) | 1996-08-09 | 1998-06-09 | Digital Equipment Corporation | Method for statistically projecting the ranking of information |
US5826269A (en) | 1995-06-21 | 1998-10-20 | Microsoft Corporation | Electronic mail interface for a network server |
US5828999A (en) | 1996-05-06 | 1998-10-27 | Apple Computer, Inc. | Method and system for deriving a large-span semantic language model for large-vocabulary recognition systems |
US5848404A (en) | 1997-03-24 | 1998-12-08 | International Business Machines Corporation | Fast query search in large dimension database |
US5870739A (en) | 1996-09-20 | 1999-02-09 | Novell, Inc. | Hybrid query apparatus and method |
US5870740A (en) | 1996-09-30 | 1999-02-09 | Apple Computer, Inc. | System and method for improving the ranking of information retrieval results for short queries |
US5890147A (en) | 1997-03-07 | 1999-03-30 | Microsoft Corporation | Scope testing of documents in a search engine using document to folder mapping |
US5893092A (en) | 1994-12-06 | 1999-04-06 | University Of Central Florida | Relevancy ranking using statistical ranking, semantics, relevancy feedback and small pieces of text |
US5893116A (en) | 1996-09-30 | 1999-04-06 | Novell, Inc. | Accessing network resources using network resource replicator and captured login script for use when the computer is disconnected from the network |
US5905866A (en) | 1996-04-30 | 1999-05-18 | A.I. Soft Corporation | Data-update monitoring in communications network |
US5913210A (en) | 1998-03-27 | 1999-06-15 | Call; Charles G. | Methods and apparatus for disseminating product information via the internet |
US5920859A (en) | 1997-02-05 | 1999-07-06 | Idd Enterprises, L.P. | Hypertext document retrieval system and method |
US5933822A (en) | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US5933851A (en) | 1995-09-29 | 1999-08-03 | Sony Corporation | Time-stamp and hash-based file modification monitor with multi-user notification and method thereof |
US5943670A (en) | 1997-11-21 | 1999-08-24 | International Business Machines Corporation | System and method for categorizing objects in combined categories |
JPH11232300A (en) | 1998-02-18 | 1999-08-27 | Nri & Ncc Co Ltd | Browsing client server system |
RU2138076C1 (en) | 1998-09-14 | 1999-09-20 | Закрытое акционерное общество "МедиаЛингва" | Data retrieval system in computer network |
US5956722A (en) | 1997-09-23 | 1999-09-21 | At&T Corp. | Method for effective indexing of partially dynamic documents |
US5960383A (en) | 1997-02-25 | 1999-09-28 | Digital Equipment Corporation | Extraction of key sections from texts using automatic indexing techniques |
EP0950961A2 (en) | 1998-04-17 | 1999-10-20 | Xerox Corporation | Methods for interactive visualization of spreading activation using time tubes and disk trees |
US5983216A (en) | 1997-09-12 | 1999-11-09 | Infoseek Corporation | Performing automated document collection and selection by providing a meta-index with meta-index values indentifying corresponding document collections |
US5987457A (en) | 1997-11-25 | 1999-11-16 | Acceleration Software International Corporation | Query refinement method for searching documents |
US6006225A (en) | 1998-06-15 | 1999-12-21 | Amazon.Com | Refining search queries by the suggestion of correlated terms from prior searches |
US6012053A (en) | 1997-06-23 | 2000-01-04 | Lycos, Inc. | Computer system with user-controlled relevance ranking of search results |
US6026398A (en) | 1997-10-16 | 2000-02-15 | Imarket, Incorporated | System and methods for searching and matching databases |
US6029164A (en) | 1997-06-16 | 2000-02-22 | Digital Equipment Corporation | Method and apparatus for organizing and accessing electronic mail messages using labels and full text and label indexing |
US6032196A (en) | 1995-12-13 | 2000-02-29 | Digital Equipment Corporation | System for adding a new entry to a web page table upon receiving a web page including a link to another web page not having a corresponding entry in the web page table |
US6038610A (en) | 1996-07-17 | 2000-03-14 | Microsoft Corporation | Storage of sitemaps at server sites for holding information regarding content |
US6041323A (en) | 1996-04-17 | 2000-03-21 | International Business Machines Corporation | Information search method, information search device, and storage medium for storing an information search program |
US6070158A (en) | 1996-08-14 | 2000-05-30 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
US6070191A (en) | 1997-10-17 | 2000-05-30 | Lucent Technologies Inc. | Data distribution techniques for load-balanced fault-tolerant web access |
JP2000194713A (en) | 1998-12-25 | 2000-07-14 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for retrieving character string, and storage medium stored with character string retrieval program |
US6098064A (en) | 1998-05-22 | 2000-08-01 | Xerox Corporation | Prefetching and caching documents according to probability ranked need S list |
US6115709A (en) | 1998-09-18 | 2000-09-05 | Tacit Knowledge Systems, Inc. | Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions |
US6125361A (en) | 1998-04-10 | 2000-09-26 | International Business Machines Corporation | Feature diffusion across hyperlinks |
US6128701A (en) | 1997-10-28 | 2000-10-03 | Cache Flow, Inc. | Adaptive and predictive cache refresh policy |
US6145003A (en) | 1997-12-17 | 2000-11-07 | Microsoft Corporation | Method of web crawling utilizing address mapping |
EP1050830A2 (en) | 1999-05-05 | 2000-11-08 | Xerox Corporation | System and method for collaborative ranking of search results employing user and group profiles |
US6151624A (en) | 1998-02-03 | 2000-11-21 | Realnames Corporation | Navigating network resources based on metadata |
US6167369A (en) | 1998-12-23 | 2000-12-26 | Xerox Company | Automatic language identification using both N-gram and word information |
US6167402A (en) | 1998-04-27 | 2000-12-26 | Sun Microsystems, Inc. | High performance message store |
US6178419B1 (en) | 1996-07-31 | 2001-01-23 | British Telecommunications Plc | Data access system |
US6182065B1 (en) | 1996-11-06 | 2001-01-30 | International Business Machines Corp. | Method and system for weighting the search results of a database search engine |
US6182085B1 (en) | 1998-05-28 | 2001-01-30 | International Business Machines Corporation | Collaborative team crawling:Large scale information gathering over the internet |
US6182067B1 (en) | 1997-06-02 | 2001-01-30 | Knowledge Horizons Pty Ltd. | Methods and systems for knowledge management |
US6182113B1 (en) | 1997-09-16 | 2001-01-30 | International Business Machines Corporation | Dynamic multiplexing of hyperlinks and bookmarks |
US6185558B1 (en) | 1998-03-03 | 2001-02-06 | Amazon.Com, Inc. | Identifying the items most relevant to a current query based on items selected in connection with similar queries |
JP2001052017A (en) | 1999-08-11 | 2001-02-23 | Fuji Xerox Co Ltd | Hypertext analyzer |
US6199081B1 (en) | 1998-06-30 | 2001-03-06 | Microsoft Corporation | Automatic tagging of documents and exclusion by content |
US6202058B1 (en) | 1994-04-25 | 2001-03-13 | Apple Computer, Inc. | System for ranking the relevance of information objects accessed by computer users |
US6208988B1 (en) | 1998-06-01 | 2001-03-27 | Bigchalk.Com, Inc. | Method for identifying themes associated with a search query using metadata and for organizing documents responsive to the search query in accordance with the themes |
US6216123B1 (en) | 1998-06-24 | 2001-04-10 | Novell, Inc. | Method and system for rapid retrieval in a full text indexing system |
US6222559B1 (en) | 1996-10-02 | 2001-04-24 | Nippon Telegraph And Telephone Corporation | Method and apparatus for display of hierarchical structures |
JP2001117934A (en) | 1999-10-19 | 2001-04-27 | Hitachi Ltd | Electronic document management method and system, and recording medium |
US6240408B1 (en) | 1998-06-08 | 2001-05-29 | Kcsl, Inc. | Method and system for retrieving relevant documents from a database |
US6240407B1 (en) | 1998-04-29 | 2001-05-29 | International Business Machines Corp. | Method and apparatus for creating an index in a database system |
US6247013B1 (en) | 1997-06-30 | 2001-06-12 | Canon Kabushiki Kaisha | Hyper text reading system |
US6263364B1 (en) | 1999-11-02 | 2001-07-17 | Alta Vista Company | Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining document freshness |
US6269370B1 (en) | 1996-02-21 | 2001-07-31 | Infoseek Corporation | Web scan process |
EP1120717A2 (en) | 2000-01-28 | 2001-08-01 | Microsoft Corporation | Adaptive web crawling using a statistical model |
US6272507B1 (en) | 1997-04-09 | 2001-08-07 | Xerox Corporation | System for ranking search results from a collection of documents using spreading activation techniques |
US6285367B1 (en) | 1998-05-26 | 2001-09-04 | International Business Machines Corporation | Method and apparatus for displaying and navigating a graph |
US6285999B1 (en) | 1997-01-10 | 2001-09-04 | The Board Of Trustees Of The Leland Stanford Junior University | Method for node ranking in a linked database |
JP2001265774A (en) | 2000-03-16 | 2001-09-28 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for retrieving information, recording medium with recorded information retrieval program and hypertext information retrieving system |
US6304864B1 (en) | 1999-04-20 | 2001-10-16 | Textwise Llc | System for retrieving multimedia information from the internet using multiple evolving intelligent agents |
US6314421B1 (en) | 1998-05-12 | 2001-11-06 | David M. Sharnoff | Method and apparatus for indexing documents for message filtering |
US6317741B1 (en) | 1996-08-09 | 2001-11-13 | Altavista Company | Technique for ranking records of a database |
US20010042076A1 (en) | 1997-06-30 | 2001-11-15 | Ryoji Fukuda | A hypertext reader which performs a reading process on a hierarchically constructed hypertext |
US6324551B1 (en) | 1998-08-31 | 2001-11-27 | Xerox Corporation | Self-contained document management based on document properties |
US6326962B1 (en) | 1996-12-23 | 2001-12-04 | Doubleagent Llc | Graphic user interface for database system |
US6336117B1 (en) | 1999-04-30 | 2002-01-01 | International Business Machines Corporation | Content-indexing search system and method providing search results consistent with content filtering and blocking policies implemented in a blocking engine |
DE10029644A1 (en) | 2000-06-16 | 2002-01-17 | Deutsche Telekom Ag | Hypertext documents evaluation method using search engine, involves calculating real relevance value for each document based on precalculated relevance value and cross references of document |
US20020016787A1 (en) | 2000-06-28 | 2002-02-07 | Matsushita Electric Industrial Co., Ltd. | Apparatus for retrieving similar documents and apparatus for extracting relevant keywords |
US6349308B1 (en) | 1998-02-25 | 2002-02-19 | Korea Advanced Institute Of Science & Technology | Inverted index storage structure using subindexes and large objects for tight coupling of information retrieval with database management systems |
US6351755B1 (en) | 1999-11-02 | 2002-02-26 | Alta Vista Company | System and method for associating an extensible set of data with documents downloaded by a web crawler |
US6351467B1 (en) | 1997-10-27 | 2002-02-26 | Hughes Electronics Corporation | System and method for multicasting multimedia content |
US20020026390A1 (en) | 2000-08-25 | 2002-02-28 | Jonas Ulenas | Method and apparatus for obtaining consumer product preferences through product selection and evaluation |
KR20020015838A (en) | 2000-08-23 | 2002-03-02 | 전홍건 | Method for re-adjusting ranking of document to use user's profile and entropy |
US20020032772A1 (en) | 2000-09-14 | 2002-03-14 | Bjorn Olstad | Method for searching and analysing information in data networks |
US6360215B1 (en) | 1998-11-03 | 2002-03-19 | Inktomi Corporation | Method and apparatus for retrieving documents based on information other than document content |
JP2002091843A (en) | 2000-09-11 | 2002-03-29 | Nippon Telegr & Teleph Corp <Ntt> | Device and method for selecting server and recording medium recording server selection program |
US6381597B1 (en) | 1999-10-07 | 2002-04-30 | U-Know Software Corporation | Electronic shopping agent which is capable of operating with vendor sites which have disparate formats |
US6385602B1 (en) | 1998-11-03 | 2002-05-07 | E-Centives, Inc. | Presentation of search results using dynamic categorization |
US20020055940A1 (en) | 2000-11-07 | 2002-05-09 | Charles Elkan | Method and system for selecting documents by measuring document quality |
JP2002132769A (en) | 2000-10-25 | 2002-05-10 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for multilateral retrieval service and recording medium recording program therefor |
US6389436B1 (en) | 1997-12-15 | 2002-05-14 | International Business Machines Corporation | Enhanced hypertext categorization using hyperlinks |
JP2002140365A (en) | 2000-11-01 | 2002-05-17 | Mitsubishi Electric Corp | Data retrieving method |
US20020062323A1 (en) | 2000-11-20 | 2002-05-23 | Yozan Inc | Browser apparatus, server apparatus, computer-readable medium, search system and search method |
US20020078045A1 (en) | 2000-12-14 | 2002-06-20 | Rabindranath Dutta | System, method, and program for ranking search results using user category weighting |
US20020083054A1 (en) | 2000-12-27 | 2002-06-27 | Kyle Peltonen | Scoping queries in a search engine |
US6415319B1 (en) | 1997-02-07 | 2002-07-02 | Sun Microsystems, Inc. | Intelligent network browser using incremental conceptual indexer |
US6418452B1 (en) | 1999-11-03 | 2002-07-09 | International Business Machines Corporation | Network repository service directory for efficient web crawling |
US6418433B1 (en) | 1999-01-28 | 2002-07-09 | International Business Machines Corporation | System and method for focussed web crawling |
US6418453B1 (en) | 1999-11-03 | 2002-07-09 | International Business Machines Corporation | Network repository service for efficient web crawling |
JP2002202992A (en) | 2000-12-28 | 2002-07-19 | Speed System:Kk | Homepage retrieval system |
US6424966B1 (en) | 1998-06-30 | 2002-07-23 | Microsoft Corporation | Synchronizing crawler with notification source |
US20020099694A1 (en) | 2000-11-21 | 2002-07-25 | Diamond Theodore George | Full-text relevancy ranking |
US20020103798A1 (en) | 2001-02-01 | 2002-08-01 | Abrol Mani S. | Adaptive document ranking method based on user behavior |
US20020107861A1 (en) | 2000-12-07 | 2002-08-08 | Kerry Clendinning | System and method for collecting, associating, normalizing and presenting product and vendor information on a distributed network |
US20020107886A1 (en) | 2001-02-07 | 2002-08-08 | Gentner Donald R. | Method and apparatus for automatic document electronic versioning system |
US6442606B1 (en) | 1999-08-12 | 2002-08-27 | Inktomi Corporation | Method and apparatus for identifying spoof documents |
JP2002245089A (en) | 2001-02-19 | 2002-08-30 | Hitachi Eng Co Ltd | Web page search system, secondary information collection device, interface device |
US20020123988A1 (en) | 2001-03-02 | 2002-09-05 | Google, Inc. | Methods and apparatus for employing usage statistics in document retrieval |
US20020129015A1 (en) | 2001-01-18 | 2002-09-12 | Maureen Caudill | Method and system of ranking and clustering for document indexing and retrieval |
US20020129014A1 (en) | 2001-01-10 | 2002-09-12 | Kim Brian S. | Systems and methods of retrieving relevant information |
US6473752B1 (en) | 1997-12-04 | 2002-10-29 | Micron Technology, Inc. | Method and system for locating documents based on previously accessed documents |
US20020165873A1 (en) | 2001-02-22 | 2002-11-07 | International Business Machines Corporation | Retrieving handwritten documents using multiple document recognizers and techniques allowing both typed and handwritten queries |
US20020169595A1 (en) | 2001-03-30 | 2002-11-14 | Yevgeny Agichtein | Method for retrieving answers from an information retrieval system |
US20020169800A1 (en) | 2001-01-05 | 2002-11-14 | International Business Machines Corporation | XML: finding authoritative pages for mining communities based on page structure criteria |
US20020169754A1 (en) | 2001-05-08 | 2002-11-14 | Jianchang Mao | Apparatus and method for adaptively ranking search results |
US20020169770A1 (en) | 2001-04-27 | 2002-11-14 | Kim Brian Seong-Gon | Apparatus and method that categorize a collection of documents into a hierarchy of categories that are defined by the collection of documents |
US20020168106A1 (en) | 2001-05-11 | 2002-11-14 | Miroslav Trajkovic | Palette-based histogram matching with recursive histogram vector generation |
US6484204B1 (en) | 1997-05-06 | 2002-11-19 | At&T Corp. | System and method for allocating requests for objects and managing replicas of objects on a network |
JP2002366549A (en) | 2001-05-07 | 2002-12-20 | Nec Corp | Selective retrieval metasearch engine and method for performing selective retrieval |
US20030004952A1 (en) | 1999-10-18 | 2003-01-02 | Mark Nixon | Accessing and updating a configuration database from distributed physical locations within a process control system |
US6516312B1 (en) | 2000-04-04 | 2003-02-04 | International Business Machine Corporation | System and method for dynamically associating keywords with domain-specific search engine queries |
EP1282060A2 (en) | 2001-08-03 | 2003-02-05 | Overture Services, Inc. | System and method for providing place and price protection in a search result list generated by a computer network search engine |
US20030028520A1 (en) | 2001-06-20 | 2003-02-06 | Alpha Shamim A. | Method and system for response time optimization of data query rankings and retrieval |
US20030037074A1 (en) | 2001-05-01 | 2003-02-20 | Ibm Corporation | System and method for aggregating ranking results from various sources to improve the results of web searching |
US6526440B1 (en) | 2001-01-30 | 2003-02-25 | Google, Inc. | Ranking search results by reranking the results based on local inter-connectivity |
US20030046389A1 (en) | 2001-09-04 | 2003-03-06 | Thieme Laura M. | Method for monitoring a web site's keyword visibility in search engines and directories and resulting traffic from such keyword visibility |
JP2003076715A (en) | 2001-08-20 | 2003-03-14 | Nhn Corp | Method and system for retrieving web pages, program and recording medium |
US20030055810A1 (en) | 2001-09-18 | 2003-03-20 | International Business Machines Corporation | Front-end weight factor search criteria |
US20030053084A1 (en) | 2001-07-19 | 2003-03-20 | Geidl Erik M. | Electronic ink as a software object |
US6539376B1 (en) | 1999-11-15 | 2003-03-25 | International Business Machines Corporation | System and method for the automatic mining of new relationships |
US20030061201A1 (en) | 2001-08-13 | 2003-03-27 | Xerox Corporation | System for propagating enrichment between documents |
US20030065706A1 (en) | 2001-05-10 | 2003-04-03 | Smyth Barry Joseph | Intelligent internet website with hierarchical menu |
US6546388B1 (en) | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6549896B1 (en) | 2000-04-07 | 2003-04-15 | Nec Usa, Inc. | System and method employing random walks for mining web page associations and usage to optimize user-oriented web page refresh and pre-fetch scheduling |
US6547829B1 (en) | 1999-06-30 | 2003-04-15 | Microsoft Corporation | Method and system for detecting duplicate documents in web crawls |
US6549897B1 (en) | 1998-10-09 | 2003-04-15 | Microsoft Corporation | Method and system for calculating phrase-document importance |
US20030074368A1 (en) | 1999-01-26 | 2003-04-17 | Hinrich Schuetze | System and method for quantitatively representing data objects in vector space |
US6553364B1 (en) | 1997-11-03 | 2003-04-22 | Yahoo! Inc. | Information retrieval from hierarchical compound documents |
US6557036B1 (en) | 1999-07-20 | 2003-04-29 | Sun Microsystems, Inc. | Methods and apparatus for site wide monitoring of electronic mail systems |
US6560600B1 (en) | 2000-10-25 | 2003-05-06 | Alta Vista Company | Method and apparatus for ranking Web page search results |
US20030088545A1 (en) | 2001-06-18 | 2003-05-08 | Pavitra Subramaniam | System and method to implement a persistent and dismissible search center frame |
US20030101183A1 (en) | 2001-11-26 | 2003-05-29 | Navin Kabra | Information retrieval index allowing updating while in use |
US6594682B2 (en) | 1997-10-28 | 2003-07-15 | Microsoft Corporation | Client-side system for scheduling delivery of web content and locally managing the web content |
US20030135490A1 (en) | 2002-01-15 | 2003-07-17 | Barrett Michael E. | Enhanced popularity ranking |
RU2001128643A (en) | 2001-10-24 | 2003-07-20 | Закрытое акционерное общество "МедиаЛингва" | A method for determining the rating of links and ranking the paths for users to crawl pages of an Internet site located in a processing device of an Internet network node |
US6598040B1 (en) | 2000-08-14 | 2003-07-22 | International Business Machines Corporation | Method and system for processing electronic search expressions |
US6598047B1 (en) | 1999-07-26 | 2003-07-22 | David W. Russell | Method and system for searching text |
US6598051B1 (en) | 2000-09-19 | 2003-07-22 | Altavista Company | Web page connectivity server |
JP2003208434A (en) | 2001-11-07 | 2003-07-25 | Nec Corp | Information retrieval system, and information retrieval method using the same |
US6601075B1 (en) | 2000-07-27 | 2003-07-29 | International Business Machines Corporation | System and method of ranking and retrieving documents based on authority scores of schemas and documents |
JP2003248696A (en) | 2002-02-22 | 2003-09-05 | Nippon Telegr & Teleph Corp <Ntt> | Page rating/filtering method, device, and program, and computer readable recording medium recording the program |
US6622140B1 (en) | 2000-11-15 | 2003-09-16 | Justsystem Corporation | Method and apparatus for analyzing affect and emotion in text |
US6628304B2 (en) | 1998-12-09 | 2003-09-30 | Cisco Technology, Inc. | Method and apparatus providing a graphical user interface for representing and navigating hierarchical networks |
US6631369B1 (en) | 1999-06-30 | 2003-10-07 | Microsoft Corporation | Method and system for incremental web crawling |
US6633867B1 (en) | 2000-04-05 | 2003-10-14 | International Business Machines Corporation | System and method for providing a session query within the context of a dynamic search result set |
US6633868B1 (en) | 2000-07-28 | 2003-10-14 | Shermann Loyall Min | System and method for context-based document retrieval |
US20030195882A1 (en) * | 2002-04-11 | 2003-10-16 | Lee Chung Hee | Homepage searching method using similarity recalculation based on URL substring relationship |
KR20030081209A (en) | 2003-08-19 | 2003-10-17 | 장두한 | Apparatus for cutting the surface of the weld zone |
US6636853B1 (en) | 1999-08-30 | 2003-10-21 | Morphism, Llc | Method and apparatus for representing and navigating search results |
US6638314B1 (en) | 1998-06-26 | 2003-10-28 | Microsoft Corporation | Method of web crawling utilizing crawl numbers |
US20030217007A1 (en) | 2002-01-29 | 2003-11-20 | Sony Corporation | Method for providing and obtaining content |
US20030217047A1 (en) | 1999-03-23 | 2003-11-20 | Insightful Corporation | Inverse inference engine for high performance web search |
US20030217052A1 (en) | 2000-08-24 | 2003-11-20 | Celebros Ltd. | Search engine method and apparatus |
US6654742B1 (en) | 1999-02-12 | 2003-11-25 | International Business Machines Corporation | Method and system for document collection final search result by arithmetical operations between search results sorted by multiple ranking metrics |
US20040003028A1 (en) | 2002-05-08 | 2004-01-01 | David Emmett | Automatic display of web content to smaller display devices: improved summarization and navigation |
US20040006559A1 (en) | 2002-05-29 | 2004-01-08 | Gange David M. | System, apparatus, and method for user tunable and selectable searching of a database using a weigthted quantized feature vector |
US6678692B1 (en) | 2000-07-10 | 2004-01-13 | Northrop Grumman Corporation | Hierarchy statistical analysis system and method |
US20040024752A1 (en) | 2002-08-05 | 2004-02-05 | Yahoo! Inc. | Method and apparatus for search ranking using human input and automated ranking |
US6701318B2 (en) | 1998-11-18 | 2004-03-02 | Harris Corporation | Multiple engine information retrieval and visualization system |
US20040049766A1 (en) | 2002-09-09 | 2004-03-11 | Bloch Joshua J. | Method and apparatus for associating metadata attributes with program elements |
US20040064442A1 (en) | 2002-09-27 | 2004-04-01 | Popovitch Steven Gregory | Incremental search engine |
US6718365B1 (en) | 2000-04-13 | 2004-04-06 | International Business Machines Corporation | Method, system, and program for ordering search results using an importance weighting |
US20040093328A1 (en) | 2001-02-08 | 2004-05-13 | Aditya Damle | Methods and systems for automated semantic knowledge leveraging graph theoretic analysis and the inherent structure of communication |
JP2004164555A (en) | 2002-09-17 | 2004-06-10 | Fuji Xerox Co Ltd | Apparatus and method for retrieval, and apparatus and method for index building |
US20040117351A1 (en) | 2002-12-14 | 2004-06-17 | International Business Machines Corporation | System and method for identifying and utilizing a secondary index to access a database using a management system without an internal catalogue of online metadata |
JP2004192657A (en) | 2004-02-09 | 2004-07-08 | Nec Corp | Information retrieval system, and recording medium recording information retrieval method and program for information retrieval |
US6763362B2 (en) | 2001-11-30 | 2004-07-13 | Micron Technology, Inc. | Method and system for updating a search engine |
US6766422B2 (en) | 2001-09-27 | 2004-07-20 | Siemens Information And Communication Networks, Inc. | Method and system for web caching based on predictive usage |
US20040141354A1 (en) * | 2003-01-18 | 2004-07-22 | Carnahan John M. | Query string matching method and apparatus |
US20040148278A1 (en) | 2003-01-22 | 2004-07-29 | Amir Milo | System and method for providing content warehouse |
US6772141B1 (en) | 1999-12-14 | 2004-08-03 | Novell, Inc. | Method and apparatus for organizing and using indexes utilizing a search decision table |
US6775659B2 (en) | 1998-08-26 | 2004-08-10 | Symtec Limited | Methods and devices for mapping data files |
US6775664B2 (en) | 1996-04-04 | 2004-08-10 | Lycos, Inc. | Information filter system and method for integrated content-based and collaborative/adaptive feedback queries |
US20040181515A1 (en) | 2003-03-13 | 2004-09-16 | International Business Machines Corporation | Group administration of universal resource identifiers with members identified in search result |
RU2236699C1 (en) | 2003-02-25 | 2004-09-20 | Открытое акционерное общество "Телепортал. Ру" | Method for searching and selecting information with increased relevance |
US20040186827A1 (en) | 2003-03-21 | 2004-09-23 | Anick Peter G. | Systems and methods for interactive search query refinement |
JP2004265015A (en) | 2003-02-28 | 2004-09-24 | Toyota Motor Corp | Index generator for content search |
US20040194099A1 (en) | 2003-03-31 | 2004-09-30 | John Lamping | System and method for providing preferred language ordering of search results |
US20040199497A1 (en) | 2000-02-08 | 2004-10-07 | Sybase, Inc. | System and Methodology for Extraction and Aggregation of Data from Dynamic Content |
US20040205497A1 (en) | 2001-10-22 | 2004-10-14 | Chiang Alexander | System for automatic generation of arbitrarily indexed hyperlinked text |
CA2279119C (en) | 1999-07-29 | 2004-10-19 | Ibm Canada Limited-Ibm Canada Limitee | Heuristic-based conditional data indexing |
US20040215606A1 (en) | 2003-04-25 | 2004-10-28 | David Cossock | Method and apparatus for machine learning a document relevance function |
US20040215664A1 (en) | 1999-03-31 | 2004-10-28 | Microsoft Corporation | Method for promoting contextual information to display pages containing hyperlinks |
US6829606B2 (en) | 2002-02-14 | 2004-12-07 | Infoglide Software Corporation | Similarity search engine for use with relational databases |
US20040249795A1 (en) | 2003-06-05 | 2004-12-09 | International Business Machines Corporation | Semantics-based searching for information in a distributed data processing system |
US20040254932A1 (en) | 2003-06-16 | 2004-12-16 | Vineet Gupta | System and method for providing preferred country biasing of search results |
US20040260695A1 (en) | 2003-06-20 | 2004-12-23 | Brill Eric D. | Systems and methods to tune a general-purpose search engine for a search entry point |
US20040267722A1 (en) | 2003-06-30 | 2004-12-30 | Larimore Stefan Isbein | Fast ranked full-text searching |
US20050033742A1 (en) | 2003-03-28 | 2005-02-10 | Kamvar Sepandar D. | Methods for ranking nodes in large directed graphs |
US6859800B1 (en) | 2000-04-26 | 2005-02-22 | Global Information Research And Technologies Llc | System for fulfilling an information need |
US20050044071A1 (en) | 2000-06-08 | 2005-02-24 | Ingenuity Systems, Inc. | Techniques for facilitating information acquisition and storage |
US6862710B1 (en) | 1999-03-23 | 2005-03-01 | Insightful Corporation | Internet navigation using soft hyperlinks |
US20050055347A9 (en) | 2000-12-08 | 2005-03-10 | Ingenuity Systems, Inc. | Method and system for performing information extraction and quality control for a knowledgebase |
US20050055340A1 (en) | 2002-07-26 | 2005-03-10 | Brainbow, Inc. | Neural-based internet search engine with fuzzy and learning processes implemented by backward propogation |
US6868411B2 (en) | 2001-08-13 | 2005-03-15 | Xerox Corporation | Fuzzy text categorizer |
US20050060304A1 (en) | 2002-11-19 | 2005-03-17 | Prashant Parikh | Navigational learning in a structured transaction processing system |
US20050060186A1 (en) | 2003-08-28 | 2005-03-17 | Blowers Paul A. | Prioritized presentation of medical device events |
US20050060311A1 (en) | 2003-09-12 | 2005-03-17 | Simon Tong | Methods and systems for improving a search ranking using related queries |
US20050060310A1 (en) | 2003-09-12 | 2005-03-17 | Simon Tong | Methods and systems for improving a search ranking using population information |
US6873982B1 (en) | 1999-07-16 | 2005-03-29 | International Business Machines Corporation | Ordering of database search results based on user feedback |
US20050071741A1 (en) | 2003-09-30 | 2005-03-31 | Anurag Acharya | Information retrieval based on historical data |
US20050071328A1 (en) | 2003-09-30 | 2005-03-31 | Lawrence Stephen R. | Personalization of web search |
US20050086192A1 (en) | 2003-10-16 | 2005-04-21 | Hitach, Ltd. | Method and apparatus for improving the integration between a search engine and one or more file servers |
US20050086206A1 (en) | 2003-10-15 | 2005-04-21 | International Business Machines Corporation | System, Method, and service for collaborative focused crawling of documents on a network |
US6886010B2 (en) | 2002-09-30 | 2005-04-26 | The United States Of America As Represented By The Secretary Of The Navy | Method for data and text mining and literature-based discovery |
US6886129B1 (en) | 1999-11-24 | 2005-04-26 | International Business Machines Corporation | Method and system for trawling the World-wide Web to identify implicitly-defined communities of web pages |
US20050089215A1 (en) | 2003-10-25 | 2005-04-28 | Carl Staelin | Image artifact reduction using a neural network |
US20050114324A1 (en) | 2003-09-14 | 2005-05-26 | Yaron Mayer | System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers |
US20050125392A1 (en) | 2003-12-08 | 2005-06-09 | Andy Curtis | Methods and systems for providing a response to a query |
US6910029B1 (en) | 2000-02-22 | 2005-06-21 | International Business Machines Corporation | System for weighted indexing of hierarchical documents |
US20050144162A1 (en) | 2003-12-29 | 2005-06-30 | Ping Liang | Advanced search, file system, and intelligent assistant agent |
US20050154746A1 (en) | 2004-01-09 | 2005-07-14 | Yahoo!, Inc. | Content presentation and management system associating base content and relevant additional content |
US20050154710A1 (en) | 2004-01-08 | 2005-07-14 | International Business Machines Corporation | Dynamic bitmap processing, identification and reusability |
EP1557770A1 (en) | 2004-01-23 | 2005-07-27 | Microsoft Corporation | Building and using subwebs for focused search |
US20050165781A1 (en) | 2004-01-26 | 2005-07-28 | Reiner Kraft | Method, system, and program for handling anchor text |
US6931397B1 (en) | 2000-02-11 | 2005-08-16 | International Business Machines Corporation | System and method for automatic generation of dynamic search abstracts contain metadata by crawler |
US6934714B2 (en) | 2002-03-04 | 2005-08-23 | Intelesis Engineering, Inc. | Method and system for identification and maintenance of families of data records |
US20050192936A1 (en) | 2004-02-12 | 2005-09-01 | Meek Christopher A. | Decision-theoretic web-crawling and predicting web-page change |
US20050192955A1 (en) | 2004-03-01 | 2005-09-01 | International Business Machines Corporation | Organizing related search results |
US6944609B2 (en) | 2001-10-18 | 2005-09-13 | Lycos, Inc. | Search results using editor feedback |
US20050210105A1 (en) | 2004-03-22 | 2005-09-22 | Fuji Xerox Co., Ltd. | Conference information processing apparatus, and conference information processing method and storage medium readable by computer |
US20050210006A1 (en) | 2004-03-18 | 2005-09-22 | Microsoft Corporation | Field weighting in text searching |
US20050210079A1 (en) | 2004-03-17 | 2005-09-22 | Edlund Stefan B | Method for synchronizing documents for disconnected operation |
US20050216533A1 (en) | 2004-03-29 | 2005-09-29 | Yahoo! Inc. | Search using graph colorization and personalized bookmark processing |
US6959326B1 (en) | 2000-08-24 | 2005-10-25 | International Business Machines Corporation | Method, system, and program for gathering indexable metadata on content at a data repository |
US20050240580A1 (en) | 2003-09-30 | 2005-10-27 | Zamir Oren E | Personalization of placed content ordering in search results |
US20050251499A1 (en) | 2004-05-04 | 2005-11-10 | Zezhen Huang | Method and system for searching documents using readers valuation |
US20050256865A1 (en) | 2004-05-14 | 2005-11-17 | Microsoft Corporation | Method and system for indexing and searching databases |
US20050262050A1 (en) | 2004-05-07 | 2005-11-24 | International Business Machines Corporation | System, method and service for ranking search results using a modular scoring system |
US6973490B1 (en) | 1999-06-23 | 2005-12-06 | Savvis Communications Corp. | Method and system for object-level web performance and analysis |
US20050283473A1 (en) | 2004-06-17 | 2005-12-22 | Armand Rousso | Apparatus, method and system of artificial intelligence for data searching applications |
US20050289133A1 (en) | 2004-06-25 | 2005-12-29 | Yan Arrouye | Methods and systems for managing data |
US20050289193A1 (en) | 2004-06-25 | 2005-12-29 | Yan Arrouye | Methods and systems for managing data |
US20060004732A1 (en) | 2002-02-26 | 2006-01-05 | Odom Paul S | Search engine methods and systems for generating relevant search results and advertisements |
US6990628B1 (en) | 1999-06-14 | 2006-01-24 | Yahoo! Inc. | Method and apparatus for measuring similarity among electronic documents |
US20060031183A1 (en) | 2004-08-04 | 2006-02-09 | Tolga Oral | System and method for enhancing keyword relevance by user's interest on the search result documents |
US6999959B1 (en) | 1997-10-10 | 2006-02-14 | Nec Laboratories America, Inc. | Meta search engine |
US20060036598A1 (en) | 2004-08-09 | 2006-02-16 | Jie Wu | Computerized method for ranking linked information items in distributed sources |
US7003442B1 (en) | 1998-06-24 | 2006-02-21 | Fujitsu Limited | Document file group organizing apparatus and method thereof |
US20060041521A1 (en) | 2004-08-04 | 2006-02-23 | Tolga Oral | System and method for providing graphical representations of search results in multiple related histograms |
US20060047649A1 (en) | 2003-12-29 | 2006-03-02 | Ping Liang | Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation |
US20060047643A1 (en) | 2004-08-31 | 2006-03-02 | Chirag Chaman | Method and system for a personalized search engine |
US7010532B1 (en) | 1997-12-31 | 2006-03-07 | International Business Machines Corporation | Low overhead methods and apparatus for shared access storage devices |
US20060059144A1 (en) | 2004-09-16 | 2006-03-16 | Telenor Asa | Method, system, and computer program product for searching for, navigating among, and ranking of documents in a personal web |
US7016540B1 (en) | 1999-11-24 | 2006-03-21 | Nec Corporation | Method and system for segmentation, classification, and summarization of video images |
US20060064411A1 (en) | 2004-09-22 | 2006-03-23 | William Gross | Search engine using user intent |
US20060069982A1 (en) | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Click distance determination |
US20060074781A1 (en) | 2004-10-06 | 2006-04-06 | Leano Hector V | System for facilitating turnkey real estate investment in Mexico |
US20060074883A1 (en) | 2004-10-05 | 2006-04-06 | Microsoft Corporation | Systems, methods, and interfaces for providing personalized search and information access |
US20060074903A1 (en) | 2004-09-30 | 2006-04-06 | Microsoft Corporation | System and method for ranking search results using click distance |
RU2273879C2 (en) | 2002-05-28 | 2006-04-10 | Владимир Владимирович Насыпный | Method for synthesis of self-teaching system for extracting knowledge from text documents for search engines |
US7028029B2 (en) | 2003-03-28 | 2006-04-11 | Google Inc. | Adaptive computation of ranking |
US20060095416A1 (en) | 2004-10-28 | 2006-05-04 | Yahoo! Inc. | Link-based spam detection |
US7051023B2 (en) | 2003-04-04 | 2006-05-23 | Yahoo! Inc. | Systems and methods for generating concept units from search queries |
US20060136411A1 (en) | 2004-12-21 | 2006-06-22 | Microsoft Corporation | Ranking search results using feature extraction |
US7072888B1 (en) | 1999-06-16 | 2006-07-04 | Triogo, Inc. | Process for improving search engine efficiency using feedback |
US20060149723A1 (en) | 2002-05-24 | 2006-07-06 | Microsoft Corporation | System and method for providing search results with configurable scoring formula |
US7076483B2 (en) | 2001-08-27 | 2006-07-11 | Xyleme Sa | Ranking nodes in a graph |
US7080073B1 (en) | 2000-08-18 | 2006-07-18 | Firstrain, Inc. | Method and apparatus for focused crawling |
US20060161534A1 (en) | 2005-01-18 | 2006-07-20 | Yahoo! Inc. | Matching and ranking of sponsored search listings incorporating web search technology and web content |
US7085755B2 (en) | 2002-11-07 | 2006-08-01 | Thomson Global Resources Ag | Electronic document repository management and access system |
US20060173828A1 (en) | 2005-02-01 | 2006-08-03 | Outland Research, Llc | Methods and apparatus for using personal background data to improve the organization of documents retrieved in response to a search query |
US20060173560A1 (en) | 2004-10-07 | 2006-08-03 | Bernard Widrow | System and method for cognitive memory and auto-associative neural network based pattern recognition |
US20060195440A1 (en) | 2005-02-25 | 2006-08-31 | Microsoft Corporation | Ranking results using multiple nested ranking |
US20060200460A1 (en) | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
US7107218B1 (en) | 1999-10-29 | 2006-09-12 | British Telecommunications Public Limited Company | Method and apparatus for processing queries |
US20060206460A1 (en) | 2005-03-14 | 2006-09-14 | Sanjay Gadkari | Biasing search results |
US20060206476A1 (en) | 2005-03-10 | 2006-09-14 | Yahoo!, Inc. | Reranking and increasing the relevance of the results of Internet searches |
US20060212423A1 (en) | 2005-03-16 | 2006-09-21 | Rosie Jones | System and method for biasing search results based on topic familiarity |
US20060224554A1 (en) | 2005-03-29 | 2006-10-05 | Bailey David R | Query revision using known highly-ranked queries |
US20060248074A1 (en) | 2005-04-28 | 2006-11-02 | International Business Machines Corporation | Term-statistics modification for category-based search |
US20060259481A1 (en) | 2005-05-12 | 2006-11-16 | Xerox Corporation | Method of analyzing documents |
US20060282306A1 (en) | 2005-06-10 | 2006-12-14 | Unicru, Inc. | Employee selection via adaptive assessment |
US20060282455A1 (en) | 2005-06-13 | 2006-12-14 | It Interactive Services Inc. | System and method for ranking web content |
US7152059B2 (en) | 2002-08-30 | 2006-12-19 | Emergency24, Inc. | System and method for predicting additional search results of a computerized database search user based on an initial search query |
US20060287993A1 (en) | 2005-06-21 | 2006-12-21 | Microsoft Corporation | High scale adaptive search systems and methods |
US20060294100A1 (en) | 2005-03-03 | 2006-12-28 | Microsoft Corporation | Ranking search results using language types |
US20070038622A1 (en) | 2005-08-15 | 2007-02-15 | Microsoft Corporation | Method ranking search results using biased click distance |
US20070038616A1 (en) | 2005-08-10 | 2007-02-15 | Guha Ramanathan V | Programmable search engine |
US7181438B1 (en) | 1999-07-21 | 2007-02-20 | Alberti Anemometer, Llc | Database access system |
US20070050338A1 (en) | 2005-08-29 | 2007-03-01 | Strohm Alan C | Mobile sitemaps |
US20070067284A1 (en) | 2005-09-21 | 2007-03-22 | Microsoft Corporation | Ranking functions using document usage statistics |
US20070073748A1 (en) | 2005-09-27 | 2007-03-29 | Barney Jonathan A | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
US20070085716A1 (en) | 2005-09-30 | 2007-04-19 | International Business Machines Corporation | System and method for detecting matches of small edit distance |
US20070094285A1 (en) | 2005-10-21 | 2007-04-26 | Microsoft Corporation | Question answering over structured content on the web |
US20070106659A1 (en) | 2005-03-18 | 2007-05-10 | Yunshan Lu | Search engine that applies feedback from users to improve search results |
US7228301B2 (en) | 2003-06-27 | 2007-06-05 | Microsoft Corporation | Method for normalizing document metadata to improve search results using an alias relationship directory service |
US7231399B1 (en) | 2003-11-14 | 2007-06-12 | Google Inc. | Ranking documents based on large data sets |
US20070150473A1 (en) | 2005-12-22 | 2007-06-28 | Microsoft Corporation | Search By Document Type And Relevance |
US7243102B1 (en) | 2004-07-01 | 2007-07-10 | Microsoft Corporation | Machine directed improvement of ranking algorithms |
US7246128B2 (en) | 2002-06-12 | 2007-07-17 | Jordahl Jena J | Data storage, retrieval, manipulation and display tools enabling multiple hierarchical points of view |
US7260573B1 (en) | 2004-05-17 | 2007-08-21 | Google Inc. | Personalizing anchor text scores in a search engine |
US20070198459A1 (en) | 2006-02-14 | 2007-08-23 | Boone Gary N | System and method for online information analysis |
EP1462950B1 (en) | 2003-03-27 | 2007-08-29 | Sony Deutschland GmbH | Method for language modelling |
US7283997B1 (en) | 2003-05-14 | 2007-10-16 | Apple Inc. | System and method for ranking the relevance of documents retrieved by a query |
US20070260597A1 (en) | 2006-05-02 | 2007-11-08 | Mark Cramer | Dynamic search engine results employing user behavior |
US20070276829A1 (en) | 2004-03-31 | 2007-11-29 | Niniane Wang | Systems and methods for ranking implicit search results |
EP1862916A1 (en) | 2006-06-01 | 2007-12-05 | Microsoft Corporation | Indexing Documents for Information Retrieval based on additional feedback fields |
US7308643B1 (en) | 2003-07-03 | 2007-12-11 | Google Inc. | Anchor tag indexing in a web crawler system |
US20080005068A1 (en) | 2006-06-28 | 2008-01-03 | Microsoft Corporation | Context-based search, retrieval, and awareness |
US20080016053A1 (en) | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Administration Console to Select Rank Factors |
JP2008033931A (en) | 2006-07-26 | 2008-02-14 | Xerox Corp | Method for enrichment of text, method for acquiring text in response to query, and system |
US7346604B1 (en) | 1999-10-15 | 2008-03-18 | Hewlett-Packard Development Company, L.P. | Method for ranking hypertext search results by analysis of hyperlinks from expert documents and keyword scope |
US7386527B2 (en) | 2002-12-06 | 2008-06-10 | Kofax, Inc. | Effective multi-class support vector machine classification |
US20080140641A1 (en) | 2006-12-07 | 2008-06-12 | Yahoo! Inc. | Knowledge and interests based search term ranking for search results validation |
JP2008146424A (en) | 2006-12-12 | 2008-06-26 | Nippon Telegr & Teleph Corp <Ntt> | Xml document conformity calculation method, its program, and information processor |
US20080154888A1 (en) | 2006-12-11 | 2008-06-26 | Florian Michel Buron | Viewport-Relative Scoring For Location Search Queries |
US20080195596A1 (en) | 2007-02-09 | 2008-08-14 | Jacob Sisk | System and method for associative matching |
US7428530B2 (en) | 2004-07-01 | 2008-09-23 | Microsoft Corporation | Dispersing search engine results by using page category information |
US20090006358A1 (en) | 2007-06-27 | 2009-01-01 | Microsoft Corporation | Search results |
US20090006356A1 (en) | 2007-06-27 | 2009-01-01 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US20090024606A1 (en) | 2007-07-20 | 2009-01-22 | Google Inc. | Identifying and Linking Similar Passages in a Digital Text Corpus |
US20090070306A1 (en) * | 2007-09-07 | 2009-03-12 | Mihai Stroe | Systems and Methods for Processing Inoperative Document Links |
US7519529B1 (en) | 2001-06-29 | 2009-04-14 | Microsoft Corporation | System and methods for inferring informational goals and preferred level of detail of results in response to questions posed to an automated information-retrieval or question-answering service |
US20090106221A1 (en) | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US20090106235A1 (en) | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Document Length as a Static Relevance Feature for Ranking Search Results |
US20090106223A1 (en) | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
JP4274533B2 (en) | 2003-07-16 | 2009-06-10 | キヤノン株式会社 | Solid-state imaging device and driving method thereof |
US20090157607A1 (en) | 2007-12-12 | 2009-06-18 | Yahoo! Inc. | Unsupervised detection of web pages corresponding to a similarity class |
US20090164929A1 (en) | 2007-12-20 | 2009-06-25 | Microsoft Corporation | Customizing Search Results |
JP2009146248A (en) | 2007-12-17 | 2009-07-02 | Fujifilm Corp | Content presenting system and program |
US7580568B1 (en) | 2004-03-31 | 2009-08-25 | Google Inc. | Methods and systems for identifying an image as a representative image for an article |
US20090240680A1 (en) | 2008-03-20 | 2009-09-24 | Microsoft Corporation | Techniques to perform relative ranking for search results |
US7606793B2 (en) | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
JP2009252179A (en) | 2008-04-10 | 2009-10-29 | Ntt Docomo Inc | Recommendation information evaluation device and recommendation information evaluation method |
US20090276421A1 (en) | 2008-05-04 | 2009-11-05 | Gang Qiu | Method and System for Re-ranking Search Results |
US20090307209A1 (en) | 2008-06-10 | 2009-12-10 | David Carmel | Term-statistics modification for category-based search |
US7644107B2 (en) | 2004-09-30 | 2010-01-05 | Microsoft Corporation | System and method for batched indexing of network documents |
US7689559B2 (en) | 2006-02-08 | 2010-03-30 | Telenor Asa | Document similarity scoring and ranking method, device and computer program product |
US7689531B1 (en) | 2005-09-28 | 2010-03-30 | Trend Micro Incorporated | Automatic charset detection using support vector machines with charset grouping |
US7693829B1 (en) | 2005-04-25 | 2010-04-06 | Google Inc. | Search engine with fill-the-blanks capability |
US7716225B1 (en) | 2004-06-17 | 2010-05-11 | Google Inc. | Ranking documents based on user behavior and/or feature data |
US7720830B2 (en) | 2006-07-31 | 2010-05-18 | Microsoft Corporation | Hierarchical conditional random fields for web extraction |
US7739277B2 (en) | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US20110106850A1 (en) | 2009-10-29 | 2011-05-05 | Microsoft Corporation | Relevant Individual Searching Using Managed Property and Ranking Features |
US20110137893A1 (en) | 2009-12-04 | 2011-06-09 | Microsoft Corporation | Custom ranking model schema |
US7962462B1 (en) * | 2005-05-31 | 2011-06-14 | Google Inc. | Deriving and using document and site quality signals from search query streams |
US20110235909A1 (en) | 2010-03-26 | 2011-09-29 | International Business Machines Corporation | Analyzing documents using stored templates |
US20110295850A1 (en) | 2010-06-01 | 2011-12-01 | Microsoft Corporation | Detection of junk in search result ranking |
US8326829B2 (en) | 2008-10-17 | 2012-12-04 | Centurylink Intellectual Property Llc | System and method for displaying publication dates for search results |
US8370331B2 (en) | 2010-07-02 | 2013-02-05 | Business Objects Software Limited | Dynamic visualization of search results on a graphical user interface |
US8412702B2 (en) | 2008-03-12 | 2013-04-02 | Yahoo! Inc. | System, method, and/or apparatus for reordering search results |
US20130198174A1 (en) | 2012-01-27 | 2013-08-01 | Microsoft Corporation | Re-ranking search results |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2937519B2 (en) * | 1991-03-08 | 1999-08-23 | 株式会社東芝 | Document search device |
JPH10124524A (en) * | 1996-10-23 | 1998-05-15 | Toshiba Corp | Device for retrieving document and method therefor |
US6366907B1 (en) * | 1999-12-15 | 2002-04-02 | Napster, Inc. | Real-time search engine |
TW530224B (en) * | 2001-12-07 | 2003-05-01 | Inst Information Industry | Relation establishment system and method for key words in search engine |
JP2004054588A (en) * | 2002-07-19 | 2004-02-19 | Just Syst Corp | Document search device, document search method, and program for causing computer to execute the method |
TW575813B (en) * | 2002-10-11 | 2004-02-11 | Intumit Inc | System and method using external search engine as foundation for segmentation of word |
TWI284818B (en) * | 2005-07-21 | 2007-08-01 | Bridgewell Inc | Database searching engine system |
-
2008
- 2008-04-11 US US12/101,951 patent/US8812493B2/en active Active
-
2009
- 2009-03-02 TW TW098106721A patent/TWI486800B/en not_active IP Right Cessation
- 2009-03-10 CN CN200980112928.6A patent/CN101990670B/en active Active
- 2009-03-10 EP EP20090730808 patent/EP2289007B1/en active Active
- 2009-03-10 WO PCT/US2009/036597 patent/WO2009126394A1/en active Application Filing
- 2009-03-10 RU RU2010141559/08A patent/RU2501078C2/en active
- 2009-03-10 JP JP2011504031A patent/JP5492187B2/en active Active
- 2009-03-10 KR KR1020107022177A patent/KR101557294B1/en active Active
- 2009-03-10 BR BRPI0909092-4A patent/BRPI0909092A2/en not_active IP Right Cessation
- 2009-03-10 AU AU2009234120A patent/AU2009234120B2/en active Active
-
2010
- 2010-08-26 ZA ZA2010/06093A patent/ZA201006093B/en unknown
- 2010-08-26 IL IL207830A patent/IL207830A/en active IP Right Grant
Patent Citations (406)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5369778A (en) | 1987-08-21 | 1994-11-29 | Wang Laboratories, Inc. | Data processor that customizes program behavior by using a resource retrieval capability |
US5634124A (en) | 1987-08-21 | 1997-05-27 | Wang Laboratories, Inc. | Data integration by object management |
US5222236A (en) | 1988-04-29 | 1993-06-22 | Overdrive Systems, Inc. | Multiple integrated document assembly data processing system |
US5321833A (en) | 1990-08-29 | 1994-06-14 | Gte Laboratories Incorporated | Adaptive ranking system for information retrieval |
US5257577A (en) | 1991-04-01 | 1993-11-02 | Clark Melvin D | Apparatus for assist in recycling of refuse |
US5544360A (en) | 1992-11-23 | 1996-08-06 | Paragon Concepts, Inc. | Method for accessing computer files and data, using linked categories assigned to each data file record on entry of the data file record |
US6202058B1 (en) | 1994-04-25 | 2001-03-13 | Apple Computer, Inc. | System for ranking the relevance of information objects accessed by computer users |
US5606609A (en) | 1994-09-19 | 1997-02-25 | Scientific-Atlanta | Electronic document verification system and method |
US5594660A (en) | 1994-09-30 | 1997-01-14 | Cirrus Logic, Inc. | Programmable audio-video synchronization method and apparatus for multimedia systems |
US5893092A (en) | 1994-12-06 | 1999-04-06 | University Of Central Florida | Relevancy ranking using statistical ranking, semantics, relevancy feedback and small pieces of text |
US5729730A (en) | 1995-03-28 | 1998-03-17 | Dex Information Systems, Inc. | Method and apparatus for improved information storage and retrieval system |
US5826269A (en) | 1995-06-21 | 1998-10-20 | Microsoft Corporation | Electronic mail interface for a network server |
US5933851A (en) | 1995-09-29 | 1999-08-03 | Sony Corporation | Time-stamp and hash-based file modification monitor with multi-user notification and method thereof |
US6032196A (en) | 1995-12-13 | 2000-02-29 | Digital Equipment Corporation | System for adding a new entry to a web page table upon receiving a web page including a link to another web page not having a corresponding entry in the web page table |
US6269370B1 (en) | 1996-02-21 | 2001-07-31 | Infoseek Corporation | Web scan process |
US6775664B2 (en) | 1996-04-04 | 2004-08-10 | Lycos, Inc. | Information filter system and method for integrated content-based and collaborative/adaptive feedback queries |
US6041323A (en) | 1996-04-17 | 2000-03-21 | International Business Machines Corporation | Information search method, information search device, and storage medium for storing an information search program |
US5905866A (en) | 1996-04-30 | 1999-05-18 | A.I. Soft Corporation | Data-update monitoring in communications network |
US5828999A (en) | 1996-05-06 | 1998-10-27 | Apple Computer, Inc. | Method and system for deriving a large-span semantic language model for large-vocabulary recognition systems |
US6038610A (en) | 1996-07-17 | 2000-03-14 | Microsoft Corporation | Storage of sitemaps at server sites for holding information regarding content |
US6178419B1 (en) | 1996-07-31 | 2001-01-23 | British Telecommunications Plc | Data access system |
US6317741B1 (en) | 1996-08-09 | 2001-11-13 | Altavista Company | Technique for ranking records of a database |
US5765150A (en) | 1996-08-09 | 1998-06-09 | Digital Equipment Corporation | Method for statistically projecting the ranking of information |
US6070158A (en) | 1996-08-14 | 2000-05-30 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
US5870739A (en) | 1996-09-20 | 1999-02-09 | Novell, Inc. | Hybrid query apparatus and method |
US5893116A (en) | 1996-09-30 | 1999-04-06 | Novell, Inc. | Accessing network resources using network resource replicator and captured login script for use when the computer is disconnected from the network |
US5870740A (en) | 1996-09-30 | 1999-02-09 | Apple Computer, Inc. | System and method for improving the ranking of information retrieval results for short queries |
US6222559B1 (en) | 1996-10-02 | 2001-04-24 | Nippon Telegraph And Telephone Corporation | Method and apparatus for display of hierarchical structures |
US6182065B1 (en) | 1996-11-06 | 2001-01-30 | International Business Machines Corp. | Method and system for weighting the search results of a database search engine |
US6326962B1 (en) | 1996-12-23 | 2001-12-04 | Doubleagent Llc | Graphic user interface for database system |
US6285999B1 (en) | 1997-01-10 | 2001-09-04 | The Board Of Trustees Of The Leland Stanford Junior University | Method for node ranking in a linked database |
US5920859A (en) | 1997-02-05 | 1999-07-06 | Idd Enterprises, L.P. | Hypertext document retrieval system and method |
US6415319B1 (en) | 1997-02-07 | 2002-07-02 | Sun Microsystems, Inc. | Intelligent network browser using incremental conceptual indexer |
US5960383A (en) | 1997-02-25 | 1999-09-28 | Digital Equipment Corporation | Extraction of key sections from texts using automatic indexing techniques |
US5890147A (en) | 1997-03-07 | 1999-03-30 | Microsoft Corporation | Scope testing of documents in a search engine using document to folder mapping |
US5848404A (en) | 1997-03-24 | 1998-12-08 | International Business Machines Corporation | Fast query search in large dimension database |
US6272507B1 (en) | 1997-04-09 | 2001-08-07 | Xerox Corporation | System for ranking search results from a collection of documents using spreading activation techniques |
US6484204B1 (en) | 1997-05-06 | 2002-11-19 | At&T Corp. | System and method for allocating requests for objects and managing replicas of objects on a network |
US6182067B1 (en) | 1997-06-02 | 2001-01-30 | Knowledge Horizons Pty Ltd. | Methods and systems for knowledge management |
US6029164A (en) | 1997-06-16 | 2000-02-22 | Digital Equipment Corporation | Method and apparatus for organizing and accessing electronic mail messages using labels and full text and label indexing |
US6012053A (en) | 1997-06-23 | 2000-01-04 | Lycos, Inc. | Computer system with user-controlled relevance ranking of search results |
US6247013B1 (en) | 1997-06-30 | 2001-06-12 | Canon Kabushiki Kaisha | Hyper text reading system |
US20010042076A1 (en) | 1997-06-30 | 2001-11-15 | Ryoji Fukuda | A hypertext reader which performs a reading process on a hierarchically constructed hypertext |
US5933822A (en) | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US5983216A (en) | 1997-09-12 | 1999-11-09 | Infoseek Corporation | Performing automated document collection and selection by providing a meta-index with meta-index values indentifying corresponding document collections |
US6182113B1 (en) | 1997-09-16 | 2001-01-30 | International Business Machines Corporation | Dynamic multiplexing of hyperlinks and bookmarks |
US5956722A (en) | 1997-09-23 | 1999-09-21 | At&T Corp. | Method for effective indexing of partially dynamic documents |
US6999959B1 (en) | 1997-10-10 | 2006-02-14 | Nec Laboratories America, Inc. | Meta search engine |
US6026398A (en) | 1997-10-16 | 2000-02-15 | Imarket, Incorporated | System and methods for searching and matching databases |
US6070191A (en) | 1997-10-17 | 2000-05-30 | Lucent Technologies Inc. | Data distribution techniques for load-balanced fault-tolerant web access |
US6351467B1 (en) | 1997-10-27 | 2002-02-26 | Hughes Electronics Corporation | System and method for multicasting multimedia content |
US6594682B2 (en) | 1997-10-28 | 2003-07-15 | Microsoft Corporation | Client-side system for scheduling delivery of web content and locally managing the web content |
US6128701A (en) | 1997-10-28 | 2000-10-03 | Cache Flow, Inc. | Adaptive and predictive cache refresh policy |
US6553364B1 (en) | 1997-11-03 | 2003-04-22 | Yahoo! Inc. | Information retrieval from hierarchical compound documents |
US5943670A (en) | 1997-11-21 | 1999-08-24 | International Business Machines Corporation | System and method for categorizing objects in combined categories |
US5987457A (en) | 1997-11-25 | 1999-11-16 | Acceleration Software International Corporation | Query refinement method for searching documents |
US6473752B1 (en) | 1997-12-04 | 2002-10-29 | Micron Technology, Inc. | Method and system for locating documents based on previously accessed documents |
US6389436B1 (en) | 1997-12-15 | 2002-05-14 | International Business Machines Corporation | Enhanced hypertext categorization using hyperlinks |
US6145003A (en) | 1997-12-17 | 2000-11-07 | Microsoft Corporation | Method of web crawling utilizing address mapping |
US7010532B1 (en) | 1997-12-31 | 2006-03-07 | International Business Machines Corporation | Low overhead methods and apparatus for shared access storage devices |
US6151624A (en) | 1998-02-03 | 2000-11-21 | Realnames Corporation | Navigating network resources based on metadata |
JPH11232300A (en) | 1998-02-18 | 1999-08-27 | Nri & Ncc Co Ltd | Browsing client server system |
US6349308B1 (en) | 1998-02-25 | 2002-02-19 | Korea Advanced Institute Of Science & Technology | Inverted index storage structure using subindexes and large objects for tight coupling of information retrieval with database management systems |
US6185558B1 (en) | 1998-03-03 | 2001-02-06 | Amazon.Com, Inc. | Identifying the items most relevant to a current query based on items selected in connection with similar queries |
US5913210A (en) | 1998-03-27 | 1999-06-15 | Call; Charles G. | Methods and apparatus for disseminating product information via the internet |
US6125361A (en) | 1998-04-10 | 2000-09-26 | International Business Machines Corporation | Feature diffusion across hyperlinks |
EP0950961A3 (en) | 1998-04-17 | 2000-03-22 | Xerox Corporation | Methods for interactive visualization of spreading activation using time tubes and disk trees |
EP0950961A2 (en) | 1998-04-17 | 1999-10-20 | Xerox Corporation | Methods for interactive visualization of spreading activation using time tubes and disk trees |
US6167402A (en) | 1998-04-27 | 2000-12-26 | Sun Microsystems, Inc. | High performance message store |
US6240407B1 (en) | 1998-04-29 | 2001-05-29 | International Business Machines Corp. | Method and apparatus for creating an index in a database system |
US6314421B1 (en) | 1998-05-12 | 2001-11-06 | David M. Sharnoff | Method and apparatus for indexing documents for message filtering |
US6098064A (en) | 1998-05-22 | 2000-08-01 | Xerox Corporation | Prefetching and caching documents according to probability ranked need S list |
US6285367B1 (en) | 1998-05-26 | 2001-09-04 | International Business Machines Corporation | Method and apparatus for displaying and navigating a graph |
US6182085B1 (en) | 1998-05-28 | 2001-01-30 | International Business Machines Corporation | Collaborative team crawling:Large scale information gathering over the internet |
US6208988B1 (en) | 1998-06-01 | 2001-03-27 | Bigchalk.Com, Inc. | Method for identifying themes associated with a search query using metadata and for organizing documents responsive to the search query in accordance with the themes |
US6240408B1 (en) | 1998-06-08 | 2001-05-29 | Kcsl, Inc. | Method and system for retrieving relevant documents from a database |
US6006225A (en) | 1998-06-15 | 1999-12-21 | Amazon.Com | Refining search queries by the suggestion of correlated terms from prior searches |
US6216123B1 (en) | 1998-06-24 | 2001-04-10 | Novell, Inc. | Method and system for rapid retrieval in a full text indexing system |
US7003442B1 (en) | 1998-06-24 | 2006-02-21 | Fujitsu Limited | Document file group organizing apparatus and method thereof |
US6638314B1 (en) | 1998-06-26 | 2003-10-28 | Microsoft Corporation | Method of web crawling utilizing crawl numbers |
US6199081B1 (en) | 1998-06-30 | 2001-03-06 | Microsoft Corporation | Automatic tagging of documents and exclusion by content |
US6424966B1 (en) | 1998-06-30 | 2002-07-23 | Microsoft Corporation | Synchronizing crawler with notification source |
US6775659B2 (en) | 1998-08-26 | 2004-08-10 | Symtec Limited | Methods and devices for mapping data files |
US6324551B1 (en) | 1998-08-31 | 2001-11-27 | Xerox Corporation | Self-contained document management based on document properties |
RU2138076C1 (en) | 1998-09-14 | 1999-09-20 | Закрытое акционерное общество "МедиаЛингва" | Data retrieval system in computer network |
US6115709A (en) | 1998-09-18 | 2000-09-05 | Tacit Knowledge Systems, Inc. | Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions |
US6549897B1 (en) | 1998-10-09 | 2003-04-15 | Microsoft Corporation | Method and system for calculating phrase-document importance |
US6360215B1 (en) | 1998-11-03 | 2002-03-19 | Inktomi Corporation | Method and apparatus for retrieving documents based on information other than document content |
US6385602B1 (en) | 1998-11-03 | 2002-05-07 | E-Centives, Inc. | Presentation of search results using dynamic categorization |
US6701318B2 (en) | 1998-11-18 | 2004-03-02 | Harris Corporation | Multiple engine information retrieval and visualization system |
US6628304B2 (en) | 1998-12-09 | 2003-09-30 | Cisco Technology, Inc. | Method and apparatus providing a graphical user interface for representing and navigating hierarchical networks |
US6167369A (en) | 1998-12-23 | 2000-12-26 | Xerox Company | Automatic language identification using both N-gram and word information |
JP2000194713A (en) | 1998-12-25 | 2000-07-14 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for retrieving character string, and storage medium stored with character string retrieval program |
US20030074368A1 (en) | 1999-01-26 | 2003-04-17 | Hinrich Schuetze | System and method for quantitatively representing data objects in vector space |
US6418433B1 (en) | 1999-01-28 | 2002-07-09 | International Business Machines Corporation | System and method for focussed web crawling |
US6654742B1 (en) | 1999-02-12 | 2003-11-25 | International Business Machines Corporation | Method and system for document collection final search result by arithmetical operations between search results sorted by multiple ranking metrics |
US6862710B1 (en) | 1999-03-23 | 2005-03-01 | Insightful Corporation | Internet navigation using soft hyperlinks |
US20030217047A1 (en) | 1999-03-23 | 2003-11-20 | Insightful Corporation | Inverse inference engine for high performance web search |
US20040215664A1 (en) | 1999-03-31 | 2004-10-28 | Microsoft Corporation | Method for promoting contextual information to display pages containing hyperlinks |
US6304864B1 (en) | 1999-04-20 | 2001-10-16 | Textwise Llc | System for retrieving multimedia information from the internet using multiple evolving intelligent agents |
US6336117B1 (en) | 1999-04-30 | 2002-01-01 | International Business Machines Corporation | Content-indexing search system and method providing search results consistent with content filtering and blocking policies implemented in a blocking engine |
US6327590B1 (en) | 1999-05-05 | 2001-12-04 | Xerox Corporation | System and method for collaborative ranking of search results employing user and group profiles derived from document collection content analysis |
EP1050830A2 (en) | 1999-05-05 | 2000-11-08 | Xerox Corporation | System and method for collaborative ranking of search results employing user and group profiles |
US6990628B1 (en) | 1999-06-14 | 2006-01-24 | Yahoo! Inc. | Method and apparatus for measuring similarity among electronic documents |
US7072888B1 (en) | 1999-06-16 | 2006-07-04 | Triogo, Inc. | Process for improving search engine efficiency using feedback |
US6973490B1 (en) | 1999-06-23 | 2005-12-06 | Savvis Communications Corp. | Method and system for object-level web performance and analysis |
US6547829B1 (en) | 1999-06-30 | 2003-04-15 | Microsoft Corporation | Method and system for detecting duplicate documents in web crawls |
US6631369B1 (en) | 1999-06-30 | 2003-10-07 | Microsoft Corporation | Method and system for incremental web crawling |
US6873982B1 (en) | 1999-07-16 | 2005-03-29 | International Business Machines Corporation | Ordering of database search results based on user feedback |
US6557036B1 (en) | 1999-07-20 | 2003-04-29 | Sun Microsystems, Inc. | Methods and apparatus for site wide monitoring of electronic mail systems |
US7181438B1 (en) | 1999-07-21 | 2007-02-20 | Alberti Anemometer, Llc | Database access system |
US6598047B1 (en) | 1999-07-26 | 2003-07-22 | David W. Russell | Method and system for searching text |
CA2279119C (en) | 1999-07-29 | 2004-10-19 | Ibm Canada Limited-Ibm Canada Limitee | Heuristic-based conditional data indexing |
JP2001052017A (en) | 1999-08-11 | 2001-02-23 | Fuji Xerox Co Ltd | Hypertext analyzer |
US6442606B1 (en) | 1999-08-12 | 2002-08-27 | Inktomi Corporation | Method and apparatus for identifying spoof documents |
US6636853B1 (en) | 1999-08-30 | 2003-10-21 | Morphism, Llc | Method and apparatus for representing and navigating search results |
US6381597B1 (en) | 1999-10-07 | 2002-04-30 | U-Know Software Corporation | Electronic shopping agent which is capable of operating with vendor sites which have disparate formats |
US7346604B1 (en) | 1999-10-15 | 2008-03-18 | Hewlett-Packard Development Company, L.P. | Method for ranking hypertext search results by analysis of hyperlinks from expert documents and keyword scope |
US20030004952A1 (en) | 1999-10-18 | 2003-01-02 | Mark Nixon | Accessing and updating a configuration database from distributed physical locations within a process control system |
JP2001117934A (en) | 1999-10-19 | 2001-04-27 | Hitachi Ltd | Electronic document management method and system, and recording medium |
US7107218B1 (en) | 1999-10-29 | 2006-09-12 | British Telecommunications Public Limited Company | Method and apparatus for processing queries |
US6263364B1 (en) | 1999-11-02 | 2001-07-17 | Alta Vista Company | Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining document freshness |
US6351755B1 (en) | 1999-11-02 | 2002-02-26 | Alta Vista Company | System and method for associating an extensible set of data with documents downloaded by a web crawler |
US6418453B1 (en) | 1999-11-03 | 2002-07-09 | International Business Machines Corporation | Network repository service for efficient web crawling |
US6418452B1 (en) | 1999-11-03 | 2002-07-09 | International Business Machines Corporation | Network repository service directory for efficient web crawling |
US6539376B1 (en) | 1999-11-15 | 2003-03-25 | International Business Machines Corporation | System and method for the automatic mining of new relationships |
US6886129B1 (en) | 1999-11-24 | 2005-04-26 | International Business Machines Corporation | Method and system for trawling the World-wide Web to identify implicitly-defined communities of web pages |
US7016540B1 (en) | 1999-11-24 | 2006-03-21 | Nec Corporation | Method and system for segmentation, classification, and summarization of video images |
US6772141B1 (en) | 1999-12-14 | 2004-08-03 | Novell, Inc. | Method and apparatus for organizing and using indexes utilizing a search decision table |
US6546388B1 (en) | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6718324B2 (en) | 2000-01-14 | 2004-04-06 | International Business Machines Corporation | Metadata search results ranking system |
US7328401B2 (en) | 2000-01-28 | 2008-02-05 | Microsoft Corporation | Adaptive web crawling using a statistical model |
US7603616B2 (en) | 2000-01-28 | 2009-10-13 | Microsoft Corporation | Proxy server using a statistical model |
US6883135B1 (en) | 2000-01-28 | 2005-04-19 | Microsoft Corporation | Proxy server using a statistical model |
US20050086583A1 (en) | 2000-01-28 | 2005-04-21 | Microsoft Corporation | Proxy server using a statistical model |
EP1120717A2 (en) | 2000-01-28 | 2001-08-01 | Microsoft Corporation | Adaptive web crawling using a statistical model |
US20040199497A1 (en) | 2000-02-08 | 2004-10-07 | Sybase, Inc. | System and Methodology for Extraction and Aggregation of Data from Dynamic Content |
US6931397B1 (en) | 2000-02-11 | 2005-08-16 | International Business Machines Corporation | System and method for automatic generation of dynamic search abstracts contain metadata by crawler |
US6910029B1 (en) | 2000-02-22 | 2005-06-21 | International Business Machines Corporation | System for weighted indexing of hierarchical documents |
JP2001265774A (en) | 2000-03-16 | 2001-09-28 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for retrieving information, recording medium with recorded information retrieval program and hypertext information retrieving system |
US6516312B1 (en) | 2000-04-04 | 2003-02-04 | International Business Machine Corporation | System and method for dynamically associating keywords with domain-specific search engine queries |
US6633867B1 (en) | 2000-04-05 | 2003-10-14 | International Business Machines Corporation | System and method for providing a session query within the context of a dynamic search result set |
US6549896B1 (en) | 2000-04-07 | 2003-04-15 | Nec Usa, Inc. | System and method employing random walks for mining web page associations and usage to optimize user-oriented web page refresh and pre-fetch scheduling |
US6718365B1 (en) | 2000-04-13 | 2004-04-06 | International Business Machines Corporation | Method, system, and program for ordering search results using an importance weighting |
US6859800B1 (en) | 2000-04-26 | 2005-02-22 | Global Information Research And Technologies Llc | System for fulfilling an information need |
US20050044071A1 (en) | 2000-06-08 | 2005-02-24 | Ingenuity Systems, Inc. | Techniques for facilitating information acquisition and storage |
DE10029644A1 (en) | 2000-06-16 | 2002-01-17 | Deutsche Telekom Ag | Hypertext documents evaluation method using search engine, involves calculating real relevance value for each document based on precalculated relevance value and cross references of document |
US6671683B2 (en) | 2000-06-28 | 2003-12-30 | Matsushita Electric Industrial Co., Ltd. | Apparatus for retrieving similar documents and apparatus for extracting relevant keywords |
US20020016787A1 (en) | 2000-06-28 | 2002-02-07 | Matsushita Electric Industrial Co., Ltd. | Apparatus for retrieving similar documents and apparatus for extracting relevant keywords |
US6678692B1 (en) | 2000-07-10 | 2004-01-13 | Northrop Grumman Corporation | Hierarchy statistical analysis system and method |
US6601075B1 (en) | 2000-07-27 | 2003-07-29 | International Business Machines Corporation | System and method of ranking and retrieving documents based on authority scores of schemas and documents |
US6633868B1 (en) | 2000-07-28 | 2003-10-14 | Shermann Loyall Min | System and method for context-based document retrieval |
US6598040B1 (en) | 2000-08-14 | 2003-07-22 | International Business Machines Corporation | Method and system for processing electronic search expressions |
US7080073B1 (en) | 2000-08-18 | 2006-07-18 | Firstrain, Inc. | Method and apparatus for focused crawling |
KR20020015838A (en) | 2000-08-23 | 2002-03-02 | 전홍건 | Method for re-adjusting ranking of document to use user's profile and entropy |
US6959326B1 (en) | 2000-08-24 | 2005-10-25 | International Business Machines Corporation | Method, system, and program for gathering indexable metadata on content at a data repository |
US20030217052A1 (en) | 2000-08-24 | 2003-11-20 | Celebros Ltd. | Search engine method and apparatus |
US20020026390A1 (en) | 2000-08-25 | 2002-02-28 | Jonas Ulenas | Method and apparatus for obtaining consumer product preferences through product selection and evaluation |
JP2002091843A (en) | 2000-09-11 | 2002-03-29 | Nippon Telegr & Teleph Corp <Ntt> | Device and method for selecting server and recording medium recording server selection program |
US20020032772A1 (en) | 2000-09-14 | 2002-03-14 | Bjorn Olstad | Method for searching and analysing information in data networks |
US6598051B1 (en) | 2000-09-19 | 2003-07-22 | Altavista Company | Web page connectivity server |
US6560600B1 (en) | 2000-10-25 | 2003-05-06 | Alta Vista Company | Method and apparatus for ranking Web page search results |
JP2002132769A (en) | 2000-10-25 | 2002-05-10 | Nippon Telegr & Teleph Corp <Ntt> | Method and device for multilateral retrieval service and recording medium recording program therefor |
US6871202B2 (en) | 2000-10-25 | 2005-03-22 | Overture Services, Inc. | Method and apparatus for ranking web page search results |
JP2002140365A (en) | 2000-11-01 | 2002-05-17 | Mitsubishi Electric Corp | Data retrieving method |
US20020055940A1 (en) | 2000-11-07 | 2002-05-09 | Charles Elkan | Method and system for selecting documents by measuring document quality |
US6622140B1 (en) | 2000-11-15 | 2003-09-16 | Justsystem Corporation | Method and apparatus for analyzing affect and emotion in text |
US20020062323A1 (en) | 2000-11-20 | 2002-05-23 | Yozan Inc | Browser apparatus, server apparatus, computer-readable medium, search system and search method |
JP2002157271A (en) | 2000-11-20 | 2002-05-31 | Yozan Inc | Browser device, server device, recording medium, retrieving system and retrieving method |
US20020099694A1 (en) | 2000-11-21 | 2002-07-25 | Diamond Theodore George | Full-text relevancy ranking |
US20050187965A1 (en) | 2000-11-21 | 2005-08-25 | Abajian Aram C. | Grouping multimedia and streaming media search results |
US20020107861A1 (en) | 2000-12-07 | 2002-08-08 | Kerry Clendinning | System and method for collecting, associating, normalizing and presenting product and vendor information on a distributed network |
US20050055347A9 (en) | 2000-12-08 | 2005-03-10 | Ingenuity Systems, Inc. | Method and system for performing information extraction and quality control for a knowledgebase |
US20020078045A1 (en) | 2000-12-14 | 2002-06-20 | Rabindranath Dutta | System, method, and program for ranking search results using user category weighting |
US20020083054A1 (en) | 2000-12-27 | 2002-06-27 | Kyle Peltonen | Scoping queries in a search engine |
US6898592B2 (en) | 2000-12-27 | 2005-05-24 | Microsoft Corporation | Scoping queries in a search engine |
US7415459B2 (en) | 2000-12-27 | 2008-08-19 | Microsoft Corporation | Scoping queries in a search engine |
US7065523B2 (en) | 2000-12-27 | 2006-06-20 | Microsoft Corporation | Scoping queries in a search engine |
JP2002202992A (en) | 2000-12-28 | 2002-07-19 | Speed System:Kk | Homepage retrieval system |
US20020169800A1 (en) | 2001-01-05 | 2002-11-14 | International Business Machines Corporation | XML: finding authoritative pages for mining communities based on page structure criteria |
US6778997B2 (en) | 2001-01-05 | 2004-08-17 | International Business Machines Corporation | XML: finding authoritative pages for mining communities based on page structure criteria |
US20020129014A1 (en) | 2001-01-10 | 2002-09-12 | Kim Brian S. | Systems and methods of retrieving relevant information |
US20030208482A1 (en) | 2001-01-10 | 2003-11-06 | Kim Brian S. | Systems and methods of retrieving relevant information |
US7356530B2 (en) | 2001-01-10 | 2008-04-08 | Looksmart, Ltd. | Systems and methods of retrieving relevant information |
US6766316B2 (en) | 2001-01-18 | 2004-07-20 | Science Applications International Corporation | Method and system of ranking and clustering for document indexing and retrieval |
US20020129015A1 (en) | 2001-01-18 | 2002-09-12 | Maureen Caudill | Method and system of ranking and clustering for document indexing and retrieval |
US20040111408A1 (en) | 2001-01-18 | 2004-06-10 | Science Applications International Corporation | Method and system of ranking and clustering for document indexing and retrieval |
US7496561B2 (en) | 2001-01-18 | 2009-02-24 | Science Applications International Corporation | Method and system of ranking and clustering for document indexing and retrieval |
US6526440B1 (en) | 2001-01-30 | 2003-02-25 | Google, Inc. | Ranking search results by reranking the results based on local inter-connectivity |
US20020103798A1 (en) | 2001-02-01 | 2002-08-01 | Abrol Mani S. | Adaptive document ranking method based on user behavior |
US20020107886A1 (en) | 2001-02-07 | 2002-08-08 | Gentner Donald R. | Method and apparatus for automatic document electronic versioning system |
US20040093328A1 (en) | 2001-02-08 | 2004-05-13 | Aditya Damle | Methods and systems for automated semantic knowledge leveraging graph theoretic analysis and the inherent structure of communication |
JP2002245089A (en) | 2001-02-19 | 2002-08-30 | Hitachi Eng Co Ltd | Web page search system, secondary information collection device, interface device |
US20020165873A1 (en) | 2001-02-22 | 2002-11-07 | International Business Machines Corporation | Retrieving handwritten documents using multiple document recognizers and techniques allowing both typed and handwritten queries |
US20020123988A1 (en) | 2001-03-02 | 2002-09-05 | Google, Inc. | Methods and apparatus for employing usage statistics in document retrieval |
US20020169595A1 (en) | 2001-03-30 | 2002-11-14 | Yevgeny Agichtein | Method for retrieving answers from an information retrieval system |
US20020169770A1 (en) | 2001-04-27 | 2002-11-14 | Kim Brian Seong-Gon | Apparatus and method that categorize a collection of documents into a hierarchy of categories that are defined by the collection of documents |
US20030037074A1 (en) | 2001-05-01 | 2003-02-20 | Ibm Corporation | System and method for aggregating ranking results from various sources to improve the results of web searching |
JP2002366549A (en) | 2001-05-07 | 2002-12-20 | Nec Corp | Selective retrieval metasearch engine and method for performing selective retrieval |
US6738764B2 (en) | 2001-05-08 | 2004-05-18 | Verity, Inc. | Apparatus and method for adaptively ranking search results |
US20020169754A1 (en) | 2001-05-08 | 2002-11-14 | Jianchang Mao | Apparatus and method for adaptively ranking search results |
US20030065706A1 (en) | 2001-05-10 | 2003-04-03 | Smyth Barry Joseph | Intelligent internet website with hierarchical menu |
US20020168106A1 (en) | 2001-05-11 | 2002-11-14 | Miroslav Trajkovic | Palette-based histogram matching with recursive histogram vector generation |
US20030088545A1 (en) | 2001-06-18 | 2003-05-08 | Pavitra Subramaniam | System and method to implement a persistent and dismissible search center frame |
US20030028520A1 (en) | 2001-06-20 | 2003-02-06 | Alpha Shamim A. | Method and system for response time optimization of data query rankings and retrieval |
US7519529B1 (en) | 2001-06-29 | 2009-04-14 | Microsoft Corporation | System and methods for inferring informational goals and preferred level of detail of results in response to questions posed to an automated information-retrieval or question-answering service |
US20030053084A1 (en) | 2001-07-19 | 2003-03-20 | Geidl Erik M. | Electronic ink as a software object |
US7039234B2 (en) | 2001-07-19 | 2006-05-02 | Microsoft Corporation | Electronic ink as a software object |
EP1282060A2 (en) | 2001-08-03 | 2003-02-05 | Overture Services, Inc. | System and method for providing place and price protection in a search result list generated by a computer network search engine |
US6868411B2 (en) | 2001-08-13 | 2005-03-15 | Xerox Corporation | Fuzzy text categorizer |
US20030061201A1 (en) | 2001-08-13 | 2003-03-27 | Xerox Corporation | System for propagating enrichment between documents |
JP2003076715A (en) | 2001-08-20 | 2003-03-14 | Nhn Corp | Method and system for retrieving web pages, program and recording medium |
US7076483B2 (en) | 2001-08-27 | 2006-07-11 | Xyleme Sa | Ranking nodes in a graph |
US20030046389A1 (en) | 2001-09-04 | 2003-03-06 | Thieme Laura M. | Method for monitoring a web site's keyword visibility in search engines and directories and resulting traffic from such keyword visibility |
US20030055810A1 (en) | 2001-09-18 | 2003-03-20 | International Business Machines Corporation | Front-end weight factor search criteria |
US6766422B2 (en) | 2001-09-27 | 2004-07-20 | Siemens Information And Communication Networks, Inc. | Method and system for web caching based on predictive usage |
US6944609B2 (en) | 2001-10-18 | 2005-09-13 | Lycos, Inc. | Search results using editor feedback |
US20040205497A1 (en) | 2001-10-22 | 2004-10-14 | Chiang Alexander | System for automatic generation of arbitrarily indexed hyperlinked text |
RU2001128643A (en) | 2001-10-24 | 2003-07-20 | Закрытое акционерное общество "МедиаЛингва" | A method for determining the rating of links and ranking the paths for users to crawl pages of an Internet site located in a processing device of an Internet network node |
JP2003208434A (en) | 2001-11-07 | 2003-07-25 | Nec Corp | Information retrieval system, and information retrieval method using the same |
US20030101183A1 (en) | 2001-11-26 | 2003-05-29 | Navin Kabra | Information retrieval index allowing updating while in use |
US6763362B2 (en) | 2001-11-30 | 2004-07-13 | Micron Technology, Inc. | Method and system for updating a search engine |
US20030135490A1 (en) | 2002-01-15 | 2003-07-17 | Barrett Michael E. | Enhanced popularity ranking |
US20030217007A1 (en) | 2002-01-29 | 2003-11-20 | Sony Corporation | Method for providing and obtaining content |
US6829606B2 (en) | 2002-02-14 | 2004-12-07 | Infoglide Software Corporation | Similarity search engine for use with relational databases |
JP2003248696A (en) | 2002-02-22 | 2003-09-05 | Nippon Telegr & Teleph Corp <Ntt> | Page rating/filtering method, device, and program, and computer readable recording medium recording the program |
US20060004732A1 (en) | 2002-02-26 | 2006-01-05 | Odom Paul S | Search engine methods and systems for generating relevant search results and advertisements |
US6934714B2 (en) | 2002-03-04 | 2005-08-23 | Intelesis Engineering, Inc. | Method and system for identification and maintenance of families of data records |
KR20030080826A (en) | 2002-04-11 | 2003-10-17 | 한국전자통신연구원 | Effective homepage searching method using similarity recalculation based on url substring relationship |
US20030195882A1 (en) * | 2002-04-11 | 2003-10-16 | Lee Chung Hee | Homepage searching method using similarity recalculation based on URL substring relationship |
US20040003028A1 (en) | 2002-05-08 | 2004-01-01 | David Emmett | Automatic display of web content to smaller display devices: improved summarization and navigation |
US20060149723A1 (en) | 2002-05-24 | 2006-07-06 | Microsoft Corporation | System and method for providing search results with configurable scoring formula |
RU2273879C2 (en) | 2002-05-28 | 2006-04-10 | Владимир Владимирович Насыпный | Method for synthesis of self-teaching system for extracting knowledge from text documents for search engines |
US20040006559A1 (en) | 2002-05-29 | 2004-01-08 | Gange David M. | System, apparatus, and method for user tunable and selectable searching of a database using a weigthted quantized feature vector |
US7246128B2 (en) | 2002-06-12 | 2007-07-17 | Jordahl Jena J | Data storage, retrieval, manipulation and display tools enabling multiple hierarchical points of view |
US20050055340A1 (en) | 2002-07-26 | 2005-03-10 | Brainbow, Inc. | Neural-based internet search engine with fuzzy and learning processes implemented by backward propogation |
US20040024752A1 (en) | 2002-08-05 | 2004-02-05 | Yahoo! Inc. | Method and apparatus for search ranking using human input and automated ranking |
US7152059B2 (en) | 2002-08-30 | 2006-12-19 | Emergency24, Inc. | System and method for predicting additional search results of a computerized database search user based on an initial search query |
US20040049766A1 (en) | 2002-09-09 | 2004-03-11 | Bloch Joshua J. | Method and apparatus for associating metadata attributes with program elements |
JP2004164555A (en) | 2002-09-17 | 2004-06-10 | Fuji Xerox Co Ltd | Apparatus and method for retrieval, and apparatus and method for index building |
US20040064442A1 (en) | 2002-09-27 | 2004-04-01 | Popovitch Steven Gregory | Incremental search engine |
US6886010B2 (en) | 2002-09-30 | 2005-04-26 | The United States Of America As Represented By The Secretary Of The Navy | Method for data and text mining and literature-based discovery |
US7085755B2 (en) | 2002-11-07 | 2006-08-01 | Thomson Global Resources Ag | Electronic document repository management and access system |
US20050060304A1 (en) | 2002-11-19 | 2005-03-17 | Prashant Parikh | Navigational learning in a structured transaction processing system |
US7257574B2 (en) | 2002-11-19 | 2007-08-14 | Prashant Parikh | Navigational learning in a structured transaction processing system |
US7386527B2 (en) | 2002-12-06 | 2008-06-10 | Kofax, Inc. | Effective multi-class support vector machine classification |
US20040117351A1 (en) | 2002-12-14 | 2004-06-17 | International Business Machines Corporation | System and method for identifying and utilizing a secondary index to access a database using a management system without an internal catalogue of online metadata |
US20040141354A1 (en) * | 2003-01-18 | 2004-07-22 | Carnahan John M. | Query string matching method and apparatus |
US20040148278A1 (en) | 2003-01-22 | 2004-07-29 | Amir Milo | System and method for providing content warehouse |
RU2236699C1 (en) | 2003-02-25 | 2004-09-20 | Открытое акционерное общество "Телепортал. Ру" | Method for searching and selecting information with increased relevance |
JP2004265015A (en) | 2003-02-28 | 2004-09-24 | Toyota Motor Corp | Index generator for content search |
US20040181515A1 (en) | 2003-03-13 | 2004-09-16 | International Business Machines Corporation | Group administration of universal resource identifiers with members identified in search result |
US6947930B2 (en) | 2003-03-21 | 2005-09-20 | Overture Services, Inc. | Systems and methods for interactive search query refinement |
US20040186827A1 (en) | 2003-03-21 | 2004-09-23 | Anick Peter G. | Systems and methods for interactive search query refinement |
EP1462950B1 (en) | 2003-03-27 | 2007-08-29 | Sony Deutschland GmbH | Method for language modelling |
US20050033742A1 (en) | 2003-03-28 | 2005-02-10 | Kamvar Sepandar D. | Methods for ranking nodes in large directed graphs |
US7028029B2 (en) | 2003-03-28 | 2006-04-11 | Google Inc. | Adaptive computation of ranking |
RU2319202C2 (en) | 2003-03-31 | 2008-03-10 | Гугл Инк. | System and method for providing preferred language for sorting search results |
US20040194099A1 (en) | 2003-03-31 | 2004-09-30 | John Lamping | System and method for providing preferred language ordering of search results |
US7051023B2 (en) | 2003-04-04 | 2006-05-23 | Yahoo! Inc. | Systems and methods for generating concept units from search queries |
US20040215606A1 (en) | 2003-04-25 | 2004-10-28 | David Cossock | Method and apparatus for machine learning a document relevance function |
US7197497B2 (en) | 2003-04-25 | 2007-03-27 | Overture Services, Inc. | Method and apparatus for machine learning a document relevance function |
US7283997B1 (en) | 2003-05-14 | 2007-10-16 | Apple Inc. | System and method for ranking the relevance of documents retrieved by a query |
US20040249795A1 (en) | 2003-06-05 | 2004-12-09 | International Business Machines Corporation | Semantics-based searching for information in a distributed data processing system |
US20040254932A1 (en) | 2003-06-16 | 2004-12-16 | Vineet Gupta | System and method for providing preferred country biasing of search results |
US20040260695A1 (en) | 2003-06-20 | 2004-12-23 | Brill Eric D. | Systems and methods to tune a general-purpose search engine for a search entry point |
US7228301B2 (en) | 2003-06-27 | 2007-06-05 | Microsoft Corporation | Method for normalizing document metadata to improve search results using an alias relationship directory service |
US20040267722A1 (en) | 2003-06-30 | 2004-12-30 | Larimore Stefan Isbein | Fast ranked full-text searching |
US7308643B1 (en) | 2003-07-03 | 2007-12-11 | Google Inc. | Anchor tag indexing in a web crawler system |
JP4274533B2 (en) | 2003-07-16 | 2009-06-10 | キヤノン株式会社 | Solid-state imaging device and driving method thereof |
KR20030081209A (en) | 2003-08-19 | 2003-10-17 | 장두한 | Apparatus for cutting the surface of the weld zone |
US20050060186A1 (en) | 2003-08-28 | 2005-03-17 | Blowers Paul A. | Prioritized presentation of medical device events |
US20050060311A1 (en) | 2003-09-12 | 2005-03-17 | Simon Tong | Methods and systems for improving a search ranking using related queries |
US20050060310A1 (en) | 2003-09-12 | 2005-03-17 | Simon Tong | Methods and systems for improving a search ranking using population information |
US20050114324A1 (en) | 2003-09-14 | 2005-05-26 | Yaron Mayer | System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers |
US20050240580A1 (en) | 2003-09-30 | 2005-10-27 | Zamir Oren E | Personalization of placed content ordering in search results |
US20050071328A1 (en) | 2003-09-30 | 2005-03-31 | Lawrence Stephen R. | Personalization of web search |
US20050071741A1 (en) | 2003-09-30 | 2005-03-31 | Anurag Acharya | Information retrieval based on historical data |
JP2007507798A (en) | 2003-09-30 | 2007-03-29 | グーグル・インク | Method for scoring a document, method for ranking a document and system for scoring a document |
US7346839B2 (en) | 2003-09-30 | 2008-03-18 | Google Inc. | Information retrieval based on historical data |
US20050086206A1 (en) | 2003-10-15 | 2005-04-21 | International Business Machines Corporation | System, Method, and service for collaborative focused crawling of documents on a network |
US20050086192A1 (en) | 2003-10-16 | 2005-04-21 | Hitach, Ltd. | Method and apparatus for improving the integration between a search engine and one or more file servers |
US20050089215A1 (en) | 2003-10-25 | 2005-04-28 | Carl Staelin | Image artifact reduction using a neural network |
US7231399B1 (en) | 2003-11-14 | 2007-06-12 | Google Inc. | Ranking documents based on large data sets |
US20050125392A1 (en) | 2003-12-08 | 2005-06-09 | Andy Curtis | Methods and systems for providing a response to a query |
US20050144162A1 (en) | 2003-12-29 | 2005-06-30 | Ping Liang | Advanced search, file system, and intelligent assistant agent |
US20060047649A1 (en) | 2003-12-29 | 2006-03-02 | Ping Liang | Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation |
US20050154710A1 (en) | 2004-01-08 | 2005-07-14 | International Business Machines Corporation | Dynamic bitmap processing, identification and reusability |
US20050154746A1 (en) | 2004-01-09 | 2005-07-14 | Yahoo!, Inc. | Content presentation and management system associating base content and relevant additional content |
US20050165753A1 (en) | 2004-01-23 | 2005-07-28 | Harr Chen | Building and using subwebs for focused search |
EP1557770A1 (en) | 2004-01-23 | 2005-07-27 | Microsoft Corporation | Building and using subwebs for focused search |
US20050165781A1 (en) | 2004-01-26 | 2005-07-28 | Reiner Kraft | Method, system, and program for handling anchor text |
JP2004192657A (en) | 2004-02-09 | 2004-07-08 | Nec Corp | Information retrieval system, and recording medium recording information retrieval method and program for information retrieval |
US20050192936A1 (en) | 2004-02-12 | 2005-09-01 | Meek Christopher A. | Decision-theoretic web-crawling and predicting web-page change |
US7281002B2 (en) | 2004-03-01 | 2007-10-09 | International Business Machine Corporation | Organizing related search results |
US20050192955A1 (en) | 2004-03-01 | 2005-09-01 | International Business Machines Corporation | Organizing related search results |
US20050210079A1 (en) | 2004-03-17 | 2005-09-22 | Edlund Stefan B | Method for synchronizing documents for disconnected operation |
US20050210006A1 (en) | 2004-03-18 | 2005-09-22 | Microsoft Corporation | Field weighting in text searching |
US7584221B2 (en) | 2004-03-18 | 2009-09-01 | Microsoft Corporation | Field weighting in text searching |
US20050210105A1 (en) | 2004-03-22 | 2005-09-22 | Fuji Xerox Co., Ltd. | Conference information processing apparatus, and conference information processing method and storage medium readable by computer |
US20050216533A1 (en) | 2004-03-29 | 2005-09-29 | Yahoo! Inc. | Search using graph colorization and personalized bookmark processing |
US7580568B1 (en) | 2004-03-31 | 2009-08-25 | Google Inc. | Methods and systems for identifying an image as a representative image for an article |
US20070276829A1 (en) | 2004-03-31 | 2007-11-29 | Niniane Wang | Systems and methods for ranking implicit search results |
US20050251499A1 (en) | 2004-05-04 | 2005-11-10 | Zezhen Huang | Method and system for searching documents using readers valuation |
US20050262050A1 (en) | 2004-05-07 | 2005-11-24 | International Business Machines Corporation | System, method and service for ranking search results using a modular scoring system |
US7257577B2 (en) | 2004-05-07 | 2007-08-14 | International Business Machines Corporation | System, method and service for ranking search results using a modular scoring system |
US20050256865A1 (en) | 2004-05-14 | 2005-11-17 | Microsoft Corporation | Method and system for indexing and searching databases |
US7260573B1 (en) | 2004-05-17 | 2007-08-21 | Google Inc. | Personalizing anchor text scores in a search engine |
US7716225B1 (en) | 2004-06-17 | 2010-05-11 | Google Inc. | Ranking documents based on user behavior and/or feature data |
US20050283473A1 (en) | 2004-06-17 | 2005-12-22 | Armand Rousso | Apparatus, method and system of artificial intelligence for data searching applications |
US20050289193A1 (en) | 2004-06-25 | 2005-12-29 | Yan Arrouye | Methods and systems for managing data |
US20050289133A1 (en) | 2004-06-25 | 2005-12-29 | Yan Arrouye | Methods and systems for managing data |
US7243102B1 (en) | 2004-07-01 | 2007-07-10 | Microsoft Corporation | Machine directed improvement of ranking algorithms |
US7428530B2 (en) | 2004-07-01 | 2008-09-23 | Microsoft Corporation | Dispersing search engine results by using page category information |
US20060041521A1 (en) | 2004-08-04 | 2006-02-23 | Tolga Oral | System and method for providing graphical representations of search results in multiple related histograms |
US20060031183A1 (en) | 2004-08-04 | 2006-02-09 | Tolga Oral | System and method for enhancing keyword relevance by user's interest on the search result documents |
US20060036598A1 (en) | 2004-08-09 | 2006-02-16 | Jie Wu | Computerized method for ranking linked information items in distributed sources |
US20060047643A1 (en) | 2004-08-31 | 2006-03-02 | Chirag Chaman | Method and system for a personalized search engine |
US20060059144A1 (en) | 2004-09-16 | 2006-03-16 | Telenor Asa | Method, system, and computer program product for searching for, navigating among, and ranking of documents in a personal web |
US20060064411A1 (en) | 2004-09-22 | 2006-03-23 | William Gross | Search engine using user intent |
US7606793B2 (en) | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US7827181B2 (en) | 2004-09-30 | 2010-11-02 | Microsoft Corporation | Click distance determination |
JP4950444B2 (en) | 2004-09-30 | 2012-06-13 | マイクロソフト コーポレーション | System and method for ranking search results using click distance |
US20060074903A1 (en) | 2004-09-30 | 2006-04-06 | Microsoft Corporation | System and method for ranking search results using click distance |
US7761448B2 (en) | 2004-09-30 | 2010-07-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US20100268707A1 (en) | 2004-09-30 | 2010-10-21 | Microsoft Corporation | System and method for ranking search results using click distance |
US20060069982A1 (en) | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Click distance determination |
US8082246B2 (en) | 2004-09-30 | 2011-12-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US7644107B2 (en) | 2004-09-30 | 2010-01-05 | Microsoft Corporation | System and method for batched indexing of network documents |
US7739277B2 (en) | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US20060074883A1 (en) | 2004-10-05 | 2006-04-06 | Microsoft Corporation | Systems, methods, and interfaces for providing personalized search and information access |
US20060074781A1 (en) | 2004-10-06 | 2006-04-06 | Leano Hector V | System for facilitating turnkey real estate investment in Mexico |
US20060173560A1 (en) | 2004-10-07 | 2006-08-03 | Bernard Widrow | System and method for cognitive memory and auto-associative neural network based pattern recognition |
US20060095416A1 (en) | 2004-10-28 | 2006-05-04 | Yahoo! Inc. | Link-based spam detection |
US20060136411A1 (en) | 2004-12-21 | 2006-06-22 | Microsoft Corporation | Ranking search results using feature extraction |
US7716198B2 (en) | 2004-12-21 | 2010-05-11 | Microsoft Corporation | Ranking search results using feature extraction |
US20060161534A1 (en) | 2005-01-18 | 2006-07-20 | Yahoo! Inc. | Matching and ranking of sponsored search listings incorporating web search technology and web content |
US20060173828A1 (en) | 2005-02-01 | 2006-08-03 | Outland Research, Llc | Methods and apparatus for using personal background data to improve the organization of documents retrieved in response to a search query |
US20060195440A1 (en) | 2005-02-25 | 2006-08-31 | Microsoft Corporation | Ranking results using multiple nested ranking |
US20060294100A1 (en) | 2005-03-03 | 2006-12-28 | Microsoft Corporation | Ranking search results using language types |
US7792833B2 (en) | 2005-03-03 | 2010-09-07 | Microsoft Corporation | Ranking search results using language types |
US20060200460A1 (en) | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
US20060206476A1 (en) | 2005-03-10 | 2006-09-14 | Yahoo!, Inc. | Reranking and increasing the relevance of the results of Internet searches |
US20060206460A1 (en) | 2005-03-14 | 2006-09-14 | Sanjay Gadkari | Biasing search results |
US20060212423A1 (en) | 2005-03-16 | 2006-09-21 | Rosie Jones | System and method for biasing search results based on topic familiarity |
US20070106659A1 (en) | 2005-03-18 | 2007-05-10 | Yunshan Lu | Search engine that applies feedback from users to improve search results |
US20060224554A1 (en) | 2005-03-29 | 2006-10-05 | Bailey David R | Query revision using known highly-ranked queries |
US7693829B1 (en) | 2005-04-25 | 2010-04-06 | Google Inc. | Search engine with fill-the-blanks capability |
US20060248074A1 (en) | 2005-04-28 | 2006-11-02 | International Business Machines Corporation | Term-statistics modification for category-based search |
US20060259481A1 (en) | 2005-05-12 | 2006-11-16 | Xerox Corporation | Method of analyzing documents |
US7962462B1 (en) * | 2005-05-31 | 2011-06-14 | Google Inc. | Deriving and using document and site quality signals from search query streams |
US20060282306A1 (en) | 2005-06-10 | 2006-12-14 | Unicru, Inc. | Employee selection via adaptive assessment |
US20060282455A1 (en) | 2005-06-13 | 2006-12-14 | It Interactive Services Inc. | System and method for ranking web content |
US20060287993A1 (en) | 2005-06-21 | 2006-12-21 | Microsoft Corporation | High scale adaptive search systems and methods |
US20070038616A1 (en) | 2005-08-10 | 2007-02-15 | Guha Ramanathan V | Programmable search engine |
US7599917B2 (en) | 2005-08-15 | 2009-10-06 | Microsoft Corporation | Ranking search results using biased click distance |
US20070038622A1 (en) | 2005-08-15 | 2007-02-15 | Microsoft Corporation | Method ranking search results using biased click distance |
US20070050338A1 (en) | 2005-08-29 | 2007-03-01 | Strohm Alan C | Mobile sitemaps |
US7499919B2 (en) | 2005-09-21 | 2009-03-03 | Microsoft Corporation | Ranking functions using document usage statistics |
US20070067284A1 (en) | 2005-09-21 | 2007-03-22 | Microsoft Corporation | Ranking functions using document usage statistics |
US20100191744A1 (en) | 2005-09-21 | 2010-07-29 | Dmitriy Meyerzon | Ranking functions using document usage statistics |
US20070073748A1 (en) | 2005-09-27 | 2007-03-29 | Barney Jonathan A | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
US7716226B2 (en) | 2005-09-27 | 2010-05-11 | Patentratings, Llc | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
US7689531B1 (en) | 2005-09-28 | 2010-03-30 | Trend Micro Incorporated | Automatic charset detection using support vector machines with charset grouping |
US20070085716A1 (en) | 2005-09-30 | 2007-04-19 | International Business Machines Corporation | System and method for detecting matches of small edit distance |
US20070094285A1 (en) | 2005-10-21 | 2007-04-26 | Microsoft Corporation | Question answering over structured content on the web |
US20070150473A1 (en) | 2005-12-22 | 2007-06-28 | Microsoft Corporation | Search By Document Type And Relevance |
US7689559B2 (en) | 2006-02-08 | 2010-03-30 | Telenor Asa | Document similarity scoring and ranking method, device and computer program product |
US20070198459A1 (en) | 2006-02-14 | 2007-08-23 | Boone Gary N | System and method for online information analysis |
US20070260597A1 (en) | 2006-05-02 | 2007-11-08 | Mark Cramer | Dynamic search engine results employing user behavior |
EP1862916A1 (en) | 2006-06-01 | 2007-12-05 | Microsoft Corporation | Indexing Documents for Information Retrieval based on additional feedback fields |
US20080005068A1 (en) | 2006-06-28 | 2008-01-03 | Microsoft Corporation | Context-based search, retrieval, and awareness |
US20080016053A1 (en) | 2006-07-14 | 2008-01-17 | Bea Systems, Inc. | Administration Console to Select Rank Factors |
JP2008033931A (en) | 2006-07-26 | 2008-02-14 | Xerox Corp | Method for enrichment of text, method for acquiring text in response to query, and system |
US7720830B2 (en) | 2006-07-31 | 2010-05-18 | Microsoft Corporation | Hierarchical conditional random fields for web extraction |
JP2009509275A5 (en) | 2006-09-20 | 2009-10-08 | ||
US20080140641A1 (en) | 2006-12-07 | 2008-06-12 | Yahoo! Inc. | Knowledge and interests based search term ranking for search results validation |
US20080154888A1 (en) | 2006-12-11 | 2008-06-26 | Florian Michel Buron | Viewport-Relative Scoring For Location Search Queries |
JP2008146424A (en) | 2006-12-12 | 2008-06-26 | Nippon Telegr & Teleph Corp <Ntt> | Xml document conformity calculation method, its program, and information processor |
US7685084B2 (en) | 2007-02-09 | 2010-03-23 | Yahoo! Inc. | Term expansion using associative matching of labeled term pairs |
US20080195596A1 (en) | 2007-02-09 | 2008-08-14 | Jacob Sisk | System and method for associative matching |
US8412717B2 (en) | 2007-06-27 | 2013-04-02 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US20090006356A1 (en) | 2007-06-27 | 2009-01-01 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US20090006358A1 (en) | 2007-06-27 | 2009-01-01 | Microsoft Corporation | Search results |
US20090024606A1 (en) | 2007-07-20 | 2009-01-22 | Google Inc. | Identifying and Linking Similar Passages in a Digital Text Corpus |
US20090070306A1 (en) * | 2007-09-07 | 2009-03-12 | Mihai Stroe | Systems and Methods for Processing Inoperative Document Links |
US20090106223A1 (en) | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090106221A1 (en) | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US7840569B2 (en) | 2007-10-18 | 2010-11-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090106235A1 (en) | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Document Length as a Static Relevance Feature for Ranking Search Results |
US20090157607A1 (en) | 2007-12-12 | 2009-06-18 | Yahoo! Inc. | Unsupervised detection of web pages corresponding to a similarity class |
JP2009146248A (en) | 2007-12-17 | 2009-07-02 | Fujifilm Corp | Content presenting system and program |
US20090164929A1 (en) | 2007-12-20 | 2009-06-25 | Microsoft Corporation | Customizing Search Results |
US8412702B2 (en) | 2008-03-12 | 2013-04-02 | Yahoo! Inc. | System, method, and/or apparatus for reordering search results |
US20090240680A1 (en) | 2008-03-20 | 2009-09-24 | Microsoft Corporation | Techniques to perform relative ranking for search results |
JP2009252179A (en) | 2008-04-10 | 2009-10-29 | Ntt Docomo Inc | Recommendation information evaluation device and recommendation information evaluation method |
US20090276421A1 (en) | 2008-05-04 | 2009-11-05 | Gang Qiu | Method and System for Re-ranking Search Results |
US20090307209A1 (en) | 2008-06-10 | 2009-12-10 | David Carmel | Term-statistics modification for category-based search |
ZA201100293B (en) | 2008-09-10 | 2012-04-25 | Microsoft Corp | Document length as a static relevance feature for ranking search results |
US8326829B2 (en) | 2008-10-17 | 2012-12-04 | Centurylink Intellectual Property Llc | System and method for displaying publication dates for search results |
US20110106850A1 (en) | 2009-10-29 | 2011-05-05 | Microsoft Corporation | Relevant Individual Searching Using Managed Property and Ranking Features |
US20110137893A1 (en) | 2009-12-04 | 2011-06-09 | Microsoft Corporation | Custom ranking model schema |
US20110235909A1 (en) | 2010-03-26 | 2011-09-29 | International Business Machines Corporation | Analyzing documents using stored templates |
US20110295850A1 (en) | 2010-06-01 | 2011-12-01 | Microsoft Corporation | Detection of junk in search result ranking |
US8370331B2 (en) | 2010-07-02 | 2013-02-05 | Business Objects Software Limited | Dynamic visualization of search results on a graphical user interface |
US20130198174A1 (en) | 2012-01-27 | 2013-08-01 | Microsoft Corporation | Re-ranking search results |
Non-Patent Citations (442)
Title |
---|
"International Search Report", Mailed Aug. 28, 2009, Application No. PCT/US2009/036597, Filed Date Mar. 10, 2009, pp. 1-11. |
"Microsoft FAST Search Server 2010 for SharePoint, Evaluation Guide", Published on Aug. 12, 2010, Available at: http://www.microsoft.com/downloads/info.aspx?na=41&srcfamilyid=f1e3fb39-6959-4185-8b28-5315300b6e6b&srcdisplaylang=en&u=http%3a%2f%2download.microsoft.com%2fdownload%2fA%2f7%2fF%2fA7F98D88-BC15-4F3C-8B71-D42A5ED79964%, 60 pgs. |
"Okapi Similarity Measurement (Okapi"), 11th International Web Conference, www2002, 2002, p. 1. |
Agarwal et al., "Ranking Database Queries Using User Feedback: A Neural Network Approach", Fall 2006, 9 pp. |
Agichtein, "Improving Web Search Ranking by Incorporating User Behavior Information", SIGIR'06, Aug. 6-11, 2006, ACM, 2006. |
Australian Exam Report in Application No. 2008 00521-7, mailed Mar. 11, 2009, 4 pgs. |
Australian First Examiners Report in 2006279520 mailed Oct. 5, 2010. |
Australian Notice of Allowance in Application 2006279520, mailed Mar. 2, 2011, 3 pgs. |
Australian Office Action in Application 2009234120, mailed Feb. 26, 2014, 3 pgs. |
Bandinelli, Luca, "Using Microsoft SharePoint Products and Technologies in Multilingual Scenarios", http://www.microsoft.com/technet/prodtechnol/office/sps2003/maintain/spmultil.mspx, published on Nov. 1, 2003, printed on May 22, 2006, 32 pp. |
Becker, Hila et al., "Learning Similarity Metrics for Event Identification in Social Media," Published Date: Feb. 4-6, 2010, http://infolab.stanford.edu/˜mor/research/becker-wsdm10.pdf, 10 pgs. |
Bohm et al., "Multidimensional Index Structures in Relational Databases", Journal of Intelligent Information Systems, Jul. 2000, vol. 15, Issue 1, pp. 1-20, found at: http://springerlink.com/content/n345270t27538741/fulltext.pdf. |
Brin, S. et al., "The Anatomy of a Large-Scale Hypertextual Web Search Engine", Proceedings of the Seventh International World-Wide Web Conference, Online! Apr. 14, 1998, pp. 1-26. |
Burges, et al., "Learning to Rank with Nonsmooth Cost Functions". |
Canadian Notice of Allowance in Application 2618854, received Jan. 13, 2014, 1 pg. |
Canadian Office Action mailed Mar. 27, 2013 cited in Appln No. 2,618,854. |
Carmel, D. et al., "Searching XML Documents Via XML Fragments", SIGIR Toronto, Canada, Jul.-Aug. 2003, pp. 151-158. |
Chakrabarti, S., "Recent Results in Automatic Web Resource Discovery", ACM Computing Surveys, vol. 31, No. 4es, Dec. 1999, pp. 1-7. |
Chen, Hsinchun et al., "A Smart Itsy Bitsy Spider for the Web", Journal of the American Society for Information Science, 49(7), 1998, pp. 604-618. |
Chen, Michael et al., Cha Cha, "A System for Organizing Intranet Search Results", Computer Science Department, University of California, Berkeley, 1999, pp. 1-12. |
Chinese Application 200510088213.5, Notice of Allowance mailed Apr. 20, 2010, , 4 pgs. |
Chinese Application No. 200510088212.0, First Office Action mailed Jul. 4, 2008, 10 pgs. |
Chinese Application No. 200510088212.0, Notice of Allowance mailed Jan. 8, 2010, 4 pgs. |
Chinese Decision on Reexamination cited in 200680029645.1, mailed Dec. 14, 2012, 15 pp. |
Chinese Decision on Re-Examination in Application 200510084707.6 mailed Aug. 22, 2011, 12 pgs. |
Chinese Decision on Rejection in 200680029645.1 mailed Aug. 12, 2010. |
Chinese Final Rejection in 200510084707.6 mailed Aug. 21, 2009, 13 pgs. |
Chinese Final Rejection in 200510088213.5 mailed Mar. 6, 2009. |
Chinese First Office Action in 200510084707.6 mailed Mar. 28, 2008, 10 pgs. |
Chinese First Office Action in 200680034531.6 mailed Sep. 11, 2009, 7 pgs. |
Chinese First Office Action in 200980112928.6 mailed Jun. 8, 2012. |
Chinese First Office Action in Chinese Application/Patent No. 200880112416.5, mailed Aug. 12, 2011, 11 pgs. |
Chinese First Official Action in 200510088213.5 mailed May 9, 2008. |
Chinese First Official Action in 200510088527.5 mailed Apr. 18, 2008. |
Chinese First Official Action in 200680029645.1 mailed Jun. 19, 2009. |
Chinese First Official Action in 200680035828.4 mailed Jun. 19, 2009. |
Chinese Notice of Allowance in 200510088527.5 mailed Jul. 24, 2009, 4 pgs. |
Chinese Notice of Allowance in 200680034531.6 mailed Oct. 14, 2010, 6 pgs. |
Chinese Notice of Allowance in Application 200510084707.6, mailed Sep. 25, 2012, 4 pgs. |
Chinese Notice of Allowance in Application 200880112416.5, mailed Jul. 18, 2012, 4 pgs. |
Chinese Notice of Allowance in Application 2009801129286, mailed Aug. 30, 2013, 4 pgs. |
Chinese Notice of Reexamination dated Aug. 20, 2012 cited in Appln No. 200680029645.1. |
Chinese Second Office Action in 200510084707.6 mailed Nov. 7, 2008, 10 pgs. |
Chinese Second Office Action in 200680029645.1 mailed Apr. 6, 2010. |
Chinese Second Office Action mailed Mar. 4, 2013 cited in Appln No. 200980112928.6. |
Chinese Second Official Action in 200510088213.5 mailed Oct. 10, 2008. |
Chinese Second Official Action in 200510088527.5 mailed Dec. 26, 2008. |
Chinese Third Office Action in 200510084707.6 mailed Feb. 20, 2009, 12 pgs. |
Chinese Third Official Action in 200510088213.5 mailed Sep. 4, 2009. |
Cho et al., "Efficient Crawling Through URL Ordering", In Proceedings of the 7th International World Wide Web Conference, Apr. 1998, pp. 161-180. |
Conlon, M., "Inserts Made Simple", American Printer, Nov. 1, 2002, retrieved from internet on Dec. 17, 2010: http://americanprinter.com/press/other/printing-inserts-made-simple/, 4 pp. |
Craswell, N. et al., "TREC12 Web Track as CSIRO", TREC 12, Nov. 2003, 11 pp. |
Cutler, M. et al., "A New Study on Using HTML Structures to Improve Retrieval", 11th IEEE International Conference on Chicago, IL, Nov. 9-11, 1999, pp. 406-409. |
Desmet, P. et al., "Estimation of Product Category Sales Responsiveness to Allocated Shelf Space", Intern. J. of Research in Marketing, vol. 15, No. 5, Dec. 9, 1998, pp. 443-457. |
Ding, Chen et al., "An Improved Usage-Based Ranking", obtained online Jul. 1, 2009 at: http://www.springerlink.com/content/h0jut6d1dnrk5227/fulltext.pdf, 8 pgs. |
Egyptian Official Action in PCT 269/2008 mailed Feb. 1, 2010. |
Eiron, et al., "Analysis of Anchor Text for Web Search", SIGIR 2003, ACM. |
EP 2nd Office Action in Application 05105672.9, mailed Oct. 15, 2009, 4 pgs. |
EP Communication to cancel the oral summons in Application 05105048.2, mailed Jul. 16, 2012, 1 pg. |
EP Exam Report in EP 00309121.2-1522 mailed Jul. 4, 2003. |
EP Exam Report in EP 00309121.2-1527 mailed Feb. 8, 2007. |
EP Exam Report in EP 00309121.2-1527 mailed Jun. 16, 2004. |
EP Exam Report in EP 05105048.2-2201 mailed Apr. 23, 2007. |
EP Examination Report in Application 05105672.9, mailed Oct. 24, 2006, 4 pgs. |
EP Notice of Allowance in Application 05105048.2, mailed Aug. 13, 2012, 8 pgs. |
EP Office Action in Application 05105107.6, mailed Mar. 28, 2008, 6 pgs. |
EP Result of consultation in Application 05105048.2, mailed Aug. 8, 2012, 3 pgs. |
EP Search Report in Application 05105107.6, mailed Apr. 7, 2006, 3 pgs. |
EP Search Report in Application 05105672.9, mailed Feb. 6, 2006, 3 pgs. |
EP Search Report in EP 00309121 mailed Jul. 18, 2002. |
EP Search Report in EP 05105048 mailed Jan. 17, 2006. |
EP Search Report in EP 05105110 dated Aug. 11, 2006. |
EP Summons to Attend Oral Proceedings in EP 05105048.2-2201 mailed Apr. 3, 2012. |
European Communication in Application 05105107.6, mailed Dec. 17, 2012, 4 pgs. |
European Extended Search Report in Application 06836141.9 mailed Dec. 27, 2011, 8 pgs. |
European Extended Search Report in Application 097308084, mailed Oct. 2, 2012, 7 pgs. |
European Notice of Allowance in Application 00309121.2, mailed Jun. 15, 2009, 5 pgs. |
European Notice of Allowance in Application EP 06836141.9, mailed Jan. 31, 2013, 6 pgs. |
European Official Action in 05105110.0/1527 mailed Aug. 4, 2010. |
European Report on Result of Consultation in Application EP 06836141.9, mailed Jan. 9, 2013, 3 pgs. |
European Search Report in 08840594.9-2201 mailed Feb. 23, 2011. |
European Search Report in 08840594.9-2201 mailed Jan. 21, 2011. |
European Search Report in Application 06789800.7 mailed Oct. 13, 2011, 11 pgs. |
Extended European Search Report in Application 06804098.9, mailed Dec. 19, 2011, 7 pgs. |
Fagin, R. et al., "Searching the Workplace Web", IBM Almaden Research Center, In Proceedings of the Twelfth International World Wide Web Conference, Budapest, May 20, 2003, 21 pgs. |
Fagin, Ronald, "Searching the Workplace Web", Mar. 3, 2005, pp. 1-10. |
Fiedler, J. et al., Using the Web Efficiently: Mobile Crawlers, 17th Annual Int'l. Conference of the Association of Management on Computer Science, Aug. 1999, pp. 324-329. |
Gross, Christian, Microsoft Interactive Developer, No. 2, "Integrating the Microsoft Index Server with Active Server pp.", Jun. 1997, 21 pgs. |
Hawking, D. et al., "Overview of the TREC-8 Web Track", TREC, Feb. 2000, pp. 1-18. |
Hawking, D., "Overview of the TREC-9 Track", TREC, 2000, pp. 1-16. |
Hawking., D. et al., "Overview of TREC-7 Very Large Collection Track", TREC, Jan. 1999, pp. 1-13. |
Heery, Rachel, "Review of Metadata Formats", Program, vol. 30, No. 4, Oct. 1996, 1996 IEEE, pp. 345-373. |
Hiemstra, D. et al., "Relevance Feedback for Best Match Term Weighting Algorithms in Information Retrieval", Proceedings of the Joint DELOS-NSF Workshop on Personalisation and Recommender Systems in Digital Libraries, ERCIM Workshop Proceedings 01/W03, pp. 37-42, Jun. 2001. |
Hoeber, Orland et al., "Evaluating the Effectiveness of Term Frequency Histograms for Supporting Interactive Web Search Tasks," Published Date: Feb. 25-27, 2008, http://delivery.acm.org/10.1145/1400000/1394484/p360-hoeber.pdf?key1=1394484&key2=1611170721&coll=GUIDE&dl=GUIDE&CFID=83362159&CFTOKEN=63982632, 9 pgs. |
Horikawa, Akira, "Table design correcting room of Access user", Visual Basic Magazine, vol. 6, No. 3, pp. 158-170, Shoeisha Col. Ltd., Japan, Mar. 1, 2000. (No English translation). As cited in 50037.0292JP01, 309549.03, JP 2005-175174. |
Huang et al., "Design and Implementation of a Chinese Full-Text Retrieval System Based on Probabilistic Model", IEEE, 1993, pp. 1090-1093. |
Huuhka "Google: Data Structures and Algorithms". |
Indonesian Notice of Allowance in Application W00200800848 mailed Jun. 9, 2011, 4 pgs. |
Japanese Appeal Decision and Notice of Allowance in Application 2005-175174, mailed Jun. 18, 2013, 4 pgs. |
Japanese Appeal Decision in 2008-527094 (Appeal No. 2010-011037) mailed Nov. 4, 2011—31 pgs., only first page translated. |
Japanese Final Notice of Reason for Rejection in Application 2011-527079, mailed May 5, 2014, 6 pgs. |
Japanese Final Notice of Rejection in Application No. 2005-187816 mailed Mar. 16, 2012, 5 pgs. |
Japanese Final Rejection in 2005-175172 mailed Jun. 7, 2011, 5 pgs. |
Japanese Final Rejection in 2008-527094 mailed Jan. 22, 2010. |
Japanese Final Rejection in JP Application 2008-532469, mailed Jan. 29, 2010, 19 pgs. |
Japanese Interrogation in Application 2005-175174, mailed Jul. 24, 2012, 7 pgs. |
Japanese Notice of Allowance in 2005-175172 mailed Mar. 6, 2012, 6 pgs. |
Japanese Notice of Allowance in 2005-175173 mailed Jun. 7, 2011, 6 pgs. |
Japanese Notice of Allowance in Application 2011-021985, mailed Dec. 25, 2012, 6 pgs. |
Japanese Notice of Allowance in Application 2011-194741, mailed Sep. 6, 2013, 4 pgs. |
Japanese Notice of Allowance in Application 2011-504031, mailed Jan. 30, 2014, 4 pgs. |
Japanese Notice of Allowance in JP Application 2008-532469, mailed Feb. 22, 2011, 6 pgs. |
Japanese Notice of Final Rejection in 2005-175174, mailed Aug. 5, 2011, 5 pgs. |
Japanese Notice of Rejection in 2005-175172 mailed Sep. 28, 2010. |
Japanese Notice of Rejection in 2005-175173 mailed Oct. 1, 2010. |
Japanese Notice of Rejection in 2005-175174 , mailed Oct. 29, 2010, 13 pgs. |
Japanese Notice of Rejection in 2008-527094 mailed Sep. 11, 2009. |
Japanese Notice of Rejection in Application 2011-194741, mailed May 14, 2013, 4 pgs. |
Japanese Notice of Rejection in Application 2011-266249, mailed Sep. 2, 2013, 7 pgs. |
Japanese Notice of Rejection in Application 2011-504031, mailed May 14, 2013, 4 pgs. |
Japanese Notice of Rejection in Application 2011-527079, mailed Oct. 8, 2013, 15 pgs. |
Japanese Notice of Rejection in Application No. 2005-187816 mailed May 20, 2011, 13 pgs. |
Japanese Office Action in JP Application 2008-532469, mailed Sep. 29, 2009, 18 pgs. |
Jones, K. et al., "A probabilistic model of information retrieval: development and status", Department of Information Science, City University, London, Aug. 1998, 76 pgs. |
Kazama, K., "A Searching and Ranking Scheme Using Hyperlinks and Anchor Texts", IPSJ SIG Technical Report, vol. 2000, No. 71, Information Processing Society of Japan, Japan, Jul. 28, 2000, pp. 17-24. |
Kleinberg, Jon M., "Authoritative Sources in a Hyperlinked Environment", Proceedings of the aCM-SIAM symposium on Discrete Algorithms, 1998, 34 pp. |
Korean Notice of Preliminary Rejection in Application 1020087006775, mailed Feb. 4, 2013, 1 pg. |
Korean Notice of Preliminary Rejection mailed Feb. 4, 2013 cited in 10-2008-7007702. |
Korean Notice of Preliminary Rejection mailed Jan. 21, 2013 cited in 10-2008-7003121. |
Korean Official Action in 2005-0057199 mailed Aug. 4, 2011, pgs. |
Korean Official Action in 2005-0057199 mailed Mar. 26, 2012, 5 pgs. |
Kotsakis, E., "Structured Information Retrieval in XML Documents", Proceedings of the ACM Symposium on Applied Computing, Madrid, Spain, 2002, pp. 663-667. |
Kucuk, Mehmet Emin, et al., "Application of Metadata Concepts to Discovery of Internet Resources", ADVIS 2000, INCS 1909, pp. 304-313, 2000. |
Kwok, K.L., "A Network Approach to Probabilistic Information Retrieval", ACM Transactions on Information Systems, vol. 13, No. 3, Jul. 1995, pp. 324-353. |
Lalmas, M., "Uniform Representation of Content and Structure for Structured Document Retrieval", 20th SGES International Conference on Knowledge Based Systems and Applied Artificial Intelligence, Cambridge, UK, Dec. 2000, pp. 1-12. |
Lam et al, "Automatic document classification based on probabilistic reasoning: model and performance analysis," Oct. 12-15, 1997, IEEE, Computational Cybernetics and Simulation vol. 3, pp. 2719-2723. |
Larkey, Leah S., et al., "Collection Selection and Results Merging with Topically Organized U.S. Patents and TREC Data", Proceedings of the Ninth International Conference on Information Knowledge Management, CIKM 2000, Nov. 6-11, 2000, pp. 282-289. |
Lee, J.K.W. et al., "Intelligent Agents for Matching Information Providers and Consumers on the Worl-Wide Web", IEEE, 1997, pp. 189-199. |
Ljosland, Mildrid, "Evaluation of Web Search Engines and the Search for Better Ranking Algorithms," http://www.aitel.hist.no/~mildrid/dring/paper/SIGIR.html, SIGIR99 Workshop on Evaluation of Reb Retrieval, Aug. 19, 1999, 5 pages. |
Ljosland, Mildrid, "Evaluation of Web Search Engines and the Search for Better Ranking Algorithms," http://www.aitel.hist.no/˜mildrid/dring/paper/SIGIR.html, SIGIR99 Workshop on Evaluation of Reb Retrieval, Aug. 19, 1999, 5 pages. |
Losee, R. et al., "Research in Information Organization", Literature Review, School of Information and Library Science, Section 4, pp. 53-96, Jan. 2001. |
Losee, Robert M. et al., "Measuring Search Engine Quality and Query Difficulty: Ranking with Target and Freestyle," http://ils.unc.edu/~losee/paril.pdf, Journal of the American Society for Information Science, Jul. 29, 1999, 20 pages. |
Losee, Robert M. et al., "Measuring Search Engine Quality and Query Difficulty: Ranking with Target and Freestyle," http://ils.unc.edu/˜losee/paril.pdf, Journal of the American Society for Information Science, Jul. 29, 1999, 20 pages. |
Luxenburger et al., "Matching Task Profiles and User Needs in Personalized Web Search", CIKM Proceeding of the 17th ACM Conference on Information and Knowledge Mining, Oct. 2008, pp. 689-698. |
Malaysia Adverse Report in Application PI20063920, mailed Jul. 31, 2012, 3 pgs. |
Malaysia Adverse Search Report in Application PI20080638, mailed Jul. 31, 2012, 4 pgs. |
Malaysian Notice of Allowance in Application PI 20080638, mailed Jun. 28, 2013, 2 pgs. |
Malaysian Notice of Allowance in Application PI20063920, mailed Dec. 14, 2012, 2 pgs. |
Malaysian Substantive Examination Report dated Jul. 31, 2012 cited in Appln No. PI 20063920. |
Managing External Content in Microsoft Office SharePoint Portal Server 2003, http://www.microsoft.com/technet/prodtechnol/sppt/reskit/c2261881x.mspx, published on Jun. 9, 2004, printed on May 22, 2006, 20 pp. |
Manning, C. et al., "CS276A Text Information Retrieval, Mining, and Exploitation: Lecture 12", Stanford University CS276A/SYMBSYS2391/LING2391 Test Information Retrieval, Mining, and Exploitation, Fall 2002, last modified Nov. 18, 2002, 8 pgs. |
Matsuo, Y., "A New Definition of Subjective Distance Between Web Pages," IPSJ Journal, vol. 44, No. 1, Information Processing Society of Japan, Japan, Jan. 15, 2003, pp. 88-94. |
Matveeva, Irina et al., "High Accuracy Retrieval with Multiple Nested Ranker," http://people.cs.uchicago.edu/~matveeva/RankerSIGIR06.pdf, SIGIR'06, Seattle, WA Aug. 6-11, 2006, 8 pages. |
Matveeva, Irina et al., "High Accuracy Retrieval with Multiple Nested Ranker," http://people.cs.uchicago.edu/˜matveeva/RankerSIGIR06.pdf, SIGIR'06, Seattle, WA Aug. 6-11, 2006, 8 pages. |
Mexican Office Action with Summary in PA/a/2008/002173 mailed Jun. 5, 2012. |
Microsoft Full-Text Search Technologies, http://www.microsoft.com/technet/prodtechnol/sppt/sharepoint/evaluate/featfunc/mssearc . . . , published on Jun. 1, 2001, printed on May 22, 2006, 13 pp. |
Microsoft SharePoint Portal Server 2001 Resource Kit: Chapter 24, Analyzing the Default Query for the Dashboard, http://www.microsoft.com/technet/prodtechnol/sppt/sharepoint/reskit/part5/c24spprk.mspx, printed on May 22, 2006, 5 pp. |
Microsoft SharePoint Portal Server 2001 White Paper, "Microsoft SharePoint Portal Server: Advanced Technologies for Information Search and Retrieval," http://download.microsoft.com/download/3/7/a/37a762d7-dbe6-4b51-a6ec-f6136f44fd65/SPS—Search.doc, Jun. 2002, 12 pages. |
Mittal et al., "Framework for Synthesizing Semantic-Level Indices", Multimedia Tools and Applications, Jun. 2003, vol. 20, Iss. 2., pp. 1-24, found online at: http://www.springerlink.com/content/tv632274r1267305/fulltext.pdf. |
MSDN, "Understanding Ranking," http://msdn.microsoft.com/en-us/library/ms142524.aspx, Sep. 2007, 4 pages. |
Murata, Shin Ya, et al., "Ranking Search Results based on Information Needs in Conjunction with Click-Log Analysis", Journal of Japan Database Society, Japan Database Society, Mar. 27, 2009, vol. 7, Part 4, pp. 37-42. |
Najork, Marc et al., "Breadth-First Crawling Yields High-Quality pp.", ACM, Compaq Systems Research Center, Hong Kong, 2001, pp. 114-118. |
Ncik Creswell, Stephen Robertson, Hugo Zaragoza and Michael Taylor, Relevance Weighting for Query Independent Evidence, Aug. 15-19, 2005, ACM, p. 416-423. * |
Nelson, Chris, "Use of Metadata Registries for Searching for Statistical Data", IEEE 2002, Dimension EDI Ltd., pp. 232-235, 2002. |
New Zealand Examination Report in Application No. 566532, mailed Oct. 15, 2009, 2 pgs. |
Nie, Jien Yun, "Introduction to Information Retrieval", University of Montreal Canada, 1989 pp. 1-11. |
Numerico, T., "Search engines organization of information and Web Topology", http://www.cafm.lsbu.ac.uk/eminars/sse/numerico-6-dec-2004.pdf, Dec. 6, 2004, 32 pgs. |
Ogilvie, P. et al., "Combining Document Representations for Known-Item Search", Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, 2003, pp. 143-150. |
Page, L. et al., "The PageRank Citation Ranking: Bringing Order to the Web", Internet Citation, found online at: http://citeseer.nj.nec.com/page98pagerank.html, retrieved Sep. 16, 2002, 18 pgs. |
PCT International Search Report and Written Opinion in Application PCT/US2011/033125, mailed Dec. 15, 2011, 8 pgs. |
PCT International Search Report, Application No. PCT/US2006/037206, mailed Jan. 16, 2007, 10 pgs. |
PCT Search Report in Application PCT/US2013/022825, mailed Apr. 30, 2013, 11 pgs. |
PCT Search Report in PCT/US2006/031965 mailed Jan. 11, 2007. |
PCT Search Report in PCT/US2008/011894 mailed Feb. 27, 2009, 12 pgs. |
PCT Search Report in PCT/US2009/063333 dated Apr. 22, 2010, 10 pgs. |
Pera, Maria S. et al., "Using Word Similarity to Eradicate Junk Emails," Published Date: Nov. 6-8, 2007, http://delivery.acm.org/10.1445/1330000/1321581/p943-pera.pdf?key1=1321581&key2=842117072&coll=GUIDE&dl=GUIDE&CFID=83362328&CFTOKEN=17563913, 4 pgs. |
Philippines Letters Patent in Application 12008500189, issued Jan. 6, 2012, 2 pgs. |
Philippines Office Action in 1-2008-500189 mailed Mar. 11, 2011, 1 page. |
Philippines Official Action in 1-2008-500189 mailed Jun. 22, 2011, 1 page. |
Philippines Official Action in 1-2008-500433 mailed Mar. 24, 2011, 1 page. |
Planning Your Information Structure Using Microsoft Office SharePoint Portal Server 2003, http://www.microsoft.com/technet/prodtechnol/sppt/reskit/c0861881x.mspx, published on Jun. 9, 2004, printed on May 22, 2006, 22 pp. |
Radlinski, Filip, et al. "Query Chains: Learning to Rank from Implicit Feedback, "http://delivery.acm.org/10.1145/1090000/1081899/p239-radlinski. pdf?key1=1081899&key2=3628533811&coll=GUIDE& CFID=27212902&CFTOKEN=53118399, KDD'05, Chicago, IL, Aug. 21-24, 2005,10 pages. |
Robertson, S. et al., "Okapi at TREC-3", Centre for Interactive Systems Research Department of Information Science, Third Text Retrieval Conference, 1995, 19 pp. |
Robertson, S. et al., "Okapi at TREC-4", 1996, 24 pp. |
Robertson, S. et al., "Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval", Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1994, pp. 232-241. |
Russian Application No. 2008105758, Notice of Allowance mailed Dec. 16, 2010, 5 pgs. |
Russian Notice of Allowance in Application 2011108842, mailed Dec. 16, 2013, 7 pgs. (English translation). |
Russian Notice of Allowance in Application No. 2008110731/08, mailed Oct. 25, 2010, 7 pgs. |
Russian Notice of Allowance in Application No. 2010141559, mailed Jun. 27, 2013, 6 pgs. |
Russian Office Action in Application 2010141559, mailed Jan. 28, 2013, 6 pgs. |
Russian Official Action in 2008105758 mailed Jun. 29, 2010. |
Russian Official Action in 2010141559 mailed Jan. 28, 2013, 4 pp. |
Schulz, Stefan, et al., "Indexing Medical WWW Documents by Morphemes", MEDINFO 2001 Proceedings of the 10th World Congress on Medical Informatics, Park I, IOS Press, Inc., pp. 266-270, 2001. |
Senecal, Sylvain, "Consumers' Decision-Making Process and Their Online Shopping Behavior: A Clickstream Analysis", Jun. 1, 2004, pp. 1600-1607. |
Shamsfard, Mehrnoush, et al., "ORank: An Ontology Based System for Ranking Documents," http://www.waset.org/ijcs/v1/v1-3-30.pdf, International Journal of Computer Science, vol. 1, No. 3, Apr. 10, 2006, pp. 225-231. |
SharePoint Portal Server 2001 Planning and Installation Guide, http://www.microsoft.com/technet/prodtechnol/sppt/sharepoint/plan/planinst.mspx, printed on May 22, 2006, 86 pp. |
Singhal, A. et al., "AT&T at TREC-9", Proceedings of the Ninth Text Retrieval Conference, NIST Special Publication 500-249, 'Online! 2001, pp. 103-105. |
Singhal, A. et al., "Document Length Normalization", Cornell University, vol. 32, No. 5, 1996, pp. 619-633. |
Smyth, Barry, "Relevance at a Distance—An Investigation of Distance-Biased Personalization on the Mobile Internet", no date, pp. 1-6. |
Song, et al., "Exploring URL Hit Priors for Web Search", vol. 3936, Springer Berlin / Heidelberg, 2006. |
South Africa Notice of Allowance in Application No. 2008/02250 mailed Jul. 23, 2009, 1 page. |
Sturdy, Derek, "Squirrels and nuts: metadata and knowledge management", Business Information Review, 18(4), pp. 34-42, Dec. 2001. |
Svore, Krysta M. et al., "Improving Web Spam Classifaction using Rank-time Features," Published Date: May 8, 2007, http://www2007.org/workshops/paper—101.pdf, 8 pgs. |
Taiwan Office Action dated Oct. 19, 2012 cited in Appln No. PI 6546. |
Taiwanese Notice of Allowance in Application 95129817, mailed Jan. 29, 2013, 4 pgs. |
Taiwanese Search Report in Application 95129817, mailed Oct. 19, 2012, 1 pg. |
Takeda, Takaharu et al., "Multi-Document Summarization by efficient text processing", Proceedings of the FIT2007, Sixth Forum on Information Technology, vol. 2, No. E-014, pp. 165-168, Information Processing Society of Japan, Japan, Aug. 22, 2007. (not an English document). |
Taylor, et al., "Optimisation Methods for Ranking Functions with Multiple Parameters", CIKM'06, Nov. 5-11, 2006, ACM, 2006. |
U.S. Appl. No. 09/493,748, Advisory Action mailed Jan. 4, 2005, 2 pgs. |
U.S. Appl. No. 09/493,748, Amendment and Response filed Apr. 20, 2004, 16 pgs. |
U.S. Appl. No. 09/493,748, Amendment and Response filed Oct. 12, 2004, 18 pgs. |
U.S. Appl. No. 09/493,748, filed Jan. 28, 2000 entitled "Adaptive Web Crawling Using a Statistical Model". |
U.S. Appl. No. 09/493,748, Final Office Action mailed Jul. 20, 2004, 14 pgs. |
U.S. Appl. No. 09/493,748, Office Action mailed Sep. 25, 2003, 11 pgs. |
U.S. Appl. No. 09/603,695, Advisory Action mailed Aug. 27, 2004, 3 pgs. |
U.S. Appl. No. 09/603,695, Amendment and Response filed Feb. 27, 2004, 13 pgs. |
U.S. Appl. No. 09/603,695, Amendment and Response filed Jul. 22, 2004, 13 pgs. |
U.S. Appl. No. 09/603,695, Amendment and Response filed Nov. 5, 2004, 9 pgs. |
U.S. Appl. No. 09/603,695, Final Office Action mailed May 18, 2004, 12 pgs. |
U.S. Appl. No. 09/603,695, Notice of Allowance mailed Dec. 21, 2004, 8 pgs. |
U.S. Appl. No. 09/603,695, Office Action mailed Nov. 7, 2003, 11 pgs. |
U.S. Appl. No. 09/749,005, Amendment and Response filed Apr. 28, 2003, 12 pgs. |
U.S. Appl. No. 09/749,005, Amendment and Response filed Jun. 21, 2004, 14 pgs. |
U.S. Appl. No. 09/749,005, Notice of Allowance mailed Apr. 7, 2005, 4 pgs. |
U.S. Appl. No. 09/749,005, Notice of Allowance mailed Aug. 30, 2004, 9 pgs. |
U.S. Appl. No. 09/749,005, Notice of Allowance mailed Mar. 4, 2005, 4 pgs. |
U.S. Appl. No. 09/749,005, Office Action mailed Jun. 12, 2003, 10 pgs. |
U.S. Appl. No. 09/749,005, Office Action mailed Oct. 28, 2002, 12 pgs. |
U.S. Appl. No. 10/609,315, Amendment and Response filed Mar. 17, 2006, 14 pgs. |
U.S. Appl. No. 10/609,315, Amendment and Response filed Nov. 29, 2006, 23 pgs. |
U.S. Appl. No. 10/609,315, Notice of Allowance mailed Jan. 24, 2007, 6 pgs. |
U.S. Appl. No. 10/609,315, Notice of Allowance mailed May 30, 2007, 4 pgs. |
U.S. Appl. No. 10/804,326, Advisory Action mailed Feb. 21, 2008, 3 pgs. |
U.S. Appl. No. 10/804,326, Amendment and Response filed Feb. 11, 2008, 28 pgs. |
U.S. Appl. No. 10/804,326, Amendment and Response filed Jun. 10, 2008, 27 pgs. |
U.S. Appl. No. 10/804,326, Amendment and Response filed Mar. 16, 2007, 21 pgs. |
U.S. Appl. No. 10/804,326, Amendment and Response filed Mar. 9, 2009, 8 pgs. |
U.S. Appl. No. 10/804,326, Amendment and Response filed Sep. 7, 2007, 26 pgs. |
U.S. Appl. No. 10/804,326, Final Office Action mailed Dec. 11, 2007, 24 pgs. |
U.S. Appl. No. 10/804,326, Notice of Allowance mailed May 29, 2009, 8 pgs. |
U.S. Appl. No. 10/951,123, Advisory Action mailed Dec. 31, 2007, 3 pgs. |
U.S. Appl. No. 10/951,123, Amendment and Response filed Apr. 25, 2007, 15 pgs. |
U.S. Appl. No. 10/951,123, Amendment and Response filed Apr. 6, 2009, 18 pgs. |
U.S. Appl. No. 10/951,123, Amendment and Response filed Dec. 13, 2007, 10 pgs. |
U.S. Appl. No. 10/951,123, Amendment and Response filed Jan. 14, 2008, 10 pgs. |
U.S. Appl. No. 10/951,123, Amendment and Response filed Sep. 17, 2008, 15 pgs. |
U.S. Appl. No. 10/951,123, Final Office Action mailed Jan. 5, 2009, 23 pgs. |
U.S. Appl. No. 10/951,123, Final Office Action mailed Jul. 13, 2007, 15 pgs. |
U.S. Appl. No. 10/951,123, Notice of Allowance mailed Jun. 25, 2009, 5 pgs. |
U.S. Appl. No. 10/951,123, Office Action mailed Jan. 25, 2007, 16 pgs. |
U.S. Appl. No. 10/951,123, Office Action mailed Mar. 18, 2008, 20 pgs. |
U.S. Appl. No. 10/955,462 Amendment and Response filed Aug. 8, 2007, 21 pgs. |
U.S. Appl. No. 10/955,462 Amendment and Response filed Mar. 10, 2008, 17 pgs. |
U.S. Appl. No. 10/955,462 Amendment and Response filed Mar. 5, 2007, 18 pgs. |
U.S. Appl. No. 10/955,462 Notice of Allowance mailed Feb. 24, 2009, 7 pgs. |
U.S. Appl. No. 10/955,462 Notice of Allowance mailed Jan. 25, 2010, 6 pgs. |
U.S. Appl. No. 10/955,462 Notice of Allowance mailed Jun. 10, 2009, 6 pgs. |
U.S. Appl. No. 10/955,462 Notice of Allowance mailed Jun. 17, 2008, 12 pgs. |
U.S. Appl. No. 10/955,462 Notice of Allowance mailed Oct. 16, 2009, 7 pgs. |
U.S. Appl. No. 10/955,462 Notice of Allowance mailed Sep. 23, 2008, 6 pgs. |
U.S. Appl. No. 10/955,983, Amendment and Response filed Aug. 22, 2007, 13 pgs. |
U.S. Appl. No. 10/955,983, Amendment and Response filed Mar. 18, 2009, 18 pgs. |
U.S. Appl. No. 10/955,983, Amendment and Response filed May 13, 2008, 14 pgs. |
U.S. Appl. No. 10/955,983, Amendment and Response filed Oct. 13, 2009, 12 pgs. |
U.S. Appl. No. 10/955,983, Amendment and Response filed Sep. 25, 2008, 13 pgs. |
U.S. Appl. No. 10/955,983, Notice of Allowance mailed Jan. 12, 2010, 10 pgs. |
U.S. Appl. No. 10/955,983, Notice of Allowance mailed Jun. 4, 2010, 5 pgs. |
U.S. Appl. No. 10/956,891, Advisory Action mailed Mar. 21, 2008, 3 pgs. |
U.S. Appl. No. 10/956,891, Amendment and Response filed Aug. 22, 2007, 11 pgs. |
U.S. Appl. No. 10/956,891, Amendment and Response filed Jun. 1, 2009, 12 pgs. |
U.S. Appl. No. 10/956,891, Amendment and Response filed Mar. 3, 2008, 11 pgs. |
U.S. Appl. No. 10/956,891, Amendment and Response filed May 1, 2008, 11 pgs. |
U.S. Appl. No. 10/956,891, Amendment and Response filed Oct. 16, 2008, 12 pgs. |
U.S. Appl. No. 10/956,891, Final Office Action filed Nov. 1, 2007, 18 pgs. |
U.S. Appl. No. 10/956,891, Final Office Action mailed Dec. 31, 2008, 16 pgs. |
U.S. Appl. No. 10/956,891, Notice of Allowance mailed Aug. 20, 2009, 7 pgs. |
U.S. Appl. No. 10/956,891, Office Action mailed Jul. 16, 2008, 19 pgs. |
U.S. Appl. No. 10/956,891, Office Action mailed Mar. 22, 2007, 15 pgs. |
U.S. Appl. No. 10/959,330, Amendment and Response filed Jan. 6, 2006, 10 pgs. |
U.S. Appl. No. 10/959,330, Amendment and Response filed Sep. 14, 2005, 12 pgs. |
U.S. Appl. No. 10/959,330, Notice of Allowance mailed Apr. 3, 2006, 6 pgs. |
U.S. Appl. No. 10/959,330, Office Action mailed Dec. 14, 2005, 6 pgs. |
U.S. Appl. No. 10/959,330, Office Action mailed Jun. 27, 2005, 10 pgs. |
U.S. Appl. No. 10/968,716, Amendment and Response filed Aug. 13, 2007, 6 pgs. |
U.S. Appl. No. 10/968,716, Amendment and Response filed Jan. 25, 2008, 8 pgs. |
U.S. Appl. No. 10/968,716, Amendment and Response filed Jun. 15, 2007, 13 pgs. |
U.S. Appl. No. 10/968,716, Notice of Allowance mailed Jun. 2, 2008, 8 pgs. |
U.S. Appl. No. 10/968,716, Office Action mailed Mar. 15, 2007, 13 pgs. |
U.S. Appl. No. 10/968,716, Office Action mailed Oct. 26, 2007, 14 pgs. |
U.S. Appl. No. 10/981,962, Advisory Action mailed Jan. 23, 2007, 3 pgs. |
U.S. Appl. No. 10/981,962, Amendment and Response filed Aug. 18, 2008, 10 pgs. |
U.S. Appl. No. 10/981,962, Amendment and Response filed Feb. 7, 2007, 1 pg. |
U.S. Appl. No. 10/981,962, Amendment and Response filed Jul. 27, 2007, 16 pgs. |
U.S. Appl. No. 10/981,962, Amendment and Response filed Jun. 27, 2006, 23 pgs. |
U.S. Appl. No. 10/981,962, Amendment and Response filed Nov. 27, 2007, 10 pgs. |
U.S. Appl. No. 10/981,962, Notice of Allowance mailed Aug. 20, 2009, 6 pgs. |
U.S. Appl. No. 10/981,962, Notice of Allowance mailed Jan. 29, 2009, 6 pgs. |
U.S. Appl. No. 10/981,962, Notice of Allowance mailed Jan. 9, 2009, 6 pgs. |
U.S. Appl. No. 10/981,962, Notice of Allowance mailed May 8, 2009, 6 pgs. |
U.S. Appl. No. 10/981,962, Notice of Allowance mailed Oct. 15, 2008, 6 pgs. |
U.S. Appl. No. 10/981,962, Notice of Allowance mailed Sep. 11, 2008, 14 pgs. |
U.S. Appl. No. 10/981,962, Office Action mailed Nov. 13, 2007, 3 pgs. |
U.S. Appl. No. 11/019,091, Amendment and Response filed Dec. 20, 2007, 23 pgs. |
U.S. Appl. No. 11/019,091, Amendment and Response filed Jun. 11, 2009, 12 pgs. |
U.S. Appl. No. 11/019,091, Amendment and Response filed Nov. 30, 2009, 11 pgs. |
U.S. Appl. No. 11/019,091, Amendment and Response filed Oct. 3, 2008, 15 pgs. |
U.S. Appl. No. 11/019,091, Notice of Allowance mailed Dec. 23, 2009, 16 pgs. |
U.S. Appl. No. 11/022,054, Amendment and Response filed Aug. 24, 2007, 19 pgs. |
U.S. Appl. No. 11/022,054, Notice of Allowance mailed Nov. 15, 2007, 10 pgs. |
U.S. Appl. No. 11/022,054, Office Action mailed Jun. 19, 2007, 19 pgs. |
U.S. Appl. No. 11/073,381, Amendment and Response filed Dec. 13, 2010, 10 pgs. |
U.S. Appl. No. 11/073,381, Amendment and Response filed Dec. 28, 2009, 9 pgs. |
U.S. Appl. No. 11/073,381, Amendment and Response filed Dec. 9, 2008, 11 pgs. |
U.S. Appl. No. 11/073,381, Amendment and Response filed Jul. 15, 2009, 10 pgs. |
U.S. Appl. No. 11/073,381, Amendment and Response filed Jul. 9, 2010, 10 pgs. |
U.S. Appl. No. 11/073,381, Amendment and Response filed Mar. 18, 2008, 14 pgs. |
U.S. Appl. No. 11/206,286, Amendment and Response filed Jul. 22, 2009, 3 pgs. |
U.S. Appl. No. 11/206,286, Amendment and Response filed Mar. 24, 2009, 13 pgs. |
U.S. Appl. No. 11/206,286, Amendment and Response filed Sep. 30, 2008, 11 pgs. |
U.S. Appl. No. 11/206,286, Notice of Allowance mailed Apr. 22, 2009, 9 pgs. |
U.S. Appl. No. 11/231,955, filed Sep. 21, 2005, Amendment and Response filed Apr. 30, 2008, 12 pgs. |
U.S. Appl. No. 11/231,955, filed Sep. 21, 2005, Amendment and Response filed Sep. 15, 2008, 16 pgs. |
U.S. Appl. No. 11/231,955, filed Sep. 21, 2005, Final Office Action mailed Jun. 4, 2008, 8 pgs. |
U.S. Appl. No. 11/231,955, filed Sep. 21, 2005, Notice of Allowance mailed Oct. 21, 2008, 5 pgs. |
U.S. Appl. No. 11/231,955, filed Sep. 21, 2005, Office Action mailed Jan. 30, 2008, 8 pgs. |
U.S. Appl. No. 11/238,906, Amendment and Response filed Feb. 26, 2009, 9 pgs. |
U.S. Appl. No. 11/238,906, Amendment and Response filed Jun. 9, 2008, 10 pgs. |
U.S. Appl. No. 11/238,906, Amendment and Response filed May 28, 2010, 9 pgs. |
U.S. Appl. No. 11/238,906, Amendment and Response filed Sep. 1, 2009, 9 pgs. |
U.S. Appl. No. 11/238,906, Notice of Allowance mailed Aug. 5, 2010, 4 pgs. |
U.S. Appl. No. 11/238,906, Notice of Allowance mailed Jul. 22, 2010, 10 pgs. |
U.S. Appl. No. 11/412,723, Amendment and Response filed Jun. 23, 2009, 11 pgs. |
U.S. Appl. No. 11/412,723, Amendment and Response filed May 31, 2010, 11 pgs. |
U.S. Appl. No. 11/412,723, Amendment and Response filed Nov. 26, 2008, 10 pgs. |
U.S. Appl. No. 11/412,723, Amendment and Response filed Nov. 30, 2009, 10 pgs. |
U.S. Appl. No. 11/412,723, Notice of Allowance mailed Jul. 9, 2010, 10 pgs. |
U.S. Appl. No. 11/874,579 filed Oct. 18, 2007, Amendment and Response filed May 16, 2011, 14 pgs. |
U.S. Appl. No. 11/874,579, filed Oct. 18, 2007, Amendment and Response filed Dec. 10, 2013, 17 pgs. |
U.S. Appl. No. 11/874,579, filed Oct. 18, 2007, Amendment and Response filed Nov. 22, 2010, 8 pgs. |
U.S. Appl. No. 11/874,579, Office Action mailed Mar. 28, 2014, 30 pgs. |
U.S. Appl. No. 11/874,579, Office Action mailed Sep. 10, 2013, 27 pgs. |
U.S. Appl. No. 11/874,844, Amendment and Response filed Mar. 15, 2010, 16 pgs. |
U.S. Appl. No. 11/874,844, Notice of Allowance mailed Jun. 25, 2010, 2 pgs. |
U.S. Appl. No. 11/874,844, Notice of Allowance mailed May 18, 2010, 9 pgs. |
U.S. Appl. No. 12/207,910, Amendment and Response filed Mar. 12, 2012, 13 pgs. |
U.S. Appl. No. 12/207,910, Amendment and Response filed Sep. 7, 2011, 14 pgs. |
U.S. Appl. No. 12/207,910, Notice of Allowance mailed Apr. 16, 2014, 19 pgs. |
U.S. Appl. No. 12/207,910, Office Action mailed Dec. 12, 2011, 27 pgs. |
U.S. Appl. No. 12/359,939 filed Jan. 26, 2009, Amendment and Response filed Oct. 26, 2012, 11 pgs. |
U.S. Appl. No. 12/359,939, Amendment and Response filed Mar. 11, 2014, 10 pgs. |
U.S. Appl. No. 12/359,939, Amendment and Response filed Mar. 23, 2012, 11 pgs. |
U.S. Appl. No. 12/359,939, filed Jan. 26, 2009, Amendment and Response filed Jul. 21, 2011, 8 pgs. |
U.S. Appl. No. 12/359,939, filed Jan. 26, 2009, Amendment and Response filed May 23, 2011, 8 pgs. |
U.S. Appl. No. 12/359,939, filed Jan. 26, 2009, Amendment and Response filed Nov. 29, 2012, 9 pgs. |
U.S. Appl. No. 12/359,939, filed Jan. 26, 2009, Amendment and Response filed Sep. 28, 2011, 14 pgs. |
U.S. Appl. No. 12/359,939, filed Jan. 26, 2009, Office Action mailed Dec. 6, 2011, 14 pgs. |
U.S. Appl. No. 12/359,939, filed Jan. 26, 2009, Office Action mailed Jan. 21, 2011, 15 pgs. |
U.S. Appl. No. 12/359,939, Office Action mailed Apr. 9, 2014, 18 pgs. |
U.S. Appl. No. 12/359,939, Office Action mailed Jan. 2, 2014, 18 pgs. |
U.S. Appl. No. 12/359,939, Office Action mailed Jul. 17, 2012, 21 pgs. |
U.S. Appl. No. 12/359,939, Office Action mailed Jun. 17, 2013, 19 pgs. |
U.S. Appl. No. 12/359,939, Office Action mailed Oct. 11, 2013, 11 pgs. |
U.S. Appl. No. 12/569,028, Amendment and Response filed Aug. 2, 2013, 17 pgs. |
U.S. Appl. No. 12/569,028, Amendment and Response filed Dec. 28, 2011, 8 pgs. |
U.S. Appl. No. 12/569,028, Amendment and Response filed Jan. 15, 2013, 14 pgs. |
U.S. Appl. No. 12/569,028, Amendment and Response filed Jan. 28, 2014, 13 pgs. |
U.S. Appl. No. 12/569,028, Amendment and Response filed Jun. 27, 2012, 8 pgs. |
U.S. Appl. No. 12/569,028, Notice of Allowance mailed Feb. 21, 2014, 8 pgs. |
U.S. Appl. No. 12/569,028, Office Action mailed Apr. 2, 2013, 21 pgs. |
U.S. Appl. No. 12/569,028, Office Action mailed Aug. 28, 2013, 21 pgs. |
U.S. Appl. No. 12/569,028, Office Action mailed Feb. 27, 2012, 11 pgs. |
U.S. Appl. No. 12/569,028, Office Action mailed Oct. 15, 2012, 14 pgs. |
U.S. Appl. No. 12/569,028, Office Action mailed Sep. 28, 2011, 14 pgs. |
U.S. Appl. No. 12/791,756, Amendment and Response after Allowance filed Apr. 4, 2014, 3 pgs. |
U.S. Appl. No. 12/791,756, Amendment and Response filed Apr. 30, 2012, 12 pgs. |
U.S. Appl. No. 12/791,756, Amendment and Response filed Dec. 24, 2103, 19 pgs. |
U.S. Appl. No. 12/791,756, Amendment and Response filed Sep. 26, 2012, 14 pgs. |
U.S. Appl. No. 12/791,756, Notice of Allowance mailed Feb. 7, 2014, 10 pgs. |
U.S. Appl. No. 12/791,756, Office Action mailed Jan. 31, 2012, 18 pgs. |
U.S. Appl. No. 12/791,756, Office Action mailed Jun. 26, 2012, 26 pgs. |
U.S. Appl. No. 12/791,756, Office Action mailed Oct. 3, 2013, 32 pgs. |
U.S. Appl. No. 12/828,508, Amendment and Response filed Jan. 13, 2011, 11 pgs. |
U.S. Appl. No. 12/828,508, Amendment and Response filed Sep. 6, 2011, 3 pgs. |
U.S. Appl. No. 12/828,508, filed Jul. 1, 2010 entitled "System and Method for Ranking Search Results Using Click Distance". |
U.S. Appl. No. 12/828,508, Notice of Allowance mailed Jul. 6, 2011, 8 pgs. |
U.S. Appl. No. 12/828,508, Notice of Allowance mailed Mar. 31, 2011, 9 pgs. |
U.S. Appl. No. 13/360,536, filed Jan. 27, 2012 entitled "Re-Ranking Search Results". |
U.S. Appl. No. 13/360,536, Office Action mailed Mar. 20, 2014, 14 pgs. |
U.S. Official Action in U.S. Appl. No. 10/609,315 mailed Dec. 15, 2005, 13 pgs. |
U.S. Official Action in U.S. Appl. No. 10/609,315 mailed Jun. 1, 2006, 12 pgs. |
U.S. Official Action in U.S. Appl. No. 10/804,326 mailed Dec. 10, 2008, 7 pgs. |
U.S. Official Action in U.S. Appl. No. 10/804,326 mailed Jun. 7, 2007, 19 pgs. |
U.S. Official Action in U.S. Appl. No. 10/804,326 mailed Oct. 16, 2006, 18 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,462 mailed May 11, 2007, 26 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,462 mailed Nov. 3, 2006, 19 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,462 mailed Sep. 10, 2007, 22 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,983 mailed Dec. 18, 2008, 29 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,983 mailed Jul. 21, 2008, 28 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,983 mailed Jun. 10, 2009, 30 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,983 mailed Mar. 22, 2007, 25 pgs. |
U.S. Official Action in U.S. Appl. No. 10/955,983 mailed Nov. 13, 2007, 27 pgs. |
U.S. Official Action in U.S. Appl. No. 10/981,962 mailed Apr. 30, 2007, 21 pgs. |
U.S. Official Action in U.S. Appl. No. 10/981,962 mailed Apr. 5, 2006, 15 pgs. |
U.S. Official Action in U.S. Appl. No. 10/981,962 mailed Mar. 17, 2008, 20 pgs. |
U.S. Official Action in U.S. Appl. No. 10/981,962 mailed Sep. 21, 2006, 16 pgs. |
U.S. Official Action in U.S. Appl. No. 11/019,091 mailed Apr. 3, 2008. |
U.S. Official Action in U.S. Appl. No. 11/019,091 mailed Dec. 11, 2008, 24 pgs. |
U.S. Official Action in U.S. Appl. No. 11/019,091 mailed Jun. 20, 2007. |
U.S. Official Action in U.S. Appl. No. 11/019,091 mailed Sep. 1, 2009, 26 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Apr. 12, 2010, 25 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Apr. 15, 2009, 20 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Feb. 23, 2011, 27 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Jul. 10, 2008, 19 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Sep. 13, 2010, 24 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Sep. 18, 2007, 17 pgs. |
U.S. Official Action in U.S. Appl. No. 11/073,381 mailed Sep. 29, 2009, 21 pgs. |
U.S. Official Action in U.S. Appl. No. 11/206,286 mailed Dec. 24, 2008, 16 pgs. |
U.S. Official Action in U.S. Appl. No. 11/206,286 mailed Jul. 14, 2008, 15 pgs. |
U.S. Official Action in U.S. Appl. No. 11/238,906 mailed Dec. 18, 2009, 21 pgs. |
U.S. Official Action in U.S. Appl. No. 11/238,906 mailed Jan. 8, 2008, 18 pgs. |
U.S. Official Action in U.S. Appl. No. 11/238,906 mailed May 19, 2009, 20 pgs. |
U.S. Official Action in U.S. Appl. No. 11/238,906 mailed Sep. 16, 2008, 17 pgs. |
U.S. Official Action in U.S. Appl. No. 11/412,723 mailed Mar. 11, 2010, 20 pgs. |
U.S. Official Action in U.S. Appl. No. 11/412,723 mailed Mar. 6, 2009, 22 pgs. |
U.S. Official Action in U.S. Appl. No. 11/412,723 mailed May 28, 2008, 22 pgs. |
U.S. Official Action in U.S. Appl. No. 11/412,723 mailed Sep. 3, 2009, 20 pgs. |
U.S. Official Action in U.S. Appl. No. 11/874,579 mailed Jan. 14, 2011, 23 pgs. |
U.S. Official Action in U.S. Appl. No. 11/874,579 mailed Jun. 22, 2010, 23 pgs. |
U.S. Official Action in U.S. Appl. No. 11/874,844 mailed Nov. 13, 2009, 14 pgs. |
U.S. Official Action in U.S. Appl. No. 12/207,910 mailed Jun. 7, 2011, 30 pgs. |
U.S. Official Action in U.S. Appl. No. 12/828,508 mailed Aug. 13, 2010, 16 pgs. |
Utiyama, Masao et al., "Implementation of an IR package", IPSJ SIG Notes, vol. 2001, No. 74 (2001-FI-63-8), pp. 57-64, Information Processing Society of Japan, Japan, Jul. 25, 2001. (not an English document). |
Voorhees, E., "Overview of TREC 2002", Gaithersburg, Maryland, Nov. 19-22, 15 pp. |
Web Page "Reuters: Reuters Corpus", http://about.reuter.com/researchandstandards/corpus/, viewed Mar. 18, 2004. |
Wen, Ji-Rong, "Query Clustering Using User Logs", Jan. 2002, pp. 59-81. |
Westerveld, T. et al., "Retrieving Web pages using Content, Links, URLs and Anchors", Proceedings of the Tenth Text Retrieval Conference, NIST Special Publication, 'Online! Oct. 2001, pp. 1-10. |
Wilkinson, R., "Effective Retrieval of Structured Documents", Annual ACM Conference on Research and Development, 1994, 7 pp. |
Xue, Gui-Rong et al., "Optimizing Web Search Using Web Click-Through Data," http://people.cs.vt.edu/˜xwensi/Publication/p118-xue.pdf, CIKM'04, Nov. 8-13, 2004, 9 pages. |
Yi, Jeonghe,e et al., "Metadata Based Web Mining for Topic-Specific Information Gathering", IEEE, pp. 359-368, 2000. |
Yi, Jeonghee, et al., "Using Metadata to Enhance Web Information Gathering", D.Suciu and G. Vossen (eds.): WebDB 2000, LNCS 1997, pp. 38-57, 2001. |
Yuwono, Budi and Lee, Dik L., "Search and Ranking Algorithms for Locating Resources on the World Wide Web", IEEE, 1996, pp. 164-170. |
Zamir, O. et al., "Grouper: A Dynamic Clustering Interface to Web Search Results", Computer Networks (Amsterdam, Netherlands: 1999), 31(11-16): 1361-1374, 1999. |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130262983A1 (en) * | 2012-03-30 | 2013-10-03 | Bmenu As | System, method, software arrangement and computer-accessible medium for a generator that automatically identifies regions of interest in electronic documents for transcoding |
US9535888B2 (en) * | 2012-03-30 | 2017-01-03 | Bmenu As | System, method, software arrangement and computer-accessible medium for a generator that automatically identifies regions of interest in electronic documents for transcoding |
US10650191B1 (en) | 2018-06-14 | 2020-05-12 | Elementary IP LLC | Document term extraction based on multiple metrics |
US20220159130A1 (en) * | 2020-11-18 | 2022-05-19 | Canon Kabushiki Kaisha | Information processing apparatus, information processing method, and non-transitory storage medium |
US11637937B2 (en) * | 2020-11-18 | 2023-04-25 | Canon Kabushiki Kaisha | Information processing apparatus, information processing method, and non-transitory storage medium |
RU2821294C2 (en) * | 2021-10-18 | 2024-06-19 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for ranking set of documents from search result |
Also Published As
Publication number | Publication date |
---|---|
BRPI0909092A2 (en) | 2019-02-26 |
JP2011516989A (en) | 2011-05-26 |
TW200945079A (en) | 2009-11-01 |
TWI486800B (en) | 2015-06-01 |
ZA201006093B (en) | 2011-10-26 |
RU2501078C2 (en) | 2013-12-10 |
KR20110009098A (en) | 2011-01-27 |
AU2009234120A1 (en) | 2009-10-15 |
JP5492187B2 (en) | 2014-05-14 |
AU2009234120B2 (en) | 2014-05-22 |
IL207830A0 (en) | 2010-12-30 |
CN101990670A (en) | 2011-03-23 |
EP2289007B1 (en) | 2015-04-22 |
RU2010141559A (en) | 2012-04-20 |
EP2289007A4 (en) | 2012-10-31 |
US20090259651A1 (en) | 2009-10-15 |
EP2289007A1 (en) | 2011-03-02 |
KR101557294B1 (en) | 2015-10-06 |
CN101990670B (en) | 2013-12-18 |
IL207830A (en) | 2015-03-31 |
WO2009126394A1 (en) | 2009-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8812493B2 (en) | Search results ranking using editing distance and document information | |
KR101190230B1 (en) | Identification of phrases in IR systems | |
JP4944405B2 (en) | Phrase-based indexing method in information retrieval system | |
JP5175005B2 (en) | Phrase-based search method in information search system | |
JP4944406B2 (en) | How to generate document descriptions based on phrases | |
US8051073B2 (en) | System and method for measuring the quality of document sets | |
US7912816B2 (en) | Adaptive archive data management | |
US20060294100A1 (en) | Ranking search results using language types | |
US7024405B2 (en) | Method and apparatus for improved internet searching | |
US8423885B1 (en) | Updating search engine document index based on calculated age of changed portions in a document | |
US20110208715A1 (en) | Automatically mining intents of a group of queries | |
US8103686B2 (en) | Extracting similar entities from lists/tables | |
Jain et al. | Building query optimizers for information extraction: the sqout project | |
Aung et al. | To construct implicit link structure by using frequent sequence miner (fs-miner) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: MICROSOFT CORPORATION, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TANKOVICH, VLADIMIR;LI, HANG;MEYERZON, DMITRIY;AND OTHERS;REEL/FRAME:020793/0017;SIGNING DATES FROM 20080401 TO 20080410 Owner name: MICROSOFT CORPORATION, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TANKOVICH, VLADIMIR;LI, HANG;MEYERZON, DMITRIY;AND OTHERS;SIGNING DATES FROM 20080401 TO 20080410;REEL/FRAME:020793/0017 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:034564/0001 Effective date: 20141014 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551) Year of fee payment: 4 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |