US20070055680A1 - Method and system for creating a taxonomy from business-oriented metadata content - Google Patents
Method and system for creating a taxonomy from business-oriented metadata content Download PDFInfo
- Publication number
- US20070055680A1 US20070055680A1 US11/494,974 US49497406A US2007055680A1 US 20070055680 A1 US20070055680 A1 US 20070055680A1 US 49497406 A US49497406 A US 49497406A US 2007055680 A1 US2007055680 A1 US 2007055680A1
- Authority
- US
- United States
- Prior art keywords
- metadata
- taxonomy
- business
- content
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
Definitions
- the present invention relates to a metadata content management and searching system and method, especially to a method and system for creating a taxonomy from business-oriented metadata content.
- CPM Corporate Performance Management
- BI Business Intelligence
- search engines that use a full-text index combined with statistical methods to create ordered search results.
- An example of such a search engine is page ranking that is described in U.S. Pat. No. 6,526,440 issued to Bharat.
- these search engines are not sufficient to search complex data like business oriented metadata since they rely on ranking algorithms that work with data found primarily in the Global Internet and not inside a business.
- taxonomies Some search engines use taxonomies to improve results. Creation of taxonomies has been carried out by a manual process or by an automated process based on advanced linguistic analysis. However, business taxonomies are difficult and expensive to build manually. Also, linguistic analysis are often complicated and thus prone to result in inaccurate outcome.
- the invention uses a content index to store knowledge base representations of terms in source documents containing business oriented metadata along with references to the original source metadata content that uses the terms.
- a method and system for building a searchable index corpus that includes taxonomy definitions (or topic hierarchies) obtained from the structure of business reporting metadata.
- a business taxonomy management system comprising an indexing engine, an index store and a taxonomy engine.
- the indexing engine is provided for indexing content of source business oriented metadata, the indexing engine having a content scanner for reading the business oriented metadata, defining taxonomy of the business oriented metadata, and building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata.
- the index store is provided for storing the content index of the business oriented metadata.
- the taxonomy engine is provided for providing taxonomy services to users using the content index.
- a method of managing business taxonomy comprises the steps of reading content of source business oriented metadata; defining taxonomy of the business oriented metadata; building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata; storing the content index of the business oriented metadata; and providing taxonomy services to users using the content index.
- a computer readable medium storing instructions or statements for use in the execution in a computer of a method of managing business taxonomy.
- the method comprises the steps of reading content of source business oriented metadata; defining taxonomy of the business oriented metadata; building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata; storing the content index of the business oriented metadata; and providing taxonomy services to users using the content index.
- a propagated signal carrier carrying signals containing computer executable instructions that can be read and executed by a computer, the computer executable instructions being used to execute a method of managing business taxonomy.
- the method comprises the steps of reading content of source business oriented metadata; defining taxonomy of the business oriented metadata; building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata; storing the content index of the business oriented metadata; and providing taxonomy services to users using the content index.
- FIG. 1 is a block diagram showing a metadata content management system in accordance with an embodiment of the present invention
- FIG. 2A is a block diagram showing an embodiment of the metadata content management system
- FIG. 2B is a block diagram showing an embodiment of a content index component
- FIG. 3 is a block diagram showing an embodiment of a taxonomy management system
- FIG. 4A is a flowchart showing an embodiment of the procedure for determining parent topic terms
- FIG. 4B is a flowchart showing an embodiment of the procedure for determining sibling topic terms
- FIG. 4C is a flowchart showing an embodiment of the procedure for determining child topic terms
- FIG. 5 is a diagram showing another embodiment of the taxonomy management system
- FIG. 6 is a diagram showing metadata and report values
- FIG. 7 is a diagram showing an example of a report and user interface display by the metadata content management system
- FIG. 8 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 9 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 10 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 11 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 12 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 13 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 14 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 15 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 16 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 17 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 18 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 19 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 20 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 21 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 22 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 23 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 24 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 25 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 26 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 27 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 28 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 29 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 30 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 31 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 32 is a diagram showing another example of a report and user interface display by the metadata content management system
- FIG. 33 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 34 is a diagram showing another example of a report and user interface display by the metadata content management system.
- FIG. 35 is a diagram showing another example of a report and user interface display by the metadata content management system.
- the metadata content management system 10 is suitably used for an enterprise or other organization that has sources of business oriented information, i.e., business oriented metadata 20 .
- the metadata content management system 10 interacts with the business oriented metadata 20 , as well as one or more search tools or components 30 and user reporting applications 40 used by the organization.
- An organization typically has untapped sources of information, e.g., business oriented metadata 20 including reporting metadata 21 and specifications and key report values 22 of the user reporting applications 40 .
- the business oriented metadata 20 includes OLAP and dimensional business data defined by the user reporting applications 40 . These information, metadata and values may be collectively called as business oriented metadata 20 in this specification.
- the metadata content management system 10 indexes the content of the business oriented metadata 20 . It analyzes the business oriented metadata 20 to create a search index. Since the search index is created from the organization's metadata 20 , it is suitable for the organization. By providing such a search index, the metadata content management system 10 promotes navigation between BI tools 30 and reporting applications 40 , creating a strategic view of CPM assets.
- the metadata content management system 10 captures application context, e.g., “viewing location” or “query parameters”, by creating the search index from the reporting metadata 21 .
- the search index created by the metadata content management system 10 enables many unique navigation options beyond traditional folder browsing and text searching.
- a typical organization has various data sources 39 , such as operational databases and/or data warehouses, and several CPM tools or user reporting applications 40 that create cubes and/or report specifications 41 and generate reports 42 .
- Reporting metadata 21 and associated values 22 are produced by those applications 40 .
- Other business oriented metadata may be exported from metadata modeling tools. While authoring reports in reporting applications 40 , the creation of new hierarchies and data definitions occur. These hierarchies and data definitions are useful for drilling and searching. This data is often more recognizable to end-users since this is the data or text that the users see in applications 40 and their reports 41
- These metadata and report data are considered as extended metadata 21 to describe the metadata created by different authoring and processing phases.
- Extended report data 22 refers to values created in a similar fashion.
- These extended metadata 21 and report data 22 can be viewed as new BI data or business oriented metadata 20 of the organization.
- the metadata content management system 10 leverages the new BI data 20 to provide searching and drilling that was previously unavailable in existing systems, as described below.
- Examples of extended metadata 21 added by the authoring process includes dimension names, dimension levels, category names, alternate category names, cube hierarchies, table and record names, group names, parent/child relationships between categories, groups or tables, authored drill target names, CPM tool's model entities such as packages, namespaces, query items, query sources and relevant authored relationships.
- Examples of extended authored report values 22 include items related by one of more dimensions, categories, measures groups or tables, calculated values, and annotations.
- a BI tool may provide dimensional business data, such as crosstable providing dimension, category and measure names. These names represent extended metadata 21 . These names may or may not match table/column names in a star schema or other relational model. Yet each of these names represents an important potential target for drilling or searching. Values stored in a cube, including calculated values, represent extended data or values 22 . They are a valuable target for searching. Like extended metadata 21 , many of these values 22 are not found in any other data store.
- reporting tool 40 may provide a report with columns.
- each of the column heading represents extended metadata 21 .
- the report grouping e.g., by country, represents another form of extended metadata 21 .
- Report values themselves represent extended report data 22 . They offer important linking and search targets.
- the extended metadata names are the same as those viewed by the report user. Thus, these extended metadata names are often most relevant and recognizable to the report user. Using these metadata names allows the metadata content management system 10 to provide information relevant and recognizable to the report user. These metadata names may or may not match the names used in the underlying databases.
- Authored links such as those anchored to the column name “Sales Rep Name”, provide additional summary information about the linked reports.
- This information also represents extended metadata 21 . This information allows the metadata content management system 10 to further increase search relevance about the destination content of the metadata 20 including the metadata 21 or report values 22 .
- the metadata content management system 10 indexes content of the business oriented metadata 20 and generates a content index or index corpus which is a searchable database of representations of the content of the business oriented metadata 20 , as further described below.
- Structured data is defined by a formal schema. Typically structured data is searched with utilities of Online Analytical Processing (OLAP), Structured Query Language (SQL) and eXtensible Markup Language (XML). Unstructured data is normally found in documents and static web pages. Typically unstructured data is searched using free-form queries with web tools, such as Google (TM).
- OLAP Online Analytical Processing
- SQL Structured Query Language
- XML eXtensible Markup Language
- Unstructured data is normally found in documents and static web pages.
- unstructured data is searched using free-form queries with web tools, such as Google (TM).
- the metadata content management system 10 provides searching functions over both types of data by using the content index of the business oriented metadata 20 .
- Structured data searches are used to implement report-to-report drilling. This includes listing selecting from multiple targets.
- the metadata content management system 10 typically uses its internal search engine or navigation and drill engine 14 ( FIG. 2A ) to handle structured data searches.
- Full-text searches are used to find reports for unstructured user queries.
- the metadata content management system 10 typically handles full-text search by interacting with external search tools 30 ( FIG. 1 ). For example, searches launched from Internet Information Services (IIS) or Portal text search tools are full-text searches.
- IIS Internet Information Services
- Portal text search tools are full-text searches.
- the content index provides various advantages.
- the metadata content management system 10 enhances search and drill-through capabilities across the range of user report applications 40 without requiring drill-through authoring in source content.
- a report author simply publishes target reports and lets the metadata content management system 10 find drill locations to the target content.
- the metadata content management system 10 organizes business oriented metadata content in ways that are more relevant and meaningful to users.
- the metadata content management system 10 also includes several personalization and administration options.
- the metadata content management system 10 describes data using names and labels from actual reports. These names are often more familiar and relevant to report users.
- the metadata content management system 10 also provides enhanced report-to-report drilling and product-to-product navigation. It expands the number of places where report users can “drill-to” and “drill-from” in a report. Most drilling requires no advance authoring.
- the metadata content management system 10 improves the capabilities of search tools. This includes the concept of ‘federated’ search across a variety of portal and web search indices.
- User reporting applications 40 often generate authored relational and OLAP reports. Those reports provide a wealth of new metadata, including schema information, that is largely hidden from other tools and reporting applications.
- the metadata content management system 10 exposes this metadata in a standard format that can be re-used by other CPM applications 40 and tools 30 .
- FIG. 2A shows an embodiment of the metadata content management system 10 .
- the metadata content management system 10 has a content index component 12 , navigation and drill engine 14 and tools 16 .
- the metadata content management system 10 uses indexing so that the metadata content can be searched and organized in real-time. Indexing is normally performed by the metadata content management system 10 when the metadata content is published or updated. Indexing can be performed by a scheduled administrator task (example: nightly cron job). It can also be performed manually by an administrator or user.
- the content index component 12 has an indexing engine 80 and an Index store 82 .
- the index store 82 stores files for content index 90 .
- the content index 90 may also be called an index corpus or knowledge base.
- the content index 90 is a full-text index.
- the indexing engine 80 performs indexing of the content of the business oriented metadata 20 for a particular organization. It analyzes the content of the business oriented metadata 20 and creates indexes as described below. Since it creates indexes from the business oriented metadata of the organization, the created indexes are suitable for the organization.
- a single set of index files is typically maintained in the index store 82 in the content index component 12 for all users and user groups for the organization.
- the metadata content management system 10 can provide optimal or improved performance.
- the index store 82 may be part of a server file system of the organization.
- a content index 90 is a collection of content indexes.
- the content index 90 is a concordance of unique words (called terms) across scanned or indexed content items (called documents).
- Each content index contains an entry for each term across the indexed documents.
- Each context index catalogs individual words or terms and stores them along with their usage or other data.
- Each indexed content term contains a list of the indexed documents that have that term.
- Each indexed content term also contains usage statistics and the position of the term within each indexed document where possible.
- a content index is an “inverted index” where each indexed term refers to a list of documents that have the indexed term, rather than each indexed document contains a list of terms as in traditional indexes.
- the content index 90 provides term searches and links to additional data stored in the content index 90 .
- Each content index may contain, for each content, i.e., target item, information regarding the name or identification of the target item; module, cube or report metadata and their relevant metadata hierarchy; item location in the document folder hierarchy; and/or reference to its dependent model.
- a content index may be an XML content index that describes each indexed item in XML.
- An XML content index stores applicable metadata, metrics and planning information that improve search relevance.
- Each XML content index is associated with each indexed document.
- An indexed document is an XML file that catalogs metadata, report values and other reporting application-specific information.
- the XML content index items or data are stored in flat files in the index store 82 .
- the index store 82 may be the application server's file system.
- a relational database can optionally be configured to store this XML content index data. “Read” activity related to XML content index items is low compared to typical full-text index items. Records of XML content index items are read by search tools 30 after a list of preliminary search results has been made by the navigation & drill engine 14 , as described below.
- FIG. 2B shows the index store 82 within the content index component 12
- the index store 82 may or may not be part of the metadata content management system 10 .
- the content index 90 also includes a taxonomy or subject index 94 .
- the subject index 94 may also be called a subject hierarchy, topic hierarchy, topic tree or subject dictionary.
- the subject index 94 is a collection of indexes, each being a file-based index extension that allows subject hierarchies or taxonomies to be quickly queried.
- the subject index 94 allows searches of parent topic names for a given term, as further described below.
- the navigation and drill engine 14 is a server component that analyses each user's “context” within their active reporting application 40 .
- the navigation and drill engine 14 processes drill requests and text searches.
- the navigation and drill engine 14 extracts unique words or terms from the search context of the request.
- An example of search bar “text” and the resultant search context is shown in FIG. 15 .
- the navigation and drill engine 14 passes these unique terms to a full-text search engine, which returns a list of matching documents.
- the metadata content management system 10 may have a full-text search engine, or may use an external full-text search engine.
- the navigation and drill engine 14 may apply a security check to this list of results returned thus far.
- the navigation and drill engine 14 may send batch or grouped security queries to the appropriate target reporting application 40 . Denied items are removed from the results list.
- the navigation and drill engine 14 may also score the documents using the content index entry for each item.
- the content index entry may be retrieved from a flat file storing the relevant content index or the linked relational database.
- the navigation and drill engine 14 may use the original source context to establish the user's position with the reporting application 40 , or to establish the subject/term relationship of the request.
- the navigation and drill engine 14 may apply several queries and optimizations over the index content to select relevant content, such as cubes with best dimension and category matches, reports with best matching of columns, groupings and/or values, and/or prompted reports with the best matching prompts.
- the navigation and drill engine 14 automatically generates dynamics filters that navigate the user into a matching cube. Also, the navigation and drill engine 14 automatically generates prompt answers for prompted reports, using any item with a significant number of matching terms, and priority given to those terms listed in the taxonomy of subjects/terms.
- the navigation and drill engine 14 returns the drilling results to the reporting application 40 .
- An example is shown in FIG. 16 .
- the results may be sorted by relevance (score) or by groups, such as “reports with matching prompts” or “cubes with matching dimensions”.
- the navigation and drill engine 14 may be a Java-based server component.
- the navigation and drill engine 14 may use a unique security algorithm or method to ensure that users see only the results they are authorized to access.
- the metadata content management system 10 validates its search results against the referenced reporting application 40 . A user only sees items that the user has permission to access.
- Each reporting application 40 allows different levels of access.
- the navigation and drill engine 14 may send security queries to the relevant reporting applications 40 to determine security criteria set by the applications 40 .
- Tools 16 provide various features. Tools 16 allow integration with search tools 30 and portals. Administrators may access to the components of the metadata content management system 10 using tools 16 via a web browser.
- the metadata content management system 10 also has a taxonomy management system 50 .
- the metadata content management system 10 provides searchable metadata and report data in a form of knowledge base documents 54 ( FIG. 3 ) in the content index component 12 using the taxonomy management system 50 .
- the business taxonomy management system 50 is used for building a searchable content index or index corpus 90 that includes subject index 94 containing business taxonomy definitions obtained from the structure of business reporting metadata.
- a taxonomy is a hierarchy of topics or subjects.
- a taxonomy may also be called a subject index, subject hierarchy, topic hierarchy, topic tree and subject dictionary.
- the business taxonomy is used to classify terms and phrases.
- the taxonomy lets search components 30 find terms within a given subject or topic.
- the taxonomies improve many search engine functions including search results relevance, refinement of search criteria and creation of related business reporting content.
- the metadata content management system 10 can allow the content index 90 to dynamically produce subject index 94 , i.e., subject hierarchies, by searching the content index 90 for, e.g., “parent term relationships” of any word or phrase. This is possible because for a given word, the content index 90 stores references to its parent or parents. A word may have multiple derivations.
- Taxonomy An example of a taxonomy is described for a system in which the term “Cost” is used as a Measure with names: Billing Cost, Average Billing Cost, Average Billing Cost per Customer, Average Billing Cost per Product, and Actual Cost. Also, it is used as a Report Columns/Heading with names: Product Cost, Planned Total Cost, and Cost of Goods Sold. In a “taxonomy-aware” system, any of these subjects can be used to help find more relevant results for the otherwise ambiguous term “Cost”.
- the business taxonomy management system 50 comprises three main components: content scanner 52 , knowledge-base documents 54 , and taxonomy engine 56 .
- the content scanner 52 may be part of the indexing engine 80 .
- the knowledge-base documents 54 form the content index 90 , which in turn is used to define the subject index 94 .
- External components which interact with the main components are business reporting metadata 21 , full-text index and search component 32 , and end-users or reporting applications 40 that provide search terms and consume taxonomy responses. While FIG. 3 shows business reporting metadata 21 only as source metadata, the taxonomy management system 50 may also use other business oriented metadata 20 .
- FIG. 3 shows the flow of information between these components.
- the business taxonomy management system 50 uses the structure of business reporting metadata 21 to create a living, de facto taxonomy definition of topics for a given business entity or organization, as described below. This taxonomy defines how terms are used in the business by the organization.
- the business reporting metadata 21 is metadata that exists anywhere in a given business or organization.
- the business reporting metadata 21 typically includes OLAP and dimensional business data.
- the business taxonomy management system 50 obtains the business topic structure from such OLAP and dimensional business data.
- the business reporting metadata 21 may be derived or extracted from reports and other documents as described above.
- Reports and other documents are metadata documents, i.e., documents containing business oriented metadata, that define query, layout, labeling and annotation of other content. Examples of metadata documents include business reporting and analysis metadata documents authored with report authoring and creation tools, such as business intelligence application suites; business modeling and optimization metadata documents; budgeting, planning and forecasting metadata documents; and financial consolidation metadata documents.
- the source business content or business reporting metadata 21 contains topic hierarchies.
- the business taxonomy management system 50 extracts information of the structure of the topic hierarchies from the source business reporting metadata 21 .
- the business taxonomy management system 50 creates taxonomy definition of topics automatically without human intervention based on the structure of the topic hierarchies in the business reporting metadata 21 . It uses a deterministic algorithm or method that provides reliable results without the need for complicated and potentially inaccurate linguistic analysis. For example, the business taxonomy management system 50 performs parent subject determination by examining “is a” relationships (i.e., parent/child relationships) found in the topic hierarchies that exist in the source business content 21 .
- the business taxonomy management system 50 stores and maintains the created taxonomy definition of topics, e.g., parent/child relationships, in knowledge base documents 54 .
- the business taxonomy management system 50 uses simple linguistics common processes for synonyms, stemming and morphology. For synonyms, the business taxonomy management system 50 changes word to more common or popular equivalents, e.g., obtuse becomes veiled. For stemming, the business taxonomy management system 50 removes plurals and other word extensions to find base words, e.g., accessories becomes accessory. For morphology, the business taxonomy management system 50 changes words to find more common embodiments, e.g., acceleration is a common embodiment of to accelerate. The business taxonomy management system 50 does not use any complicated and unreliable linguistics processes.
- the full-text index and search component 32 uses a full text index 34 .
- the full text index 34 is a concordance of terms across the scanned or indexed documents. An entry is made for each scanned word (excepting, e.g., stop words which are too common to be useful) that lists the exact position of each occurrence of the word within corpus of documents. From such a list, it is relatively simple to retrieve all the documents that match a query, without having to scan each document.
- the full-text index and search component 32 provides users and applications 40 with interfaces to build and search its full text index 34 .
- Users or applications 40 are consumers of the taxonomy management system 50 . Users or applications 40 may be referred to as operators hereinafter.
- the user applications 40 may be web browsers.
- the content scanner component 52 reads documents containing business reporting metadata 21 . It builds a knowledge-base representation of the business reporting metadata 21 with one or more of details, such as the unique document identifier, document date, structured hierarchy of reporting elements from the source document, database queries used in each structured reporting element, and linkages to other structured reporting elements in this document and other business reporting metadata documents.
- Typical examples of the structured hierarchy of reporting elements from the source document include data grouping, headings and labels, and other structure information often available in OLAP and dimensional business data.
- Each reporting element provides an identification, e.g., a display name, used in the reporting elements produced by the source metadata.
- the content scanner 52 builds or updates a knowledge-base document 54 for each source metadata document containing business reporting metadata 21 .
- a knowledge base document 54 is used to store a knowledge base representation of each term in the each source metadata document along with references to content of the metadata 21 that uses the term.
- a knowledge-base document 54 is encoded in Extensible Markup Language (XML) and stored in system data files.
- XML Extensible Markup Language
- any storage or encoding mechanism can be used.
- data may be stored in database records and accessed with SQL.
- the content scanner 52 may use logical folders and directories where content is stored to classify the source metadata documents contained in those logical folders and directories.
- the content scanner 52 may use an index and search functions file while scanning the source metadata documents.
- the content scanner 52 collects primary metadata hierarchies by adding target holders or items to the index and search functions file, and weights the metadata found in these items higher than metadata found elsewhere. For example, the terms and descriptions used in logical folder/directory names can be weighted to have higher importance that another data, such as OLAP dimension or relational table names.
- the content scanner 52 may define subject “is a” hierarchies or parent/child relationships existed in the source content, such as OLAP cube dimension trees, model entity trees, sample value enumerations for any arbitrary model entity item and folder structures.
- OLAP and relational data are primary sources of high quality subject hierarchies data.
- Sample value enumerations are extensions to these OLAP and relational sources.
- Sample value enumerations may be databases to queried to get “examples” of subject data. For example, the content scanner 52 can find that a “Yosamite Dome Tent” is a “Product”.
- the content scanner 52 captures terms for query item sample value enumerations by running the related model queries.
- the content scanner 52 stores these “is a” hierarchies in the subject Index 94 ( FIG. 2B ) in the knowledge base documents 54 that allow them to be found with standard full-text searches by the full-text index and search component 32 , as described below.
- the content scanner 52 may define subject hierarchies as full-text terms referencing a special XML content documents called subject index documents or knowledge base documents 54 .
- the XML in a subject index document refers to the definition of the subject phrase.
- a full text search for terms returns relevant subject index documents in the knowledge base documents 54 .
- searching for term “binocular” returns subject index documents “Product” and “Outdoor Gear” indicating that binocular is “is a” kind of Product and “is a” kind of Outdoor Gear.
- the content scanner 52 indexes words and phrases to create a virtual “hierarchy of topics”. For example, the content scanner 52 indexes the following hierarchies in each type of content: Content Type Index Hierarchy OLAP cubes Every word in every dimension/category tree and their relationship to parent dimensions/categories Models Every word in every package/folder tree and their relationship to parent packages/folders. Report Every word used in titles and labels. Model query items Specifications used in the report are expanded to index the text in the query result (these results are called “Sample Value Enumerations”). Cognos Every word in every folder and item along with their Connection relationship to parent folders (TM) Items
- the content scanner 52 indexes the entire dimension tree.
- Each word in each folder (dimension/category) and all contained element names are indexed, including content of all collapsed folders in this example.
- the relationship of terms to parent folders is also indexed.
- “products” is indexed as a parent of “accessories” child term.
- “accessories” is indexed as a parent of “headphones”.
- the “order method” is indexed as a parent of terms “sales” and “visit”.
- the content scanner 52 indexes each word in each display name, title and label.
- the content scanner 52 also indexes query subject/query item hierarchy in models.
- Each query item referenced in a report is expanded to get an index of text values.
- all tree relationships are indexed in the same fashion as cubes and folders.
- Query items like “product line”, “product type”, “product name” and “description” are expanded.
- the resulting text values and their relationship to their parent query item is indexed.
- the content scanner 52 stores these indexed words and items in the knowledge base documents 54 .
- Each knowledge-base document 54 is consumed by the full-text index and search component or engine 32 which adds a reference back to the knowledge-base document 54 for each term found in the document.
- the full-text index 34 is subsequently used by the full-text index and search engine 32 or other search tools to retrieve knowledge-base documents 54 that contain specified search terms.
- the taxonomy engine 56 provides taxonomy services to users and other applications 40 .
- the taxonomy engine 56 provides the taxonomy of the given search terms to the user or application 40 .
- the taxonomy may include indexed terms that are parent topic terms, sibling topic terms and descendent or child topic terms of the given search terms.
- the taxonomy engine 56 creates the taxonomy dynamically for each search term provided using the knowledge base documents 54 .
- FIG. 4A shows the procedure for determining parent topic terms from a given term received from the user or application 40 ( 100 ).
- the taxonomy engine 56 uses the full-text and search engine 32 to find knowledge-base documents 54 that contain the given term ( 102 ).
- the taxonomy engine 56 finds structured elements in the matching documents 54 where the given term is used ( 104 ).
- the taxonomy engine 56 gets all structured parent elements relative to the structured element for each matching term ( 106 ).
- the knowledge base document 54 stores the word and all references to its parents in the subject index.
- the taxonomy engine 56 uses these references to parents to locate relevant structured parent elements.
- the taxonomy engine 56 then returns a list of the identification of the located structured parent elements, e.g., a list of display names from the elements found across all matching documents 54 ( 108 ).
- FIG. 4B shows the procedure for determining sibling topic terms from a given term received ( 110 ).
- the taxonomy engine 56 uses the full-text index and search component 32 to find knowledge-base documents 54 that contain the given term ( 112 ).
- the taxonomy engine 56 finds structured elements in the matching documents 54 where the given term is used ( 114 ).
- the taxonomy engine 56 gets all structured sibling elements relative to the structured element for each matching term ( 116 ) using the subject index containing references to parent relationships in the matching documents 54 .
- the taxonomy engine 56 searches for other words with the same parents as the current word.
- the taxonomy engine 56 returns a list of identifications of the located structured sibling elements, e.g., the list of display names from the elements found across all matching documents 54 ( 118 ).
- FIG. 4C shows the procedure for determining Descendent child topic terms from a given term received ( 120 ).
- the taxonomy engine 56 uses the full-text index and search component 32 to find all knowledge-base documents 54 that contain the given term ( 122 ).
- the taxonomy engine 56 finds structured elements in the matching documents 54 where the given term is used ( 124 ).
- the taxonomy engine 56 gets all structured child elements relative to the structured element for each matching term ( 126 ) using the subject index containing references to parent relationships in the matching documents 54 .
- the taxonomy engine 56 searches for other words with this word as a parent.
- the taxonomy engine 56 then returns a list of identifications of the located structured child elements, e.g., a list of display names from elements found across all matching documents 54 ( 128 ).
- Other embodiments may include building taxonomies with “Crawler Task” that performs functions of the content scanner 52 , or building knowledge-base documents or corpus 54 with dedicated tools instead of content scanner 52 .
- a crawler task is an embodiment of a web crawler.
- other embodiments of the taxonomy management system 50 may use a relational database for the knowledge-base corpus 54 .
- the taxonomy engine 56 provides taxonomy searching or lineage searching that allows the parent subjects for any given term to be determined with a single full-text query. Taxonomy search results allow metadata names to be found for input terms that are values. Taxonomy searching is not based on linguistics or natural language processing. The relationships are extracted from actual metadata hierarchies in the source metadata documents as described above. Full-text indexing makes retrieval feasible.
- Taxonomy searching allows quick finding of metadata names and descriptions that contain a given search term. Also, search terms can be quickly associated to metadata. Subsequent metadata searches allow compatible drill targets to be found. Search term values can be passed to a drill target with matching metadata. There is reasonable assurance that term is a valid parameter. Search terms can be broadened to a parent topic phrase, allowing very accurate “see also” topic lists.
- the taxonomy engine 56 may use a navigation mechanism to narrow searches to a specifics topic. For example, the taxonomy engine 56 can show subject data as folder trees or other appropriate hierarchical visual aids to help a user select relevant related topics. When selected, these topics can be used to filter or narrow search results to include a subset of results based on the chosen topics.
- the taxonomy engine 56 may use reference counts to pick the parent topic with the largest number of child references. It may also use other links, such as “see also” links.
- the taxonomy management system 50 may provides APIs that enables implementation of these features of the taxonomy engine 56 .
- the content scanner 52 may be combined with a full-text index scanner that indexes terms, determines example terms in one integrated component.
- a full-text index and search service may integrate itself with the example component or engine 62 .
- the full-text index and search components 32 are search engines that index content by “crawling” through content. Examples include Google (TM) Web Server, Google (TM) Desktop Search, MSN (TM) Web Search, MSN (TM) Desktop Search and other enterprise search tools.
- the operators 40 are operators who issue search requests against the taxonomy engine 56 , view results and navigate to referenced business reporting metadata 21 .
- the taxonomy management system 50 may also have a ranking manager 60 for managing relevance scores for search results.
- the ranking manager 60 has a business relevance handler 62 and score calculator 64 .
- the business relevance handler 62 handles assignment of weights or base scores to individual metadata elements depending on their “relevance to the business”.
- the business relevance handler 62 determines the “relevance to the business” using data in the knowledge base documents 54 .
- the business relevance handler 62 may determine a base score of a metadata element based on its metadata type, parent metadata type, uniqueness of metadata element, usage of a target by other users, subscription to target by other users, running/compiling a target, and/or publish and update date of a target.
- the score calculator 64 calculates a relevance score for each search result item by aggregating base scores assigned to individual metadata elements included in the search result item.
- the metadata content management system 10 indexes the key elements of each report and other business oriented metadata 20 , and maintains a searchable database of reports and other metadata 20 , in a form of content index 90 or knowledge based documents 54 .
- This database 90 is optimized for efficient searching of metadata names and hierarchies. It also offers searching of text. This information is used to provide the metadata content management system 10 drilling features.
- the metadata content management system 10 populates full-text search engines (like the Google (TM) Intranet “Search Appliance”) with information about each report and other business oriented metadata 20 . It allows these search engines to find relevant content of business oriented metadata 20 .
- the content management system 10 preprocesses query text by evaluating terms of the query text to determine if they are a member of a topic hierarchy.
- Product, date, location and customer categorizations are often members of a topic hierarchy.
- the system 10 “stems” words to find their base. For example, “Accessories” becomes “accessory”. “Dates” becomes “date”.
- the system 10 assigns each word in the query text an initial weighting score.
- the system 10 weighs original terms incrementally higher than the stemmed terms, and incrementally higher than related parent topics.
- the system 10 also weighs terms found indexed incrementally higher than other terms.
- the system 10 uses these terms and their associated weights to build a query specification which is used by the index search function of the system 10 .
- a search example is described for search terms: “2005 tent sales”.
- the taxonomy engine 56 performs an initial lineage search or taxonomy search which indicates that “2005” is a date and “tent” is a product. Any reports that use these metadata elements become search targets. Prompts with matching metadata types, matching names in cubes and matching filter specifications that will allow dynamic positioning in the cube also become search targets.
- the user application 40 displays on screen context or a list of relevant items that includes the titles and labels of the searched targets along with the model query items used to produce the current page and/or selected items.
- the metadata content management system 10 searches for reports/cubes with the related metadata using values shown on the screen.
- the system 10 may also use topics and terms related with the displayed context to encompass parent topics.
- the search result is a high likelihood that all matching modeled elements and same-named display elements are found for the target that deals with related content.
- Search results are now described. Search results are typically shown as line items in list format, sorted by relevance score, as shown in FIG. 17 .
- the user can click a name to launch its matching item, click a group button to show grouped results, or click an arrow button in frame on left to show Related Subjects.
- the result display also has the search query edit field, which is optional.
- Grouped results show search results organized by match criteria with inner sort by relevance score, as shown in FIG. 18 .
- the user can click name to launch matching item, click a list button to show list results, or click an arrow button in frame on left to show related subjects.
- the subjects pane shows how the current search terms are used in the index, as shown in FIG. 19 .
- the user can click subject name to refine the current search with a matching item, or click an arrow button to hide this pane.
- Results line items include:
- the relevance score for each line item is uniquely calculated as a weighted aggregate score based on the metadata elements matched. Base scores for matching metadata elements are added together to create a score for each item.
- Metadata elements are assigned different base scores rooted to their “relevance to the business”. Elements with higher base scores will cause result line items for a target to ultimately have a higher aggregate score.
- Metadata drilling is feasible when a match can be found between an item selected in one application and the metadata exported from another report.
- the metadata content management system 10 significantly expands the number of possible metadata values. Therefore, the number of potential drill targets is increased.
- the dimension name “Channels” in the Cognos PowerPlay (TM) cube on the left is used to match a column name in the report on the right.
- This drill example illustrates that by using the metadata content management system 10 , drill targets can be determined dynamically at run-time. This differs from existing CPM tools where drill targets need to be specified when a report is being authored.
- the metadata content management system 10 provides metadata drilling that allows applications 40 to drill in any direction by dynamically determining drill targets at run-time. Metadata searches allow the user to drill in “non-traditional” ways. For example, it is possible to drill from reporting applications. As shown in FIG. 8 , the column name “Channels” in a report on the left is used to match a dimension name (or any star schema element) in a cube on the right. Optionally, the selected “Independent” channel name on the left can be used to link to a category name on the right.
- Meta drilling allows content to be linked using metadata only. Report values and measures are not needed. As such, these reports can be efficiently indexed when they are initially published. Indexing needs to be updated only when a report specification changes. No authoring is needed to drill in either direction.
- An example of using meta data from drilling is as follows. If a user searches for “products”, the user finds all reports that have a database table or OLAP dimension that includes the base (stemmed) word “product”. The user does not necessarily need to find “product” in any of the report data. If the user searches for “tents”, the user can determine that the base (stemmed) word “tent” is a “product” and then finds all reports that have a database table or OLAP dimension that includes the word “product”. The user does not necessarily need to find “tent” or “product” in any of the report data. By allowing content to be linked using metadata only, e.g., using hyperlinks, the system 10 can connect sources and targets.
- Meta drilling often results in multiple drill targets.
- the metadata content management system 10 lists the hierarchy of matches and allows a user to pick an appropriate target, as shown in FIG. 9 .
- the metadata content management system 10 lists matching reports by report type and then sorts them by a calculated relevance rating.
- the metadata content management system 10 calculates search relevance by first creating search criteria at the drill source location and then comparing this criteria with the resulting list of matching items.
- Source metadata and values are used to create a search specification.
- the metadata content management system 10 uses filter information, including the current crosstab dimension and category filters plus dimensions currently being displayed, to create a search specification.
- the metadata content management system 10 By drilling from the Mass Marketer category, the metadata content management system 10 builds a search specification that includes terms “looks for item Mass Marketer within Channels”. By drilling from the GO Sport Line category, metadata content management system 10 builds a search specification that includes terms “looks for item Go Sport Line within Products”. By drilling from the intersection of the Mass Marketer and GO Sport Line categories, metadata content management system 10 builds a search specification that includes both of the terms above.
- the metadata content management system 10 adds the related filter terms to the search criteria.
- the search is submitted to the metadata content management system 10 .
- the metadata content management system 10 calculates report relevance by comparing the number of matched terms with those found in each result item.
- Value drilling and searching allows target report values and measures to be searched. This means that search criteria can include value ranges within metadata constraints. As shown in FIG. 11 , the Order Number value “160” in a report on the left is used to match same Order Number value in a report on the right that is generated by a different reporting tool.
- Value searching allows content to be linked using metadata and associated report values/measures from actual reports. This type of drilling allows extends the number of reports that can be matched at the cost of using more index storage. It is also much slower than searching only metadata. As such, value searching is typically not a good choice for drilling. It is better suited for ad hoc searching where speed is less of a concern.
- the metadata content management system 10 also provides full-text searches against metadata 21 and values 22 .
- the metadata content management system 10 allows enterprise search tools 30 to be used to expose BI content of supported reporting applications 40 to report users. External full text search engines use proprietary technology to index content.
- the metadata content management system 10 “pushes” or populates index values to each supported external search engine. Search indexes are maintained by a search server associated with each external search engine. Storage requirements are dependent on the amount of information provided by metadata content management system 10 .
- the metadata content management system 10 may have configuration options that control how much information is “pushed” to these servers.
- the metadata content management system 10 typically maintains its own index that can be used standalone or in conjunction with the external search engines. The result is fast, relevant and predictable searches.
- the metadata content management system 10 may also allow applications to create lists of “see also” links that show related content.
- the metadata content management system 10 facilities are preferably exposed as WSDL compliant Web Services.
- FIG. 12 is an example from Cognos PowerPlay 7.3 (TM) showing the “Drill” button.
- FIG. 13 the user can type terms in the text search tool or browser search bar.
- the navigation and search engine 14 accepts request and builds a “Source Context”. This is carried out as follows.
- the navigation and search engine 14 starts the actual index search.
- the engine 14 extracts current filter values, view settings and all visible category information to create a “Source Context”.
- FIG. 14 shows an example of “Drill” from Cognos PowerPlay (TM) and the resultant Source Context.
- a “Source Context” is also built for “text” searches.
- the metadata content management system 10 uses the taxonomy management system 50 . Individual terms are inspected to see if they match subjects in the metadata content management system subject and term hierarchy, i.e., a “taxonomy”. When a match is found, terms are placed under their respective matching subjects. This allow terms to be treated like categories within OLAP dimensions.
- Hyper-Dimensional Navigation can be also used to create reports. This example shows a sales manager who wants to compare 2005 actual sales against his forecast for Projection TVs in US and Canada with detail breakdown by Order Method.
- the sales manager begins at the top level of his enterprise Navigation Bar. It shows the most frequently referenced reporting categories across the entire enterprise. While this navigation bar has literally hundreds of topics, only the top seven items for the “Sales Manager Role” are shown in FIG. 20 .
- the metadata content management system 10 creates a report (using the Sales Manager business role that he selected last time) as shown in FIG. 22 .
- Hyper-Dimensional navigation provides a concise definition of report items needed.
- Business Role also helps narrow the choices without asking too many questions.
- Verifying that the information shown is correct and appropriate he selects the columns of interest—in this case: Sales Year, Staff Name and Actual Revenue—and right-clicks to see available options.
- the component of the metadata content management system 10 is started to look for models and reporting examples that match the terms “Sales”, “Target”, or “Exceeded”.
- the metadata content management system 10 creates a new report specification from the model and metadata elements found.
- the report is run and the results are shown in FIG. 27 .
- the generated report has new columns: “Sales target” and “Exceeds target”. The sales manager got the numbers he was looking for.
- Advanced users can launch a report editing/creation tool to fine tune the generated report.
- the metadata content management system 10 performed this as follows. An element with a display title “Sales Target” is found in the model used most often by this user. It is deemed to be compatible with the original report's list-style format. Similarly, a calculation named “exceeds target” is found in a report that shows both actual revenue and sales target elements. This calculation is also deemed compatible. The metadata content management system 10 assembles the new report and displays the answer.
- Visual Report Construction is now described.
- Visual Report Construction is a simple idea: A user views elements in two reports that he wants in a single report. Using a drag-and-drop gesture, he drags elements together. Visually it's a trivial drag-and-drop gesture. Under-the-covers, it is a variant of the Report Extension example shown previously.
- a product manager wants to know who provides customer support for a particular product at different US retailers. She is familiar with Cognos PowerPlay (TM). She navigates to where she thinks the answer will be found as shown in FIG. 29 .
- TM Cognos PowerPlay
- Cognos PowerPlay (TM) drill-through button.
- TM Cognos PowerPlay
- TM recognizes that no authored drill-through actions exist for the selected cells, so it passes the request to the metadata content management system 10 component.
- the metadata content management system 10 asks her for two simple clarifications (which can optionally be defaulted in the future): “What business role are you currently performing?” and “What kind of details are you looking for?”. She sees the page shown in FIG. 30 . From the custom options shown, she selects Product Manager and Staff Details. She clicks Finish to get her answer.
- the metadata content management system 10 creates and runs a detail report that matches criteria in her chosen role using the original selected cell data as query values. She sees the results as shown in FIG. 31 . She verifies the report content by looking at title and column names. She sees her answer.
- the metadata content management system 10 performed this example as follows. Filter information is extracted from the source Cognos PowerPlay (TM) report. It is combined with the caller's selected job role and detail template to create a report with appropriate content. Roles are defined by an administrator and users themselves. Detail templates are created by authoring real reports in Report Studio and optionally creating templates of the metadata content management system 10 when saving and updating.
- TM Cognos PowerPlay
- Administration functions and user interface of the metadata content management system 10 are now described.
- the administration functions of the metadata content management system 10 administration are available only to users with administration capability.
- Index Properties are viewed via a link on item property pages, as shown in FIG. 32 .
- the item Type is Folder.
- the Open Folder link lets the user open the folder in a new window, as shown in FIG. 33 .
- Last Indexed shows the date/time of the most recent indexing on the folder itself (contained items will have different dates/times).
- Re-index initiates indexing of the folder, then all of its content (this is a “deep index” operation—all contained folders, subfolders and items are indexed).
- Disable Indexing stop further indexing of this folder and all contained folders, subfolders and items. When first disabled, all contained index entries are removed.
- the indexing user name can be set as “inherited” (default for new items) or to a specific user.
- XML index data is shown (read only) OK saves changes are returns to the Index tab. Cancel returns without making any changes.
- FIG. 34 shows an example of Item Property Pages. Properties for content items are the same as folders with minor label changes. It may also show a Model Index tab for a metadata model related to this report.
- FIG. 35 shows an example of Model Property Pages. Properties for models items are the same as folders with minor label changes.
- the Model tab is shown by itself for Model list items. It is shown with an Indexing Options tab for reporting application content items (e.g., Cognos ReportNet (TM) content items).
- TM Cognos ReportNet
- General administration functions of the metadata content management system 10 can be launched from an application, such as Cognos Connection, using a link in a lunch bar. This option may be shown only to users with Content Administration capabilities.
- the metadata content management system 10 finds content using simple terms and phrases. Multi-faceted navigation aids refine searches using business terminology related specifically to the customer's enterprise. When content does not t exist, the metadata content management system 10 seamlessly creates a “made-to-order” report, with the help of the customer's enterprise business terminology.
- the metadata content management system 10 allow the users to find relevant BI content using simple term or phrase searches. With the help of multi-faceted navigation aids—using business terminology related specifically to a particular enterprise or organization, the users can refine searches to zero-in on the answer the users need. When content does not exist for particular search criteria, the metadata content management system 10 seamlessly creates a “made-to-order” report. Business terminology from the user's enterprise or organization helps the users refine their report to meet their requirements. Report creation is transparent.
- the metadata content management system 10 allows users to search user's business oriented metadata and reporting applications. It provides a flexible “dynamic drilling” family of features, hyper-dimensional navigation, to user's applications. The “context awareness” feature allows seamless navigation to relevant related content.
- a search-oriented interface allows reports to be found and run easier and more efficiently. It can also use role-based reporting components in reporting applications.
- the metadata content management system 10 finds answers in existing reports and creates custom reports as needed.
- the metadata content management system 10 provides dynamic report construction.
- the metadata content management system 10 creates reports directly from search terms.
- the hyper-dimensional navigation let users navigate their cubes and reports at a time, using the dimension metaphor of their reporting applications.
- Fast searching combined with intrinsic knowledge of key enterprise reporting elements allows the metadata content management system 10 to build multi-dimensional hierarchies on-the-fly.
- This dynamic structure is used to show a hyper-dimensional view of all enterprise content that can be navigated like a Cognos PowerPlay (TM) cube.
- TM Cognos PowerPlay
- the taxonomy management system of the present invention may be implemented by any hardware, software or a combination of hardware and software having the above described functions.
- the software code, instructions and/or statements, either in its entirety or a part thereof, may be stored in a computer readable memory.
- a computer data signal representing the software code, instructions and/or statements may be embedded in a carrier wave may be transmitted via a communication network.
- Such a computer readable memory and a computer data signal and/or its carrier are also within the scope of the present invention, as well as the hardware, software and the combination thereof.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
- The present invention relates to a metadata content management and searching system and method, especially to a method and system for creating a taxonomy from business-oriented metadata content.
- Competitive economies motivate business managers and other users to obtain maximum value from their investments for Corporate Performance Management (CPM) tools, such as Business Intelligence (BI) tools, that are used to manage business oriented data and metadata. These CPM tools provide authored reports or authored drill-through targets to link content together. Users often encounter similar problems in finding important reports or relevant data or drilling to related content if it was not previously authored.
- Traditional search technologies often provide incomplete or irrelevant results in the CPM environments. There are metadata search tools that run against relational databases. They can fail to find relevant data since they only search databases and do not leverage a customer's investment in CPM tools and applications. Relying on authored drill-through targets can also be problematic as new cube, reports, metrics or plans are added since new drill targets are not always kept up-to-date. Users can have difficulties moving seamlessly between CPM tools or applications, particularly when CPM applications are created by different individuals or departments.
- It is therefore desirable to provide a mechanism that allows more effective searches of business oriented metadata content.
- There exist search engines that use a full-text index combined with statistical methods to create ordered search results. An example of such a search engine is page ranking that is described in U.S. Pat. No. 6,526,440 issued to Bharat. However, these search engines are not sufficient to search complex data like business oriented metadata since they rely on ranking algorithms that work with data found primarily in the Global Internet and not inside a business.
- Some search engines use taxonomies to improve results. Creation of taxonomies has been carried out by a manual process or by an automated process based on advanced linguistic analysis. However, business taxonomies are difficult and expensive to build manually. Also, linguistic analysis are often complicated and thus prone to result in inaccurate outcome.
- It is therefore desirable to provide a system that manages business taxonomies automatically without the need for complicated and potentially inaccurate linguistic analysis.
- It is an object of the invention to provide an improved metadata content management system that obviates or mitigates at least one of the disadvantages of existing systems.
- The invention uses a content index to store knowledge base representations of terms in source documents containing business oriented metadata along with references to the original source metadata content that uses the terms.
- In accordance with an aspect of the present invention, there is provided a method and system for building a searchable index corpus that includes taxonomy definitions (or topic hierarchies) obtained from the structure of business reporting metadata.
- In accordance with an aspect of the invention, there is provided a business taxonomy management system comprising an indexing engine, an index store and a taxonomy engine. The indexing engine is provided for indexing content of source business oriented metadata, the indexing engine having a content scanner for reading the business oriented metadata, defining taxonomy of the business oriented metadata, and building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata. The index store is provided for storing the content index of the business oriented metadata. The taxonomy engine is provided for providing taxonomy services to users using the content index.
- In accordance with another aspect of the invention, there is provided a method of managing business taxonomy. The method comprises the steps of reading content of source business oriented metadata; defining taxonomy of the business oriented metadata; building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata; storing the content index of the business oriented metadata; and providing taxonomy services to users using the content index.
- In accordance with another aspect of the invention, there is provided a computer readable medium storing instructions or statements for use in the execution in a computer of a method of managing business taxonomy. The method comprises the steps of reading content of source business oriented metadata; defining taxonomy of the business oriented metadata; building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata; storing the content index of the business oriented metadata; and providing taxonomy services to users using the content index.
- In accordance with another aspect of the invention, there is provided a propagated signal carrier carrying signals containing computer executable instructions that can be read and executed by a computer, the computer executable instructions being used to execute a method of managing business taxonomy. The method comprises the steps of reading content of source business oriented metadata; defining taxonomy of the business oriented metadata; building a content index of the business oriented metadata including a subject index representing the taxonomy of the business oriented metadata; storing the content index of the business oriented metadata; and providing taxonomy services to users using the content index.
- This summary of the invention does not necessarily describe all features of the invention.
- These and other features of the invention will become more apparent from the following description in which reference is made to the appended drawings wherein:
-
FIG. 1 is a block diagram showing a metadata content management system in accordance with an embodiment of the present invention; -
FIG. 2A is a block diagram showing an embodiment of the metadata content management system; -
FIG. 2B is a block diagram showing an embodiment of a content index component; -
FIG. 3 is a block diagram showing an embodiment of a taxonomy management system; -
FIG. 4A is a flowchart showing an embodiment of the procedure for determining parent topic terms; -
FIG. 4B is a flowchart showing an embodiment of the procedure for determining sibling topic terms; -
FIG. 4C is a flowchart showing an embodiment of the procedure for determining child topic terms; -
FIG. 5 is a diagram showing another embodiment of the taxonomy management system; -
FIG. 6 is a diagram showing metadata and report values; -
FIG. 7 is a diagram showing an example of a report and user interface display by the metadata content management system; -
FIG. 8 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 9 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 10 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 11 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 12 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 13 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 14 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 15 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 16 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 17 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 18 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 19 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 20 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 21 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 22 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 23 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 24 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 25 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 26 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 27 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 28 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 29 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 30 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 31 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 32 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 33 is a diagram showing another example of a report and user interface display by the metadata content management system; -
FIG. 34 is a diagram showing another example of a report and user interface display by the metadata content management system; and -
FIG. 35 is a diagram showing another example of a report and user interface display by the metadata content management system. - Referring to
FIG. 1 , a metadatacontent management system 10 in accordance with an embodiment of the invention is described. The metadatacontent management system 10 is suitably used for an enterprise or other organization that has sources of business oriented information, i.e., business oriented metadata 20. The metadatacontent management system 10 interacts with the business oriented metadata 20, as well as one or more search tools orcomponents 30 anduser reporting applications 40 used by the organization. - An organization typically has untapped sources of information, e.g., business oriented metadata 20 including reporting
metadata 21 and specifications and key report values 22 of theuser reporting applications 40. The business oriented metadata 20 includes OLAP and dimensional business data defined by theuser reporting applications 40. These information, metadata and values may be collectively called as business oriented metadata 20 in this specification. - The metadata
content management system 10 indexes the content of the business oriented metadata 20. It analyzes the business oriented metadata 20 to create a search index. Since the search index is created from the organization's metadata 20, it is suitable for the organization. By providing such a search index, the metadatacontent management system 10 promotes navigation betweenBI tools 30 andreporting applications 40, creating a strategic view of CPM assets. The metadatacontent management system 10 captures application context, e.g., “viewing location” or “query parameters”, by creating the search index from the reportingmetadata 21. The search index created by the metadatacontent management system 10 enables many unique navigation options beyond traditional folder browsing and text searching. - As shown in
FIG. 6 , a typical organization hasvarious data sources 39, such as operational databases and/or data warehouses, and several CPM tools oruser reporting applications 40 that create cubes and/or reportspecifications 41 and generate reports 42.Reporting metadata 21 and associatedvalues 22 are produced by thoseapplications 40. Other business oriented metadata may be exported from metadata modeling tools. While authoring reports in reportingapplications 40, the creation of new hierarchies and data definitions occur. These hierarchies and data definitions are useful for drilling and searching. This data is often more recognizable to end-users since this is the data or text that the users see inapplications 40 and theirreports 41 These metadata and report data are considered asextended metadata 21 to describe the metadata created by different authoring and processing phases.Extended report data 22 refers to values created in a similar fashion. - These
extended metadata 21 andreport data 22 can be viewed as new BI data or business oriented metadata 20 of the organization. The metadatacontent management system 10 leverages the new BI data 20 to provide searching and drilling that was previously unavailable in existing systems, as described below. - Examples of
extended metadata 21 added by the authoring process includes dimension names, dimension levels, category names, alternate category names, cube hierarchies, table and record names, group names, parent/child relationships between categories, groups or tables, authored drill target names, CPM tool's model entities such as packages, namespaces, query items, query sources and relevant authored relationships. Examples of extended authored report values 22 include items related by one of more dimensions, categories, measures groups or tables, calculated values, and annotations. - For example, a BI tool may provide dimensional business data, such as crosstable providing dimension, category and measure names. These names represent
extended metadata 21. These names may or may not match table/column names in a star schema or other relational model. Yet each of these names represents an important potential target for drilling or searching. Values stored in a cube, including calculated values, represent extended data or values 22. They are a valuable target for searching. Likeextended metadata 21, many of thesevalues 22 are not found in any other data store. - Another example of a
reporting tool 40 may provide a report with columns. In such a report, each of the column heading represents extendedmetadata 21. The report grouping, e.g., by country, represents another form ofextended metadata 21. Report values themselves representextended report data 22. They offer important linking and search targets. - In these cases, the extended metadata names are the same as those viewed by the report user. Thus, these extended metadata names are often most relevant and recognizable to the report user. Using these metadata names allows the metadata
content management system 10 to provide information relevant and recognizable to the report user. These metadata names may or may not match the names used in the underlying databases. - Authored links, such as those anchored to the column name “Sales Rep Name”, provide additional summary information about the linked reports. This information also represents extended
metadata 21. This information allows the metadatacontent management system 10 to further increase search relevance about the destination content of the metadata 20 including themetadata 21 or report values 22. - The metadata
content management system 10 indexes content of the business oriented metadata 20 and generates a content index or index corpus which is a searchable database of representations of the content of the business oriented metadata 20, as further described below. - Research related to data searching and linking technologies commonly identifies two basic types of data: structured data and unstructured data. Structured data is defined by a formal schema. Typically structured data is searched with utilities of Online Analytical Processing (OLAP), Structured Query Language (SQL) and eXtensible Markup Language (XML). Unstructured data is normally found in documents and static web pages. Typically unstructured data is searched using free-form queries with web tools, such as Google (TM).
- The metadata
content management system 10 provides searching functions over both types of data by using the content index of the business oriented metadata 20. Structured data searches are used to implement report-to-report drilling. This includes listing selecting from multiple targets. The metadatacontent management system 10 typically uses its internal search engine or navigation and drill engine 14 (FIG. 2A ) to handle structured data searches. Full-text searches are used to find reports for unstructured user queries. The metadatacontent management system 10 typically handles full-text search by interacting with external search tools 30 (FIG. 1 ). For example, searches launched from Internet Information Services (IIS) or Portal text search tools are full-text searches. - The content index provides various advantages. The metadata
content management system 10 enhances search and drill-through capabilities across the range ofuser report applications 40 without requiring drill-through authoring in source content. A report author simply publishes target reports and lets the metadatacontent management system 10 find drill locations to the target content. - The metadata
content management system 10 organizes business oriented metadata content in ways that are more relevant and meaningful to users. The metadatacontent management system 10 also includes several personalization and administration options. - The metadata
content management system 10 describes data using names and labels from actual reports. These names are often more familiar and relevant to report users. The metadatacontent management system 10 also provides enhanced report-to-report drilling and product-to-product navigation. It expands the number of places where report users can “drill-to” and “drill-from” in a report. Most drilling requires no advance authoring. The metadatacontent management system 10 improves the capabilities of search tools. This includes the concept of ‘federated’ search across a variety of portal and web search indices. -
User reporting applications 40 often generate authored relational and OLAP reports. Those reports provide a wealth of new metadata, including schema information, that is largely hidden from other tools and reporting applications. The metadatacontent management system 10 exposes this metadata in a standard format that can be re-used byother CPM applications 40 andtools 30. -
FIG. 2A shows an embodiment of the metadatacontent management system 10. The metadatacontent management system 10 has acontent index component 12, navigation anddrill engine 14 andtools 16. - The metadata
content management system 10 uses indexing so that the metadata content can be searched and organized in real-time. Indexing is normally performed by the metadatacontent management system 10 when the metadata content is published or updated. Indexing can be performed by a scheduled administrator task (example: nightly cron job). It can also be performed manually by an administrator or user. - As shown in
FIG. 2B , thecontent index component 12 has anindexing engine 80 and anIndex store 82. Theindex store 82 stores files forcontent index 90. Thecontent index 90 may also be called an index corpus or knowledge base. Thecontent index 90 is a full-text index. - The
indexing engine 80 performs indexing of the content of the business oriented metadata 20 for a particular organization. It analyzes the content of the business oriented metadata 20 and creates indexes as described below. Since it creates indexes from the business oriented metadata of the organization, the created indexes are suitable for the organization. - A single set of index files is typically maintained in the
index store 82 in thecontent index component 12 for all users and user groups for the organization. By storing a single set of index files in a single store, the metadatacontent management system 10 can provide optimal or improved performance. Theindex store 82 may be part of a server file system of the organization. - A
content index 90 is a collection of content indexes. In other words, thecontent index 90 is a concordance of unique words (called terms) across scanned or indexed content items (called documents). Each content index contains an entry for each term across the indexed documents. Each context index catalogs individual words or terms and stores them along with their usage or other data. Each indexed content term contains a list of the indexed documents that have that term. Each indexed content term also contains usage statistics and the position of the term within each indexed document where possible. A content index is an “inverted index” where each indexed term refers to a list of documents that have the indexed term, rather than each indexed document contains a list of terms as in traditional indexes. Thecontent index 90 provides term searches and links to additional data stored in thecontent index 90. Each content index may contain, for each content, i.e., target item, information regarding the name or identification of the target item; module, cube or report metadata and their relevant metadata hierarchy; item location in the document folder hierarchy; and/or reference to its dependent model. - A content index may be an XML content index that describes each indexed item in XML. An XML content index stores applicable metadata, metrics and planning information that improve search relevance. Each XML content index is associated with each indexed document. An indexed document is an XML file that catalogs metadata, report values and other reporting application-specific information.
- The XML content index items or data are stored in flat files in the
index store 82. Theindex store 82 may be the application server's file system. A relational database can optionally be configured to store this XML content index data. “Read” activity related to XML content index items is low compared to typical full-text index items. Records of XML content index items are read bysearch tools 30 after a list of preliminary search results has been made by the navigation &drill engine 14, as described below. - While
FIG. 2B shows theindex store 82 within thecontent index component 12, theindex store 82 may or may not be part of the metadatacontent management system 10. - The
content index 90 may be stored in application server flat files. Thecontent index 90 is typically optimized to minimize disk reads and keep term storage as low as possible. Thecontent index 90 may be stored in a data store of an external full-text search engine. For example, the metadatacontent management system 10 may use an implementation of an existing full-text engine, e.g., the open source Apache Jakata Lucene full-text engine. - The
content index 90 also includes a taxonomy orsubject index 94. Thesubject index 94 may also be called a subject hierarchy, topic hierarchy, topic tree or subject dictionary. Thesubject index 94 is a collection of indexes, each being a file-based index extension that allows subject hierarchies or taxonomies to be quickly queried. Thesubject index 94 allows searches of parent topic names for a given term, as further described below. - The navigation and
drill engine 14 is a server component that analyses each user's “context” within theiractive reporting application 40. The navigation anddrill engine 14 processes drill requests and text searches. When a drill request is received from areporting application 40, the navigation anddrill engine 14 extracts unique words or terms from the search context of the request. An example of search bar “text” and the resultant search context is shown inFIG. 15 . - The navigation and
drill engine 14 passes these unique terms to a full-text search engine, which returns a list of matching documents. The metadatacontent management system 10 may have a full-text search engine, or may use an external full-text search engine. - The navigation and
drill engine 14 may apply a security check to this list of results returned thus far. The navigation anddrill engine 14 may send batch or grouped security queries to the appropriatetarget reporting application 40. Denied items are removed from the results list. - To improve the result relevance, the navigation and
drill engine 14 may also score the documents using the content index entry for each item. The content index entry may be retrieved from a flat file storing the relevant content index or the linked relational database. The navigation anddrill engine 14 may use the original source context to establish the user's position with thereporting application 40, or to establish the subject/term relationship of the request. The navigation anddrill engine 14 may apply several queries and optimizations over the index content to select relevant content, such as cubes with best dimension and category matches, reports with best matching of columns, groupings and/or values, and/or prompted reports with the best matching prompts. The navigation anddrill engine 14 automatically generates dynamics filters that navigate the user into a matching cube. Also, the navigation anddrill engine 14 automatically generates prompt answers for prompted reports, using any item with a significant number of matching terms, and priority given to those terms listed in the taxonomy of subjects/terms. - The navigation and
drill engine 14 returns the drilling results to thereporting application 40. An example is shown inFIG. 16 . The results may be sorted by relevance (score) or by groups, such as “reports with matching prompts” or “cubes with matching dimensions”. - The functions of the navigation and
drill engine 14 are further described below using examples. - The navigation and
drill engine 14 may be a Java-based server component. The navigation anddrill engine 14 may use a unique security algorithm or method to ensure that users see only the results they are authorized to access. The metadatacontent management system 10 validates its search results against the referencedreporting application 40. A user only sees items that the user has permission to access. Each reportingapplication 40 allows different levels of access. The navigation anddrill engine 14 may send security queries to therelevant reporting applications 40 to determine security criteria set by theapplications 40. -
Tools 16 provide various features.Tools 16 allow integration withsearch tools 30 and portals. Administrators may access to the components of the metadatacontent management system 10 usingtools 16 via a web browser. - As shown in
FIG. 2A , the metadatacontent management system 10 also has ataxonomy management system 50. The metadatacontent management system 10 provides searchable metadata and report data in a form of knowledge base documents 54 (FIG. 3 ) in thecontent index component 12 using thetaxonomy management system 50. - The business
taxonomy management system 50 is used for building a searchable content index orindex corpus 90 that includessubject index 94 containing business taxonomy definitions obtained from the structure of business reporting metadata. A taxonomy is a hierarchy of topics or subjects. A taxonomy may also be called a subject index, subject hierarchy, topic hierarchy, topic tree and subject dictionary. The business taxonomy is used to classify terms and phrases. The taxonomy letssearch components 30 find terms within a given subject or topic. The taxonomies improve many search engine functions including search results relevance, refinement of search criteria and creation of related business reporting content. - It is typically desirable to store the
subject index 94 and thecontent index 90 together. By doing so, the metadatacontent management system 10 can allow thecontent index 90 to dynamically producesubject index 94, i.e., subject hierarchies, by searching thecontent index 90 for, e.g., “parent term relationships” of any word or phrase. This is possible because for a given word, thecontent index 90 stores references to its parent or parents. A word may have multiple derivations. - An example of a taxonomy is described for a system in which the term “Cost” is used as a Measure with names: Billing Cost, Average Billing Cost, Average Billing Cost per Customer, Average Billing Cost per Product, and Actual Cost. Also, it is used as a Report Columns/Heading with names: Product Cost, Planned Total Cost, and Cost of Goods Sold. In a “taxonomy-aware” system, any of these subjects can be used to help find more relevant results for the otherwise ambiguous term “Cost”.
- As shown in
FIG. 3 , the businesstaxonomy management system 50 comprises three main components:content scanner 52, knowledge-base documents 54, andtaxonomy engine 56. Thecontent scanner 52 may be part of theindexing engine 80. The knowledge-base documents 54 form thecontent index 90, which in turn is used to define thesubject index 94. - External components which interact with the main components are
business reporting metadata 21, full-text index andsearch component 32, and end-users or reportingapplications 40 that provide search terms and consume taxonomy responses. WhileFIG. 3 showsbusiness reporting metadata 21 only as source metadata, thetaxonomy management system 50 may also use other business oriented metadata 20. -
FIG. 3 shows the flow of information between these components. - The business
taxonomy management system 50 uses the structure ofbusiness reporting metadata 21 to create a living, de facto taxonomy definition of topics for a given business entity or organization, as described below. This taxonomy defines how terms are used in the business by the organization. Thebusiness reporting metadata 21 is metadata that exists anywhere in a given business or organization. Thebusiness reporting metadata 21 typically includes OLAP and dimensional business data. The businesstaxonomy management system 50 obtains the business topic structure from such OLAP and dimensional business data. For example, thebusiness reporting metadata 21 may be derived or extracted from reports and other documents as described above. Reports and other documents are metadata documents, i.e., documents containing business oriented metadata, that define query, layout, labeling and annotation of other content. Examples of metadata documents include business reporting and analysis metadata documents authored with report authoring and creation tools, such as business intelligence application suites; business modeling and optimization metadata documents; budgeting, planning and forecasting metadata documents; and financial consolidation metadata documents. - The source business content or
business reporting metadata 21 contains topic hierarchies. The businesstaxonomy management system 50 extracts information of the structure of the topic hierarchies from the sourcebusiness reporting metadata 21. The businesstaxonomy management system 50 creates taxonomy definition of topics automatically without human intervention based on the structure of the topic hierarchies in thebusiness reporting metadata 21. It uses a deterministic algorithm or method that provides reliable results without the need for complicated and potentially inaccurate linguistic analysis. For example, the businesstaxonomy management system 50 performs parent subject determination by examining “is a” relationships (i.e., parent/child relationships) found in the topic hierarchies that exist in thesource business content 21. The businesstaxonomy management system 50 stores and maintains the created taxonomy definition of topics, e.g., parent/child relationships, in knowledge base documents 54. - The business
taxonomy management system 50 uses simple linguistics common processes for synonyms, stemming and morphology. For synonyms, the businesstaxonomy management system 50 changes word to more common or popular equivalents, e.g., obtuse becomes veiled. For stemming, the businesstaxonomy management system 50 removes plurals and other word extensions to find base words, e.g., accessories becomes accessory. For morphology, the businesstaxonomy management system 50 changes words to find more common embodiments, e.g., acceleration is a common embodiment of to accelerate. The businesstaxonomy management system 50 does not use any complicated and unreliable linguistics processes. - The full-text index and
search component 32 uses afull text index 34. Thefull text index 34 is a concordance of terms across the scanned or indexed documents. An entry is made for each scanned word (excepting, e.g., stop words which are too common to be useful) that lists the exact position of each occurrence of the word within corpus of documents. From such a list, it is relatively simple to retrieve all the documents that match a query, without having to scan each document. The full-text index andsearch component 32 provides users andapplications 40 with interfaces to build and search itsfull text index 34. - Users or
applications 40 are consumers of thetaxonomy management system 50. Users orapplications 40 may be referred to as operators hereinafter. Theuser applications 40 may be web browsers. - The
content scanner component 52 reads documents containingbusiness reporting metadata 21. It builds a knowledge-base representation of thebusiness reporting metadata 21 with one or more of details, such as the unique document identifier, document date, structured hierarchy of reporting elements from the source document, database queries used in each structured reporting element, and linkages to other structured reporting elements in this document and other business reporting metadata documents. Typical examples of the structured hierarchy of reporting elements from the source document include data grouping, headings and labels, and other structure information often available in OLAP and dimensional business data. Each reporting element provides an identification, e.g., a display name, used in the reporting elements produced by the source metadata. - The
content scanner 52 builds or updates a knowledge-base document 54 for each source metadata document containingbusiness reporting metadata 21. Aknowledge base document 54 is used to store a knowledge base representation of each term in the each source metadata document along with references to content of themetadata 21 that uses the term. - In this
system 50, a knowledge-base document 54 is encoded in Extensible Markup Language (XML) and stored in system data files. In a different embodiment, any storage or encoding mechanism can be used. For example data may be stored in database records and accessed with SQL. - The
content scanner 52 may use logical folders and directories where content is stored to classify the source metadata documents contained in those logical folders and directories. Thecontent scanner 52 may use an index and search functions file while scanning the source metadata documents. As the source metadata documents are scanned, thecontent scanner 52 collects primary metadata hierarchies by adding target holders or items to the index and search functions file, and weights the metadata found in these items higher than metadata found elsewhere. For example, the terms and descriptions used in logical folder/directory names can be weighted to have higher importance that another data, such as OLAP dimension or relational table names. - The
content scanner 52 may define subject “is a” hierarchies or parent/child relationships existed in the source content, such as OLAP cube dimension trees, model entity trees, sample value enumerations for any arbitrary model entity item and folder structures. OLAP and relational data are primary sources of high quality subject hierarchies data. Sample value enumerations are extensions to these OLAP and relational sources. Sample value enumerations may be databases to queried to get “examples” of subject data. For example, thecontent scanner 52 can find that a “Yosamite Dome Tent” is a “Product”. Thecontent scanner 52 captures terms for query item sample value enumerations by running the related model queries. - The
content scanner 52 stores these “is a” hierarchies in the subject Index 94 (FIG. 2B ) in theknowledge base documents 54 that allow them to be found with standard full-text searches by the full-text index andsearch component 32, as described below. Thecontent scanner 52 may define subject hierarchies as full-text terms referencing a special XML content documents called subject index documents or knowledge base documents 54. The XML in a subject index document refers to the definition of the subject phrase. - A full text search for terms returns relevant subject index documents in the knowledge base documents 54. For example, searching for term “binocular” returns subject index documents “Product” and “Outdoor Gear” indicating that binocular is “is a” kind of Product and “is a” kind of Outdoor Gear.
- The
content scanner 52 indexes words and phrases to create a virtual “hierarchy of topics”. For example, thecontent scanner 52 indexes the following hierarchies in each type of content:Content Type Index Hierarchy OLAP cubes Every word in every dimension/category tree and their relationship to parent dimensions/categories Models Every word in every package/folder tree and their relationship to parent packages/folders. Report Every word used in titles and labels. Model query items Specifications used in the report are expanded to index the text in the query result (these results are called “Sample Value Enumerations”). Cognos Every word in every folder and item along with their Connection relationship to parent folders (TM) Items - For example, for OLAP cubes, the
content scanner 52 indexes the entire dimension tree. Each word in each folder (dimension/category) and all contained element names are indexed, including content of all collapsed folders in this example. The relationship of terms to parent folders is also indexed. In this example, “products” is indexed as a parent of “accessories” child term. “accessories” is indexed as a parent of “headphones”. The “order method” is indexed as a parent of terms “sales” and “visit”. - For report specifications and models, the
content scanner 52 indexes each word in each display name, title and label. Thecontent scanner 52 also indexes query subject/query item hierarchy in models. Each query item referenced in a report is expanded to get an index of text values. In this example, all tree relationships are indexed in the same fashion as cubes and folders. Query items like “product line”, “product type”, “product name” and “description” are expanded. The resulting text values and their relationship to their parent query item is indexed. Thecontent scanner 52 stores these indexed words and items in the knowledge base documents 54. - Each knowledge-
base document 54 is consumed by the full-text index and search component orengine 32 which adds a reference back to the knowledge-base document 54 for each term found in the document. The full-text index 34 is subsequently used by the full-text index andsearch engine 32 or other search tools to retrieve knowledge-base documents 54 that contain specified search terms. - The
taxonomy engine 56 provides taxonomy services to users andother applications 40. When one or more search terms are provided by a user orapplication 40, thetaxonomy engine 56 provides the taxonomy of the given search terms to the user orapplication 40. The taxonomy may include indexed terms that are parent topic terms, sibling topic terms and descendent or child topic terms of the given search terms. Thetaxonomy engine 56 creates the taxonomy dynamically for each search term provided using the knowledge base documents 54. -
FIG. 4A shows the procedure for determining parent topic terms from a given term received from the user or application 40 (100). Thetaxonomy engine 56 uses the full-text andsearch engine 32 to find knowledge-base documents 54 that contain the given term (102). Thetaxonomy engine 56 finds structured elements in the matchingdocuments 54 where the given term is used (104). Thetaxonomy engine 56 gets all structured parent elements relative to the structured element for each matching term (106). For a given word, theknowledge base document 54 stores the word and all references to its parents in the subject index. Thetaxonomy engine 56 uses these references to parents to locate relevant structured parent elements. Thetaxonomy engine 56 then returns a list of the identification of the located structured parent elements, e.g., a list of display names from the elements found across all matching documents 54 (108). -
FIG. 4B shows the procedure for determining sibling topic terms from a given term received (110). Thetaxonomy engine 56 uses the full-text index andsearch component 32 to find knowledge-base documents 54 that contain the given term (112). Thetaxonomy engine 56 finds structured elements in the matchingdocuments 54 where the given term is used (114). Thetaxonomy engine 56 gets all structured sibling elements relative to the structured element for each matching term (116) using the subject index containing references to parent relationships in the matchingdocuments 54. To find siblings, thetaxonomy engine 56 searches for other words with the same parents as the current word. Thetaxonomy engine 56 returns a list of identifications of the located structured sibling elements, e.g., the list of display names from the elements found across all matching documents 54 (118). -
FIG. 4C shows the procedure for determining Descendent child topic terms from a given term received (120). Thetaxonomy engine 56 uses the full-text index andsearch component 32 to find all knowledge-base documents 54 that contain the given term (122). Thetaxonomy engine 56 finds structured elements in the matchingdocuments 54 where the given term is used (124). Thetaxonomy engine 56 gets all structured child elements relative to the structured element for each matching term (126) using the subject index containing references to parent relationships in the matchingdocuments 54. To find children, thetaxonomy engine 56 searches for other words with this word as a parent. Thetaxonomy engine 56 then returns a list of identifications of the located structured child elements, e.g., a list of display names from elements found across all matching documents 54 (128). - Other embodiments may include building taxonomies with “Crawler Task” that performs functions of the
content scanner 52, or building knowledge-base documents orcorpus 54 with dedicated tools instead ofcontent scanner 52. A crawler task is an embodiment of a web crawler. Also, other embodiments of thetaxonomy management system 50 may use a relational database for the knowledge-base corpus 54. - The
taxonomy engine 56 provides taxonomy searching or lineage searching that allows the parent subjects for any given term to be determined with a single full-text query. Taxonomy search results allow metadata names to be found for input terms that are values. Taxonomy searching is not based on linguistics or natural language processing. The relationships are extracted from actual metadata hierarchies in the source metadata documents as described above. Full-text indexing makes retrieval feasible. - Taxonomy searching allows quick finding of metadata names and descriptions that contain a given search term. Also, search terms can be quickly associated to metadata. Subsequent metadata searches allow compatible drill targets to be found. Search term values can be passed to a drill target with matching metadata. There is reasonable assurance that term is a valid parameter. Search terms can be broadened to a parent topic phrase, allowing very accurate “see also” topic lists.
- When taxonomy searches show that a term has multiple parent topics, the
taxonomy engine 56 may use a navigation mechanism to narrow searches to a specifics topic. For example, thetaxonomy engine 56 can show subject data as folder trees or other appropriate hierarchical visual aids to help a user select relevant related topics. When selected, these topics can be used to filter or narrow search results to include a subset of results based on the chosen topics. Thetaxonomy engine 56 may use reference counts to pick the parent topic with the largest number of child references. It may also use other links, such as “see also” links. - The
taxonomy management system 50 may provides APIs that enables implementation of these features of thetaxonomy engine 56. - In a different embodiment, the
content scanner 52 may be combined with a full-text index scanner that indexes terms, determines example terms in one integrated component. A sophisticated embodiment of a full-text index and search service may integrate itself with the example component orengine 62. - The full-text index and
search components 32 are search engines that index content by “crawling” through content. Examples include Google (TM) Web Server, Google (TM) Desktop Search, MSN (TM) Web Search, MSN (TM) Desktop Search and other enterprise search tools. - The
operators 40 are operators who issue search requests against thetaxonomy engine 56, view results and navigate to referencedbusiness reporting metadata 21. - As shown in Figure ?, the
taxonomy management system 50 may also have aranking manager 60 for managing relevance scores for search results. Theranking manager 60 has abusiness relevance handler 62 and scorecalculator 64. - The
business relevance handler 62 handles assignment of weights or base scores to individual metadata elements depending on their “relevance to the business”. Thebusiness relevance handler 62 determines the “relevance to the business” using data in the knowledge base documents 54. For example, thebusiness relevance handler 62 may determine a base score of a metadata element based on its metadata type, parent metadata type, uniqueness of metadata element, usage of a target by other users, subscription to target by other users, running/compiling a target, and/or publish and update date of a target. - The
score calculator 64 calculates a relevance score for each search result item by aggregating base scores assigned to individual metadata elements included in the search result item. - Examples of operations of the metadata
content management system 10 are now described. - The metadata
content management system 10 indexes the key elements of each report and other business oriented metadata 20, and maintains a searchable database of reports and other metadata 20, in a form ofcontent index 90 or knowledge baseddocuments 54. Thisdatabase 90 is optimized for efficient searching of metadata names and hierarchies. It also offers searching of text. This information is used to provide the metadatacontent management system 10 drilling features. The metadatacontent management system 10 populates full-text search engines (like the Google (TM) Intranet “Search Appliance”) with information about each report and other business oriented metadata 20. It allows these search engines to find relevant content of business oriented metadata 20. - Searching functions and the user interface of the metadata
content management system 10 are now described. Searching functions are launched internally in reportingapplications 40 using a specified user interface. Alternatively, search may also be requested by anyreporting application 40 with the URL, e.g., http://hostname/crn/xxxxxx?c=search&q=p1\p2\p3 . . . &e=y&u=y&r=g&back=backURL; where q=p1\p2\p3 . . . represents any number of arbitrary search terms separated by “\”; e=y shows search edit field with current terms (for refining search); u=y is used when clicking hyperlinked results, launch in separate window; r=g shows grouped results (default is list results); and backURL shows a return address. - The
content management system 10 preprocesses query text by evaluating terms of the query text to determine if they are a member of a topic hierarchy. Product, date, location and customer categorizations are often members of a topic hierarchy. Thesystem 10 “stems” words to find their base. For example, “Accessories” becomes “accessory”. “Dates” becomes “date”. Thesystem 10 assigns each word in the query text an initial weighting score. Thesystem 10 weighs original terms incrementally higher than the stemmed terms, and incrementally higher than related parent topics. Thesystem 10 also weighs terms found indexed incrementally higher than other terms. Thesystem 10 uses these terms and their associated weights to build a query specification which is used by the index search function of thesystem 10. - A search example is described for search terms: “2005 tent sales”. The
taxonomy engine 56 performs an initial lineage search or taxonomy search which indicates that “2005” is a date and “tent” is a product. Any reports that use these metadata elements become search targets. Prompts with matching metadata types, matching names in cubes and matching filter specifications that will allow dynamic positioning in the cube also become search targets. - A drilling example is now described. The
user application 40 displays on screen context or a list of relevant items that includes the titles and labels of the searched targets along with the model query items used to produce the current page and/or selected items. When the user requests drilling from this displayed context, the metadatacontent management system 10 searches for reports/cubes with the related metadata using values shown on the screen. Thesystem 10 may also use topics and terms related with the displayed context to encompass parent topics. The search result is a high likelihood that all matching modeled elements and same-named display elements are found for the target that deals with related content. - Search results are now described. Search results are typically shown as line items in list format, sorted by relevance score, as shown in
FIG. 17 . The user can click a name to launch its matching item, click a group button to show grouped results, or click an arrow button in frame on left to show Related Subjects. The result display also has the search query edit field, which is optional. - Grouped results show search results organized by match criteria with inner sort by relevance score, as shown in
FIG. 18 . The user can click name to launch matching item, click a list button to show list results, or click an arrow button in frame on left to show related subjects. - The subjects pane shows how the current search terms are used in the index, as shown in
FIG. 19 . The user can click subject name to refine the current search with a matching item, or click an arrow button to hide this pane. - Results line items include:
- 1. Hyperlinks that are used to launch referenced target content.
- 2. A relevance score shown graphically or as text. Result line items are usually sorted by this score value.
- 3. Text describing metadata matches for this target.
- 4. Any other text, links or graphics that help a user understand the relevance of the referenced target.
- The relevance score for each line item is uniquely calculated as a weighted aggregate score based on the metadata elements matched. Base scores for matching metadata elements are added together to create a score for each item.
- Metadata elements are assigned different base scores rooted to their “relevance to the business”. Elements with higher base scores will cause result line items for a target to ultimately have a higher aggregate score.
- “Relevance to business” includes the following criteria:
- 1. Metadata type. Types like prompts, headings, column names can cause the base score to be affected.
- 2. Parent Metadata type. The metadata type of a parent in the index taxonomy hierarchy can cause the base score to be affected.
- 3. Uniqueness of metadata element. Elements that occur more frequently can cause the base score to be affected.
- 4. Usage of a target by other users. When more people use a target report/content, the base score can be affected.
- 5. Subscription to target by other users. When more people subscribe to a target report/content in some manner, the base score can be affected.
- 6. Running/compiling a target. When a target is run or compiled in some manner, the base score can be affected.
- 7. Publish and update date of a target. The date that a target was initially published or changed can cause the base score to be affected.
- In order to provide drill-through functions, the metadata
content management system 10 may provide an HTML user interface that can be launched via URL:, such as http://hostname/crn/xxxxxx?d=<xml/>&u=y&r=g&back=backURL, where d represents “XMLEncoded” XML Drill specification providing source content, u represents Value ‘n’ (default) specifies that hyperlinked results should be opened in the same window. Value ‘y’ specifies that hyperlinked results should be opened in a separate window. r shows grouped results (default is list results), and back provides “URLEncoded” return address. Drill-through functions may also be launched internally in reportingapplications 40. - Examples of drilling and linking functions available using the metadata
content management system 10 are now described. - Metadata drilling is feasible when a match can be found between an item selected in one application and the metadata exported from another report. The metadata
content management system 10 significantly expands the number of possible metadata values. Therefore, the number of potential drill targets is increased. - For example, as shown in
FIG. 7 , the dimension name “Channels” in the Cognos PowerPlay (TM) cube on the left is used to match a column name in the report on the right. This drill example illustrates that by using the metadatacontent management system 10, drill targets can be determined dynamically at run-time. This differs from existing CPM tools where drill targets need to be specified when a report is being authored. - The metadata
content management system 10 provides metadata drilling that allowsapplications 40 to drill in any direction by dynamically determining drill targets at run-time. Metadata searches allow the user to drill in “non-traditional” ways. For example, it is possible to drill from reporting applications. As shown inFIG. 8 , the column name “Channels” in a report on the left is used to match a dimension name (or any star schema element) in a cube on the right. Optionally, the selected “Independent” channel name on the left can be used to link to a category name on the right. - Meta drilling allows content to be linked using metadata only. Report values and measures are not needed. As such, these reports can be efficiently indexed when they are initially published. Indexing needs to be updated only when a report specification changes. No authoring is needed to drill in either direction. An example of using meta data from drilling is as follows. If a user searches for “products”, the user finds all reports that have a database table or OLAP dimension that includes the base (stemmed) word “product”. The user does not necessarily need to find “product” in any of the report data. If the user searches for “tents”, the user can determine that the base (stemmed) word “tent” is a “product” and then finds all reports that have a database table or OLAP dimension that includes the word “product”. The user does not necessarily need to find “tent” or “product” in any of the report data. By allowing content to be linked using metadata only, e.g., using hyperlinks, the
system 10 can connect sources and targets. - Meta drilling often results in multiple drill targets. The metadata
content management system 10 lists the hierarchy of matches and allows a user to pick an appropriate target, as shown inFIG. 9 . The metadatacontent management system 10 lists matching reports by report type and then sorts them by a calculated relevance rating. - The metadata
content management system 10 calculates search relevance by first creating search criteria at the drill source location and then comparing this criteria with the resulting list of matching items. - Source metadata and values are used to create a search specification. For example, when drilling from an OLAP report, the metadata
content management system 10 uses filter information, including the current crosstab dimension and category filters plus dimensions currently being displayed, to create a search specification. - Consider the following drill from a report shown in
FIG. 10 . By drilling from the Mass Marketer category, the metadatacontent management system 10 builds a search specification that includes terms “looks for item Mass Marketer within Channels”. By drilling from the GO Sport Line category, metadatacontent management system 10 builds a search specification that includes terms “looks for item Go Sport Line within Products”. By drilling from the intersection of the Mass Marketer and GO Sport Line categories, metadatacontent management system 10 builds a search specification that includes both of the terms above. - If any additional filters are active in the crosstab, for example Years=2003 or Location=California within USA, then the metadata
content management system 10 adds the related filter terms to the search criteria. - The search is submitted to the metadata
content management system 10. The metadatacontent management system 10 calculates report relevance by comparing the number of matched terms with those found in each result item. - Value drilling and searching allows target report values and measures to be searched. This means that search criteria can include value ranges within metadata constraints. As shown in
FIG. 11 , the Order Number value “160” in a report on the left is used to match same Order Number value in a report on the right that is generated by a different reporting tool. - Value searching allows content to be linked using metadata and associated report values/measures from actual reports. This type of drilling allows extends the number of reports that can be matched at the cost of using more index storage. It is also much slower than searching only metadata. As such, value searching is typically not a good choice for drilling. It is better suited for ad hoc searching where speed is less of a concern.
- Examples shown thus far have concentrated on report-to-report “drilling”. The metadata
content management system 10 also provides full-text searches againstmetadata 21 and values 22. - The metadata
content management system 10 allowsenterprise search tools 30 to be used to expose BI content of supportedreporting applications 40 to report users. External full text search engines use proprietary technology to index content. The metadatacontent management system 10 “pushes” or populates index values to each supported external search engine. Search indexes are maintained by a search server associated with each external search engine. Storage requirements are dependent on the amount of information provided by metadatacontent management system 10. The metadatacontent management system 10 may have configuration options that control how much information is “pushed” to these servers. The metadatacontent management system 10 typically maintains its own index that can be used standalone or in conjunction with the external search engines. The result is fast, relevant and predictable searches. - The metadata
content management system 10 may also allow applications to create lists of “see also” links that show related content. - The metadata
content management system 10 facilities are preferably exposed as WSDL compliant Web Services. - Another example is described using a report generated using Cognos PowerPlay (TM) to describe how a list of search results of the metadata
content management system 10 is produced. In this example, a user wants to find related CPM content. The user initiates his “search” by launching a dynamic “drill” from inside a Cognos PowerPlay (TM). The user presses the metadatacontent management system 10 “Drill” button, or enters terms in the metadatacontent management system 10 “search bar”.FIG. 12 is an example from Cognos PowerPlay 7.3 (TM) showing the “Drill” button. Alternatively, as shown inFIG. 13 , the user can type terms in the text search tool or browser search bar. - The navigation and
search engine 14 accepts request and builds a “Source Context”. This is carried out as follows. The navigation andsearch engine 14 starts the actual index search. When drilling from a reporting application like Cognos PowerPlay (TM), theengine 14 extracts current filter values, view settings and all visible category information to create a “Source Context”.FIG. 14 shows an example of “Drill” from Cognos PowerPlay (TM) and the resultant Source Context. - A “Source Context” is also built for “text” searches. The metadata
content management system 10 uses thetaxonomy management system 50. Individual terms are inspected to see if they match subjects in the metadata content management system subject and term hierarchy, i.e., a “taxonomy”. When a match is found, terms are placed under their respective matching subjects. This allow terms to be treated like categories within OLAP dimensions. - An example of hyper-dimensional report creation is now described. Hyper-Dimensional Navigation can be also used to create reports. This example shows a sales manager who wants to compare 2005 actual sales against his forecast for Projection TVs in US and Canada with detail breakdown by Order Method.
- The sales manager begins at the top level of his enterprise Navigation Bar. It shows the most frequently referenced reporting categories across the entire enterprise. While this navigation bar has literally hundreds of topics, only the top seven items for the “Sales Manager Role” are shown in
FIG. 20 . - As an alternative to typing search terms, e.g., “2004 United States Canada Projection TV”, he simply clicks the down arrow symbol next to the related “hyper dimensional topics” shown. Under Products, he selects Projection TVs. Under Years he selects 2005. Under Distribution Channel he selects Order Method. Under Location he selects both United States and Canada. Search results show matching reports as shown in
FIG. 21 . - He sees that no reports match his search criteria. Now he checks the topic Plan Versus Actual and clicks Create report to answer his question. The metadata
content management system 10 creates a report (using the Sales Manager business role that he selected last time) as shown inFIG. 22 . - Hyper-Dimensional navigation provides a concise definition of report items needed. Business Role also helps narrow the choices without asking too many questions.
- Now report extensions are described. Consider the regional sales manager for GO Sporting Goods. As part of his quarterly expense activity, he needs to know: “Which salespeople exceeded their target last year?”. This user goes to his “usual” set of reports that have been authored for him. Finding nothing that would appear to directly answer his question, he open a report, entitled Sales Revenue by Salesperson. He sees it has some of the information needed as shown in
FIG. 23 . - Verifying that the information shown is correct and appropriate, he selects the columns of interest—in this case: Sales Year, Staff Name and Actual Revenue—and right-clicks to see available options.
- As shown in
FIG. 24 , he selects Show related content to launch the metadatacontent management system 10 and find all related reports. He sees the following search results as shown inFIG. 25 . - Seeing that none of the reports match his needs, he decides to extend his current report by clicking Extend Report with Related Data. He is asked to describe the extra information he wishes to see as shown in
FIG. 26 . He types search terms “Sales Target Exceeded” and clicks Finish. - The component of the metadata
content management system 10 is started to look for models and reporting examples that match the terms “Sales”, “Target”, or “Exceeded”. The metadatacontent management system 10 creates a new report specification from the model and metadata elements found. The report is run and the results are shown inFIG. 27 . The generated report has new columns: “Sales target” and “Exceeds target”. The sales manager got the numbers he was looking for. - Advanced users can launch a report editing/creation tool to fine tune the generated report.
- The metadata
content management system 10 performed this as follows. An element with a display title “Sales Target” is found in the model used most often by this user. It is deemed to be compatible with the original report's list-style format. Similarly, a calculation named “exceeds target” is found in a report that shows both actual revenue and sales target elements. This calculation is also deemed compatible. The metadatacontent management system 10 assembles the new report and displays the answer. - Visual Report Construction is now described. Visual Report Construction is a simple idea: A user views elements in two reports that he wants in a single report. Using a drag-and-drop gesture, he drags elements together. Visually it's a trivial drag-and-drop gesture. Under-the-covers, it is a variant of the Report Extension example shown previously.
- In this example, a regional sales manager for GO Sporting Goods. As part of his quarterly expense activity, he needs to know: “Which salespeople exceeded their target last year?”. Once again he goes to his “usual” set of reports and opens Sales Revenue by Salesperson. He sees it has some of the information needed as shown in
FIG. 28 . He opens another report, and sees it has some of the information he wants. - Dynamic Details are now described. Simple relational list reports are often created as authored drill-through targets in dimensional reporting tools, like Cognos PowerPlay (TM). This example shows how the metadata
content management system 10 can automatically produce these reports without authoring. - In this example, a product manager wants to know who provides customer support for a particular product at different US retailers. She is familiar with Cognos PowerPlay (TM). She navigates to where she thinks the answer will be found as shown in
FIG. 29 . - Realizing that this cube lacks the detail she needs, she clicks to the Cognos PowerPlay (TM) drill-through button. Cognos PowerPlay (TM) recognizes that no authored drill-through actions exist for the selected cells, so it passes the request to the metadata
content management system 10 component. - Having sufficient knowledge about the source cube location from the selected cells: Year=2004, Product=Canyon Mule Climber Backpack, Location=United States, the metadata
content management system 10 asks her for two simple clarifications (which can optionally be defaulted in the future): “What business role are you currently performing?” and “What kind of details are you looking for?”. She sees the page shown inFIG. 30 . From the custom options shown, she selects Product Manager and Staff Details. She clicks Finish to get her answer. - The metadata
content management system 10 creates and runs a detail report that matches criteria in her chosen role using the original selected cell data as query values. She sees the results as shown inFIG. 31 . She verifies the report content by looking at title and column names. She sees her answer. - The metadata
content management system 10 performed this example as follows. Filter information is extracted from the source Cognos PowerPlay (TM) report. It is combined with the caller's selected job role and detail template to create a report with appropriate content. Roles are defined by an administrator and users themselves. Detail templates are created by authoring real reports in Report Studio and optionally creating templates of the metadatacontent management system 10 when saving and updating. - Administration functions and user interface of the metadata
content management system 10 are now described. The administration functions of the metadatacontent management system 10 administration are available only to users with administration capability. - Index Properties are viewed via a link on item property pages, as shown in
FIG. 32 . - The item Type is Folder. The Open Folder link lets the user open the folder in a new window, as shown in
FIG. 33 . Last Indexed shows the date/time of the most recent indexing on the folder itself (contained items will have different dates/times). Re-index initiates indexing of the folder, then all of its content (this is a “deep index” operation—all contained folders, subfolders and items are indexed). Disable Indexing stop further indexing of this folder and all contained folders, subfolders and items. When first disabled, all contained index entries are removed. The indexing user name can be set as “inherited” (default for new items) or to a specific user. XML index data is shown (read only) OK saves changes are returns to the Index tab. Cancel returns without making any changes. -
FIG. 34 shows an example of Item Property Pages. Properties for content items are the same as folders with minor label changes. It may also show a Model Index tab for a metadata model related to this report. -
FIG. 35 shows an example of Model Property Pages. Properties for models items are the same as folders with minor label changes. The Model tab is shown by itself for Model list items. It is shown with an Indexing Options tab for reporting application content items (e.g., Cognos ReportNet (TM) content items). - General administration functions of the metadata
content management system 10 can be launched from an application, such as Cognos Connection, using a link in a lunch bar. This option may be shown only to users with Content Administration capabilities. - The metadata
content management system 10 finds content using simple terms and phrases. Multi-faceted navigation aids refine searches using business terminology related specifically to the customer's enterprise. When content does not t exist, the metadatacontent management system 10 seamlessly creates a “made-to-order” report, with the help of the customer's enterprise business terminology. - Thus, the metadata
content management system 10 allow the users to find relevant BI content using simple term or phrase searches. With the help of multi-faceted navigation aids—using business terminology related specifically to a particular enterprise or organization, the users can refine searches to zero-in on the answer the users need. When content does not exist for particular search criteria, the metadatacontent management system 10 seamlessly creates a “made-to-order” report. Business terminology from the user's enterprise or organization helps the users refine their report to meet their requirements. Report creation is transparent. - The metadata
content management system 10 allows users to search user's business oriented metadata and reporting applications. It provides a flexible “dynamic drilling” family of features, hyper-dimensional navigation, to user's applications. The “context awareness” feature allows seamless navigation to relevant related content. A search-oriented interface allows reports to be found and run easier and more efficiently. It can also use role-based reporting components in reporting applications. The metadatacontent management system 10 finds answers in existing reports and creates custom reports as needed. The metadatacontent management system 10 provides dynamic report construction. The metadatacontent management system 10 creates reports directly from search terms. - The hyper-dimensional navigation let users navigate their cubes and reports at a time, using the dimension metaphor of their reporting applications. Fast searching combined with intrinsic knowledge of key enterprise reporting elements allows the metadata
content management system 10 to build multi-dimensional hierarchies on-the-fly. This dynamic structure is used to show a hyper-dimensional view of all enterprise content that can be navigated like a Cognos PowerPlay (TM) cube. - The taxonomy management system of the present invention may be implemented by any hardware, software or a combination of hardware and software having the above described functions. The software code, instructions and/or statements, either in its entirety or a part thereof, may be stored in a computer readable memory. Further, a computer data signal representing the software code, instructions and/or statements may be embedded in a carrier wave may be transmitted via a communication network. Such a computer readable memory and a computer data signal and/or its carrier are also within the scope of the present invention, as well as the hardware, software and the combination thereof.
- While particular embodiments of the present invention have been shown and described, changes and modifications may be made to such embodiments without departing from the scope of the invention. For example, the elements of the taxonomy management system are described separately, however, two or more elements may be provided as a single element, or one or more elements may be shared with other components in one or more computer systems.
Claims (42)
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2514165 | 2005-07-29 | ||
CA2,514,165 | 2005-07-29 | ||
CA002514165A CA2514165A1 (en) | 2005-07-29 | 2005-07-29 | Metadata content management and searching system and method |
CA2545232 | 2006-04-28 | ||
CA2,545,232 | 2006-04-28 | ||
CA002545232A CA2545232A1 (en) | 2005-07-29 | 2006-04-28 | Method and system for creating a taxonomy from business-oriented metadata content |
Publications (2)
Publication Number | Publication Date |
---|---|
US20070055680A1 true US20070055680A1 (en) | 2007-03-08 |
US7885918B2 US7885918B2 (en) | 2011-02-08 |
Family
ID=37696182
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/494,974 Active 2027-02-21 US7885918B2 (en) | 2005-07-29 | 2006-07-28 | Creating a taxonomy from business-oriented metadata content |
Country Status (2)
Country | Link |
---|---|
US (1) | US7885918B2 (en) |
CA (1) | CA2545232A1 (en) |
Cited By (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070260592A1 (en) * | 2006-05-03 | 2007-11-08 | International Business Machines Corporation | Hierarchical storage management of metadata |
US20080133473A1 (en) * | 2006-11-30 | 2008-06-05 | Broder Andrei Z | Efficient multifaceted search in information retrieval systems |
US20080168083A1 (en) * | 2007-01-10 | 2008-07-10 | Microsoft Corporation | Taxonomy object modeling |
US20080177692A1 (en) * | 2007-01-24 | 2008-07-24 | Microsoft Corporation | Using virtual repository items for customized display |
US20080184277A1 (en) * | 2007-01-26 | 2008-07-31 | Microsoft Corporation | Systems management policy validation, distribution and enactment |
US20080184201A1 (en) * | 2007-01-26 | 2008-07-31 | Microsoft Corporation | Universal schema for representing management policy |
US20080201355A1 (en) * | 2007-02-16 | 2008-08-21 | Microsoft Corporation | Easily queriable software repositories |
US20080201330A1 (en) * | 2007-02-16 | 2008-08-21 | Microsoft Corporation | Software repositories |
US20080222515A1 (en) * | 2007-02-26 | 2008-09-11 | Microsoft Corporation | Parameterized types and elements in xml schema |
US20080263085A1 (en) * | 2007-04-20 | 2008-10-23 | Microsoft Corporation | Describing expected entity relationships in a model |
US20080288510A1 (en) * | 2007-05-16 | 2008-11-20 | Microsoft Corporation | Data model for a common language |
US20080301111A1 (en) * | 2007-05-29 | 2008-12-04 | Cognos Incorporated | Method and system for providing ranked search results |
US20090006367A1 (en) * | 2007-06-28 | 2009-01-01 | Microsoft Corporation | Search-based filtering for property grids |
US20090089276A1 (en) * | 2007-10-02 | 2009-04-02 | International Business Machines Corporation | Systems, methods and computer products for a monitoring context generator |
US20090177668A1 (en) * | 2008-01-08 | 2009-07-09 | International Business Machines Corporation | Term-Driven Records File Plan and Thesaurus Design |
US20090276834A1 (en) * | 2008-04-30 | 2009-11-05 | Microsoft Corporation | Securing resource stores with claims-based security |
US20100010973A1 (en) * | 2008-07-09 | 2010-01-14 | International Business Machines Corporation | Vector Space Lightweight Directory Access Protocol Data Search |
US20100042615A1 (en) * | 2008-08-12 | 2010-02-18 | Peter Rinearson | Systems and methods for aggregating content on a user-content driven website |
US20100235353A1 (en) * | 2009-03-10 | 2010-09-16 | Warnock Christopher M | Method and Apparatus for Real Time Text Analysis and Text Navigation |
US20100274782A1 (en) * | 2007-10-08 | 2010-10-28 | Koninklijke Philips Electronics N.V. | Generating metadata for association with a collection of content items |
US20100325170A1 (en) * | 2009-06-22 | 2010-12-23 | Microsoft Corporation | Partitioning modeling platform data |
US7873670B2 (en) | 2005-07-29 | 2011-01-18 | International Business Machines Corporation | Method and system for managing exemplar terms database for business-oriented metadata content |
US20110113052A1 (en) * | 2007-06-08 | 2011-05-12 | Hoernkvist John | Query result iteration for multiple queries |
US20110238653A1 (en) * | 2010-03-25 | 2011-09-29 | Microsoft Corporation | Parsing and indexing dynamic reports |
US8326806B1 (en) * | 2007-05-11 | 2012-12-04 | Google Inc. | Content item parameter filter |
US20130007605A1 (en) * | 2011-06-29 | 2013-01-03 | Mihail-Cristian Ivascu I | Methods and Apparatus for Generating Content Variations |
US20130013597A1 (en) * | 2011-06-17 | 2013-01-10 | Alibaba Group Holding Limited | Processing Repetitive Data |
US20130073520A1 (en) * | 2011-09-20 | 2013-03-21 | Oracle International Corporation | Mechanism for updating olap system structure and oltp system structure structure |
US20130166573A1 (en) * | 2011-12-27 | 2013-06-27 | Business Objects Software Ltd. | Managing Business Objects Data Sources |
US8751505B2 (en) * | 2012-03-11 | 2014-06-10 | International Business Machines Corporation | Indexing and searching entity-relationship data |
US20140236958A1 (en) * | 2011-12-15 | 2014-08-21 | Robert L. Vaughn | Evolving metadata |
US20140372412A1 (en) * | 2013-06-14 | 2014-12-18 | Microsoft Corporation | Dynamic filtering search results using augmented indexes |
US20150293973A1 (en) * | 2014-04-15 | 2015-10-15 | Bank Of America Corporation | Managing electronic records |
US20170364333A1 (en) * | 2016-06-20 | 2017-12-21 | Austin Consultants Limited | System for automated code calculation and an automated code calculation method |
US11100152B2 (en) | 2017-08-17 | 2021-08-24 | Target Brands, Inc. | Data portal |
CN113392227A (en) * | 2021-05-31 | 2021-09-14 | 交控科技股份有限公司 | Metadata knowledge map engine system facing rail transit field |
US11243910B2 (en) * | 2019-08-19 | 2022-02-08 | Uber Technologies, Inc. | Indexing documents in a nested hierarchy of directories |
CN114817149A (en) * | 2022-03-23 | 2022-07-29 | 建信融通有限责任公司 | Metadata management system |
CN117474092A (en) * | 2023-12-21 | 2024-01-30 | 巢湖学院 | Enterprise knowledge base construction system based on AIGC |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7769579B2 (en) * | 2005-05-31 | 2010-08-03 | Google Inc. | Learning facts from semi-structured text |
US7587387B2 (en) | 2005-03-31 | 2009-09-08 | Google Inc. | User interface for facts query engine with snippets from information sources that include query terms and answer terms |
US9208229B2 (en) * | 2005-03-31 | 2015-12-08 | Google Inc. | Anchor text summarization for corroboration |
US8682913B1 (en) | 2005-03-31 | 2014-03-25 | Google Inc. | Corroborating facts extracted from multiple sources |
US8996470B1 (en) | 2005-05-31 | 2015-03-31 | Google Inc. | System for ensuring the internal consistency of a fact repository |
US7831545B1 (en) * | 2005-05-31 | 2010-11-09 | Google Inc. | Identifying the unifying subject of a set of facts |
US8260785B2 (en) | 2006-02-17 | 2012-09-04 | Google Inc. | Automatic object reference identification and linking in a browseable fact repository |
US8122026B1 (en) | 2006-10-20 | 2012-02-21 | Google Inc. | Finding and disambiguating references to entities on web pages |
US8347202B1 (en) | 2007-03-14 | 2013-01-01 | Google Inc. | Determining geographic locations for place names in a fact repository |
US8032858B2 (en) * | 2007-08-28 | 2011-10-04 | International Business Machines Corporation | Method and system for navigationally displaying HTTP session entry and exit points |
US8607197B2 (en) * | 2007-08-28 | 2013-12-10 | International Business Machines Corporation | Displaying HTTP session entry and exit points |
US8812435B1 (en) | 2007-11-16 | 2014-08-19 | Google Inc. | Learning objects and facts from documents |
US9754230B2 (en) * | 2010-11-29 | 2017-09-05 | International Business Machines Corporation | Deployment of a business intelligence (BI) meta model and a BI report specification for use in presenting data mining and predictive insights using BI tools |
US8874435B2 (en) * | 2012-04-17 | 2014-10-28 | International Business Machines Corporation | Automated glossary creation |
US8661049B2 (en) | 2012-07-09 | 2014-02-25 | ZenDesk, Inc. | Weight-based stemming for improving search quality |
US9009615B2 (en) | 2012-10-17 | 2015-04-14 | Microsoft Technology Licensing, Llc | Portal for submitting business metadata for services |
US11907662B2 (en) | 2017-12-29 | 2024-02-20 | Robert Bosch Gmbh | System and method for domain-independent terminology linking |
CN110865990B (en) * | 2019-10-12 | 2024-04-30 | 中国平安财产保险股份有限公司 | Non-real-time data exchange method, system and computer equipment |
US11308665B2 (en) | 2020-01-30 | 2022-04-19 | International Business Machines Corporation | Automatic generation of user onboarding tours for business analytic applications |
US12045290B2 (en) | 2020-02-27 | 2024-07-23 | International Business Machines Corporation | Recommending template of business analytics applications based on user's dataset |
US12197463B2 (en) | 2020-02-27 | 2025-01-14 | International Business Machines Corporation | Creating descriptors for business analytics applications |
CN119046290A (en) * | 2024-10-30 | 2024-11-29 | 浙江五疆科技发展有限公司 | Data mapping method and system for generating report form based on big data |
Citations (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5940821A (en) * | 1997-05-21 | 1999-08-17 | Oracle Corporation | Information presentation in a knowledge base search and retrieval system |
US5963940A (en) * | 1995-08-16 | 1999-10-05 | Syracuse University | Natural language information retrieval system and method |
US6026388A (en) * | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
US6038560A (en) * | 1997-05-21 | 2000-03-14 | Oracle Corporation | Concept knowledge base search and retrieval system |
US6161084A (en) * | 1997-03-07 | 2000-12-12 | Microsoft Corporation | Information retrieval utilizing semantic representation of text by identifying hypernyms and indexing multiple tokenized semantic structures to a same passage of text |
US20010054034A1 (en) * | 2000-05-04 | 2001-12-20 | Andreas Arning | Using an index to access a subject multi-dimensional database |
US6363378B1 (en) * | 1998-10-13 | 2002-03-26 | Oracle Corporation | Ranking of query feedback terms in an information retrieval system |
US6405190B1 (en) * | 1999-03-16 | 2002-06-11 | Oracle Corporation | Free format query processing in an information search and retrieval system |
US6460034B1 (en) * | 1997-05-21 | 2002-10-01 | Oracle Corporation | Document knowledge base research and retrieval system |
US6510425B1 (en) * | 1998-02-25 | 2003-01-21 | Hitachi, Ltd. | Document search method for registering documents, generating a structure index with elements having position of occurrence in documents represented by meta-nodes |
US6526440B1 (en) * | 2001-01-30 | 2003-02-25 | Google, Inc. | Ranking search results by reranking the results based on local inter-connectivity |
US20030110158A1 (en) * | 2001-11-13 | 2003-06-12 | Seals Michael P. | Search engine visibility system |
US20040148278A1 (en) * | 2003-01-22 | 2004-07-29 | Amir Milo | System and method for providing content warehouse |
US20040230572A1 (en) * | 2001-06-22 | 2004-11-18 | Nosa Omoigui | System and method for semantic knowledge retrieval, management, capture, sharing, discovery, delivery and presentation |
US6842761B2 (en) * | 2000-11-21 | 2005-01-11 | America Online, Inc. | Full-text relevancy ranking |
US20050165718A1 (en) * | 2004-01-26 | 2005-07-28 | Fontoura Marcus F. | Pipelined architecture for global analysis and index building |
US6924828B1 (en) * | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
US20050192955A1 (en) * | 2004-03-01 | 2005-09-01 | International Business Machines Corporation | Organizing related search results |
US6954750B2 (en) * | 2000-10-10 | 2005-10-11 | Content Analyst Company, Llc | Method and system for facilitating the refinement of data queries |
US20050246322A1 (en) * | 2004-04-30 | 2005-11-03 | Shanmugasundaram Ravikumar | On the role of market economics in ranking search results |
US6963867B2 (en) * | 1999-12-08 | 2005-11-08 | A9.Com, Inc. | Search query processing to provide category-ranked presentation of search results |
US20060053151A1 (en) * | 2004-09-03 | 2006-03-09 | Bio Wisdom Limited | Multi-relational ontology structure |
US20060155685A1 (en) * | 2005-01-13 | 2006-07-13 | International Business Machines Corporation | System and method for exposing internal search indices to Internet search engines |
US20060212461A1 (en) * | 2005-03-21 | 2006-09-21 | Meysman David J | System for organizing a plurality of data sources into a plurality of taxonomies |
US20060265364A1 (en) * | 2000-03-09 | 2006-11-23 | Keith Robert O Jr | Method and apparatus for organizing data by overlaying a searchable database with a directory tree structure |
US7231405B2 (en) * | 2004-05-08 | 2007-06-12 | Doug Norman, Interchange Corp. | Method and apparatus of indexing web pages of a web site for geographical searchine based on user location |
US7251637B1 (en) * | 1993-09-20 | 2007-07-31 | Fair Isaac Corporation | Context vector generation and retrieval |
US7266548B2 (en) * | 2004-06-30 | 2007-09-04 | Microsoft Corporation | Automated taxonomy generation |
US7272610B2 (en) * | 2001-11-02 | 2007-09-18 | Medrecon, Ltd. | Knowledge management system |
US20080243830A1 (en) * | 2007-03-30 | 2008-10-02 | Fatdoor, Inc. | User suggested ordering to influence search result ranking |
US7472113B1 (en) * | 2004-01-26 | 2008-12-30 | Microsoft Corporation | Query preprocessing and pipelining |
US7571157B2 (en) * | 2004-12-29 | 2009-08-04 | Aol Llc | Filtering search results |
US7689615B2 (en) * | 2005-02-25 | 2010-03-30 | Microsoft Corporation | Ranking results using multiple nested ranking |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2545237A1 (en) | 2005-07-29 | 2007-01-29 | Cognos Incorporated | Method and system for managing exemplar terms database for business-oriented metadata content |
US20080027971A1 (en) | 2006-07-28 | 2008-01-31 | Craig Statchuk | Method and system for populating an index corpus to a search engine |
-
2006
- 2006-04-28 CA CA002545232A patent/CA2545232A1/en not_active Abandoned
- 2006-07-28 US US11/494,974 patent/US7885918B2/en active Active
Patent Citations (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7251637B1 (en) * | 1993-09-20 | 2007-07-31 | Fair Isaac Corporation | Context vector generation and retrieval |
US5963940A (en) * | 1995-08-16 | 1999-10-05 | Syracuse University | Natural language information retrieval system and method |
US6026388A (en) * | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
US6161084A (en) * | 1997-03-07 | 2000-12-12 | Microsoft Corporation | Information retrieval utilizing semantic representation of text by identifying hypernyms and indexing multiple tokenized semantic structures to a same passage of text |
US5940821A (en) * | 1997-05-21 | 1999-08-17 | Oracle Corporation | Information presentation in a knowledge base search and retrieval system |
US6038560A (en) * | 1997-05-21 | 2000-03-14 | Oracle Corporation | Concept knowledge base search and retrieval system |
US6460034B1 (en) * | 1997-05-21 | 2002-10-01 | Oracle Corporation | Document knowledge base research and retrieval system |
US6510425B1 (en) * | 1998-02-25 | 2003-01-21 | Hitachi, Ltd. | Document search method for registering documents, generating a structure index with elements having position of occurrence in documents represented by meta-nodes |
US6363378B1 (en) * | 1998-10-13 | 2002-03-26 | Oracle Corporation | Ranking of query feedback terms in an information retrieval system |
US6405190B1 (en) * | 1999-03-16 | 2002-06-11 | Oracle Corporation | Free format query processing in an information search and retrieval system |
US6924828B1 (en) * | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
US6963867B2 (en) * | 1999-12-08 | 2005-11-08 | A9.Com, Inc. | Search query processing to provide category-ranked presentation of search results |
US20060265364A1 (en) * | 2000-03-09 | 2006-11-23 | Keith Robert O Jr | Method and apparatus for organizing data by overlaying a searchable database with a directory tree structure |
US20010054034A1 (en) * | 2000-05-04 | 2001-12-20 | Andreas Arning | Using an index to access a subject multi-dimensional database |
US6954750B2 (en) * | 2000-10-10 | 2005-10-11 | Content Analyst Company, Llc | Method and system for facilitating the refinement of data queries |
US6842761B2 (en) * | 2000-11-21 | 2005-01-11 | America Online, Inc. | Full-text relevancy ranking |
US6526440B1 (en) * | 2001-01-30 | 2003-02-25 | Google, Inc. | Ranking search results by reranking the results based on local inter-connectivity |
US20040230572A1 (en) * | 2001-06-22 | 2004-11-18 | Nosa Omoigui | System and method for semantic knowledge retrieval, management, capture, sharing, discovery, delivery and presentation |
US7272610B2 (en) * | 2001-11-02 | 2007-09-18 | Medrecon, Ltd. | Knowledge management system |
US20030110158A1 (en) * | 2001-11-13 | 2003-06-12 | Seals Michael P. | Search engine visibility system |
US20040148278A1 (en) * | 2003-01-22 | 2004-07-29 | Amir Milo | System and method for providing content warehouse |
US20050165718A1 (en) * | 2004-01-26 | 2005-07-28 | Fontoura Marcus F. | Pipelined architecture for global analysis and index building |
US7472113B1 (en) * | 2004-01-26 | 2008-12-30 | Microsoft Corporation | Query preprocessing and pipelining |
US20050192955A1 (en) * | 2004-03-01 | 2005-09-01 | International Business Machines Corporation | Organizing related search results |
US20050246322A1 (en) * | 2004-04-30 | 2005-11-03 | Shanmugasundaram Ravikumar | On the role of market economics in ranking search results |
US7231405B2 (en) * | 2004-05-08 | 2007-06-12 | Doug Norman, Interchange Corp. | Method and apparatus of indexing web pages of a web site for geographical searchine based on user location |
US7266548B2 (en) * | 2004-06-30 | 2007-09-04 | Microsoft Corporation | Automated taxonomy generation |
US20060053151A1 (en) * | 2004-09-03 | 2006-03-09 | Bio Wisdom Limited | Multi-relational ontology structure |
US7571157B2 (en) * | 2004-12-29 | 2009-08-04 | Aol Llc | Filtering search results |
US20060155685A1 (en) * | 2005-01-13 | 2006-07-13 | International Business Machines Corporation | System and method for exposing internal search indices to Internet search engines |
US7689615B2 (en) * | 2005-02-25 | 2010-03-30 | Microsoft Corporation | Ranking results using multiple nested ranking |
US20060212461A1 (en) * | 2005-03-21 | 2006-09-21 | Meysman David J | System for organizing a plurality of data sources into a plurality of taxonomies |
US20080243830A1 (en) * | 2007-03-30 | 2008-10-02 | Fatdoor, Inc. | User suggested ordering to influence search result ranking |
Cited By (66)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7873670B2 (en) | 2005-07-29 | 2011-01-18 | International Business Machines Corporation | Method and system for managing exemplar terms database for business-oriented metadata content |
US7574435B2 (en) * | 2006-05-03 | 2009-08-11 | International Business Machines Corporation | Hierarchical storage management of metadata |
US20070260592A1 (en) * | 2006-05-03 | 2007-11-08 | International Business Machines Corporation | Hierarchical storage management of metadata |
US20080222117A1 (en) * | 2006-11-30 | 2008-09-11 | Broder Andrei Z | Efficient multifaceted search in information retrieval systems |
US20080133473A1 (en) * | 2006-11-30 | 2008-06-05 | Broder Andrei Z | Efficient multifaceted search in information retrieval systems |
US8032532B2 (en) | 2006-11-30 | 2011-10-04 | International Business Machines Corporation | Efficient multifaceted search in information retrieval systems |
US7496568B2 (en) * | 2006-11-30 | 2009-02-24 | International Business Machines Corporation | Efficient multifaceted search in information retrieval systems |
US7689625B2 (en) * | 2007-01-10 | 2010-03-30 | Microsoft Corporation | Taxonomy object modeling |
US20080168083A1 (en) * | 2007-01-10 | 2008-07-10 | Microsoft Corporation | Taxonomy object modeling |
US20080177692A1 (en) * | 2007-01-24 | 2008-07-24 | Microsoft Corporation | Using virtual repository items for customized display |
US8190661B2 (en) * | 2007-01-24 | 2012-05-29 | Microsoft Corporation | Using virtual repository items for customized display |
US20080184201A1 (en) * | 2007-01-26 | 2008-07-31 | Microsoft Corporation | Universal schema for representing management policy |
US20080184277A1 (en) * | 2007-01-26 | 2008-07-31 | Microsoft Corporation | Systems management policy validation, distribution and enactment |
US8104080B2 (en) | 2007-01-26 | 2012-01-24 | Microsoft Corporation | Universal schema for representing management policy |
US8392464B2 (en) | 2007-02-16 | 2013-03-05 | Microsoft Corporation | Easily queriable software repositories |
US20080201355A1 (en) * | 2007-02-16 | 2008-08-21 | Microsoft Corporation | Easily queriable software repositories |
US8145673B2 (en) | 2007-02-16 | 2012-03-27 | Microsoft Corporation | Easily queriable software repositories |
US20080201330A1 (en) * | 2007-02-16 | 2008-08-21 | Microsoft Corporation | Software repositories |
US20080222515A1 (en) * | 2007-02-26 | 2008-09-11 | Microsoft Corporation | Parameterized types and elements in xml schema |
US20080263085A1 (en) * | 2007-04-20 | 2008-10-23 | Microsoft Corporation | Describing expected entity relationships in a model |
US7765241B2 (en) | 2007-04-20 | 2010-07-27 | Microsoft Corporation | Describing expected entity relationships in a model |
US8326806B1 (en) * | 2007-05-11 | 2012-12-04 | Google Inc. | Content item parameter filter |
US7831625B2 (en) | 2007-05-16 | 2010-11-09 | Microsoft Corporation | Data model for a common language |
US20080288510A1 (en) * | 2007-05-16 | 2008-11-20 | Microsoft Corporation | Data model for a common language |
US7792826B2 (en) | 2007-05-29 | 2010-09-07 | International Business Machines Corporation | Method and system for providing ranked search results |
US20080301111A1 (en) * | 2007-05-29 | 2008-12-04 | Cognos Incorporated | Method and system for providing ranked search results |
US20110113052A1 (en) * | 2007-06-08 | 2011-05-12 | Hoernkvist John | Query result iteration for multiple queries |
US20110099183A1 (en) * | 2007-06-28 | 2011-04-28 | Microsoft Corporation | Search-based filtering for property grids |
US20090006367A1 (en) * | 2007-06-28 | 2009-01-01 | Microsoft Corporation | Search-based filtering for property grids |
US7890523B2 (en) * | 2007-06-28 | 2011-02-15 | Microsoft Corporation | Search-based filtering for property grids |
US20090089276A1 (en) * | 2007-10-02 | 2009-04-02 | International Business Machines Corporation | Systems, methods and computer products for a monitoring context generator |
US8140547B2 (en) * | 2007-10-02 | 2012-03-20 | International Business Machines Corporation | Systems, methods and computer products for a monitoring context generator |
US8321456B2 (en) * | 2007-10-08 | 2012-11-27 | Koninklijke Philips Electronics N.V. | Generating metadata for association with a collection of content items |
US20100274782A1 (en) * | 2007-10-08 | 2010-10-28 | Koninklijke Philips Electronics N.V. | Generating metadata for association with a collection of content items |
US10733223B2 (en) * | 2008-01-08 | 2020-08-04 | International Business Machines Corporation | Term-driven records file plan and thesaurus design |
US20090177668A1 (en) * | 2008-01-08 | 2009-07-09 | International Business Machines Corporation | Term-Driven Records File Plan and Thesaurus Design |
US8453217B2 (en) | 2008-04-30 | 2013-05-28 | Microsoft Corporation | Securing resource stores with claims-based security |
US8095963B2 (en) | 2008-04-30 | 2012-01-10 | Microsoft Corporation | Securing resource stores with claims-based security |
US20090276834A1 (en) * | 2008-04-30 | 2009-11-05 | Microsoft Corporation | Securing resource stores with claims-based security |
US20100010973A1 (en) * | 2008-07-09 | 2010-01-14 | International Business Machines Corporation | Vector Space Lightweight Directory Access Protocol Data Search |
US8918383B2 (en) * | 2008-07-09 | 2014-12-23 | International Business Machines Corporation | Vector space lightweight directory access protocol data search |
US20100042615A1 (en) * | 2008-08-12 | 2010-02-18 | Peter Rinearson | Systems and methods for aggregating content on a user-content driven website |
US20100235353A1 (en) * | 2009-03-10 | 2010-09-16 | Warnock Christopher M | Method and Apparatus for Real Time Text Analysis and Text Navigation |
US8280878B2 (en) | 2009-03-10 | 2012-10-02 | Ebrary, Inc. | Method and apparatus for real time text analysis and text navigation |
US10061827B2 (en) | 2009-06-15 | 2018-08-28 | Oracle International Corporation | Mechanism for synchronizing OLAP system structure and OLTP system structure |
US8095571B2 (en) | 2009-06-22 | 2012-01-10 | Microsoft Corporation | Partitioning modeling platform data |
US20100325170A1 (en) * | 2009-06-22 | 2010-12-23 | Microsoft Corporation | Partitioning modeling platform data |
US20110238653A1 (en) * | 2010-03-25 | 2011-09-29 | Microsoft Corporation | Parsing and indexing dynamic reports |
US20130013597A1 (en) * | 2011-06-17 | 2013-01-10 | Alibaba Group Holding Limited | Processing Repetitive Data |
US9424234B2 (en) * | 2011-06-29 | 2016-08-23 | Adobe Systems Incorporated | Methods and apparatus for automatically creating variations of original content based on searching for alternative content objects based on characteristics of the original content |
US20130007605A1 (en) * | 2011-06-29 | 2013-01-03 | Mihail-Cristian Ivascu I | Methods and Apparatus for Generating Content Variations |
US20130073520A1 (en) * | 2011-09-20 | 2013-03-21 | Oracle International Corporation | Mechanism for updating olap system structure and oltp system structure structure |
US9959330B2 (en) * | 2011-09-20 | 2018-05-01 | Oracle International Corporation | Mechanism for updating OLAP system structure and OLTP system structure |
US20140236958A1 (en) * | 2011-12-15 | 2014-08-21 | Robert L. Vaughn | Evolving metadata |
US9092478B2 (en) * | 2011-12-27 | 2015-07-28 | Sap Se | Managing business objects data sources |
US20130166573A1 (en) * | 2011-12-27 | 2013-06-27 | Business Objects Software Ltd. | Managing Business Objects Data Sources |
US8751505B2 (en) * | 2012-03-11 | 2014-06-10 | International Business Machines Corporation | Indexing and searching entity-relationship data |
US20140372412A1 (en) * | 2013-06-14 | 2014-12-18 | Microsoft Corporation | Dynamic filtering search results using augmented indexes |
US20150293973A1 (en) * | 2014-04-15 | 2015-10-15 | Bank Of America Corporation | Managing electronic records |
US20170364333A1 (en) * | 2016-06-20 | 2017-12-21 | Austin Consultants Limited | System for automated code calculation and an automated code calculation method |
US11100152B2 (en) | 2017-08-17 | 2021-08-24 | Target Brands, Inc. | Data portal |
US11243910B2 (en) * | 2019-08-19 | 2022-02-08 | Uber Technologies, Inc. | Indexing documents in a nested hierarchy of directories |
US11853259B2 (en) | 2019-08-19 | 2023-12-26 | Uber Technologies, Inc. | Indexing documents in a nested hierarchy of directories |
CN113392227A (en) * | 2021-05-31 | 2021-09-14 | 交控科技股份有限公司 | Metadata knowledge map engine system facing rail transit field |
CN114817149A (en) * | 2022-03-23 | 2022-07-29 | 建信融通有限责任公司 | Metadata management system |
CN117474092A (en) * | 2023-12-21 | 2024-01-30 | 巢湖学院 | Enterprise knowledge base construction system based on AIGC |
Also Published As
Publication number | Publication date |
---|---|
CA2545232A1 (en) | 2007-01-29 |
US7885918B2 (en) | 2011-02-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7885918B2 (en) | Creating a taxonomy from business-oriented metadata content | |
US7873670B2 (en) | Method and system for managing exemplar terms database for business-oriented metadata content | |
US9727628B2 (en) | System and method of applying globally unique identifiers to relate distributed data sources | |
KR101691247B1 (en) | Semantic trading floor | |
US9305100B2 (en) | Object oriented data and metadata based search | |
US6598046B1 (en) | System and method for retrieving documents responsive to a given user's role and scenario | |
US20050149538A1 (en) | Systems and methods for creating and publishing relational data bases | |
Sinha et al. | Magnet: Supporting navigation in semistructured data environments | |
US7949674B2 (en) | Integration of documents with OLAP using search | |
Zaıane | Resource and knowledge discovery from the internet and multimedia repositories | |
WO2005119518A1 (en) | Defining a data dependency path through a body of related data | |
US20120117048A1 (en) | Information-Retrieval Systems, Methods and Software with Content Relevancy Enhancements | |
Crescenzi et al. | Crowdsourcing for data management | |
Menendez et al. | Novel node importance measures to improve keyword search over rdf graphs | |
Azad et al. | Lod search engine: A semantic search over linked data | |
CN101866340A (en) | Online retrieval and intelligent analysis method and system of product information | |
Haruechaiyasak et al. | A web-page recommender system via a data mining framework and the semantic web concept | |
CA2514165A1 (en) | Metadata content management and searching system and method | |
Stefanidis et al. | Keyword search on RDF graphs: it is more than just searching for keywords | |
Agarwal et al. | Semantic methods and tools for information portals | |
Blumberg et al. | More than search | |
Eberius et al. | Exploratory ad-hoc analytics for big data | |
Umagandhi et al. | Search Query Recommendations using Hybrid User Profile with Query Logs | |
Petkovic | Full-text search extensions for JSON documents: Design goals and implementations | |
Telang et al. | Information Integration across Heterogeneous Domains: Current Scenario, Challenges and the InfoMosaic Approach |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: COGNOS INCORPORATED, CANADA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:STATCHUK, CRAIG;REEL/FRAME:018511/0716 Effective date: 20061018 |
|
AS | Assignment |
Owner name: COGNOS ULC, CANADA Free format text: CERTIFICATE OF AMALGAMATION;ASSIGNOR:COGNOS INCORPORATED;REEL/FRAME:021387/0813 Effective date: 20080201 Owner name: IBM INTERNATIONAL GROUP BV, NETHERLANDS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:COGNOS ULC;REEL/FRAME:021387/0837 Effective date: 20080703 Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:IBM INTERNATIONAL GROUP BV;REEL/FRAME:021398/0001 Effective date: 20080714 Owner name: COGNOS ULC,CANADA Free format text: CERTIFICATE OF AMALGAMATION;ASSIGNOR:COGNOS INCORPORATED;REEL/FRAME:021387/0813 Effective date: 20080201 Owner name: IBM INTERNATIONAL GROUP BV,NETHERLANDS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:COGNOS ULC;REEL/FRAME:021387/0837 Effective date: 20080703 Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION,NEW YO Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:IBM INTERNATIONAL GROUP BV;REEL/FRAME:021398/0001 Effective date: 20080714 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
REMI | Maintenance fee reminder mailed | ||
FPAY | Fee payment |
Year of fee payment: 4 |
|
SULP | Surcharge for late payment | ||
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552) Year of fee payment: 8 |
|
AS | Assignment |
Owner name: DAEDALUS GROUP LLC, NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:051032/0784 Effective date: 20190930 |
|
AS | Assignment |
Owner name: DAEDALUS GROUP, LLC, NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:051710/0445 Effective date: 20191230 |
|
AS | Assignment |
Owner name: DAEDALUS BLUE LLC, NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:DAEDALUS GROUP, LLC;REEL/FRAME:051737/0191 Effective date: 20200128 |
|
AS | Assignment |
Owner name: GINEGAR LLC, NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:DAEDALUS GROUP LLC;REEL/FRAME:052874/0293 Effective date: 20200602 |
|
AS | Assignment |
Owner name: GINEGAR LLC, MARYLAND Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE THE NAME OF THE ASSIGNOR AND THE ADDRESS OF THE ASSIGNEE PREVIOUSLY RECORDED ON REEL 052874 FRAME 0293. ASSIGNOR(S) HEREBY CONFIRMS THE "WHEREAS, DAEDALUS GROUP LLC ("ASSIGNOR")...";ASSIGNOR:DAEDALUS BLUE LLC;REEL/FRAME:053644/0682 Effective date: 20200624 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 12 |
|
AS | Assignment |
Owner name: K.MIZRA LLC, FLORIDA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:GINEGAR LLC;REEL/FRAME:067706/0540 Effective date: 20240612 |