US11776533B2 - Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement - Google Patents
Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement Download PDFInfo
- Publication number
- US11776533B2 US11776533B2 US17/225,997 US202117225997A US11776533B2 US 11776533 B2 US11776533 B2 US 11776533B2 US 202117225997 A US202117225997 A US 202117225997A US 11776533 B2 US11776533 B2 US 11776533B2
- Authority
- US
- United States
- Prior art keywords
- interpret
- statement
- pattern
- block
- token
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/30—Creation or generation of source code
- G06F8/31—Programming languages or programming paradigms
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4938—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals comprising a voice browser which renders and interprets, e.g. VoiceXML
Definitions
- the technology disclosed relates to natural language understanding (NLU) and to analysis of meaning of text and spoken phrases.
- NLU natural language understanding
- the technology disclosed relates to new programming constructs and tools and processing patterns that implement those new programming constructs.
- FIG. 8 This figure is part of a simple calculator example written in the grammar definition language known as Grammatical Framework (“GF”).
- GF Grammatical Framework
- An abstract syntax 811 provides a foundation for a concrete syntax 812 , 813 , as explained in Aarne Ranta, Grammatical Framework: Programming with Multilingual Grammars, Chapter 8 (2011).
- Obscure functions including lön 814 , lin 815 , and oper 816 are part of GF's expression of parsing input.
- Many layers of special purpose programming and linguistic expertise are required of those who program vertical applications using NLU programming approaches such as GF.
- grammar-based NLU frameworks such as those by Nuance, are built around defining a fixed set of slots that represent the expected information supplied in utterances within a vertical application and determining how each phrase in the grammar results in filling those slots. See, e.g., Grammar Developer's Guide, Nuance Speech Recognition System Version 8.5, Chapter 4 (2003).
- Use of grammar slots is consistent with the W3C standard known as Voice XML. See, e.g., Scott McGlashan et al., Voice Extensible Markup Language (VoiceXML) 3.0, section 6.10 Field Module, Table 41 (8 th Working Draft December 2010). Version 2.0 of the VoiceXML recommendation has been implemented in BeVocal's Nuance Café, with grammar slots.
- a vertical application or vertical market application is software defined by requirements for a single or narrowly defined market. It contrasts with horizontal application.
- An example provided by Wikipedia of a vertical application is software that helps doctors manage patient records, insurance billing, etc. Software like this can be purchased off-the-shelf and used as-is, or the doctor can hire a consultant to modify the software to accommodate the needs of the doctor. The software is specifically designed to be used by any doctor's office, but would not be useful to other businesses.
- NLU natural language understanding
- FIG. 1 is a high-level block diagram of a method and system that can be used to implement the technology disclosed.
- FIG. 2 is a high-level sequence diagram of actions and associated components from application development through runtime.
- FIGS. 3 and 4 provide additional detail regarding the application parser number and the phrase interpreter.
- FIG. 5 is a block diagram of the programming language constructs interpret-block and interpret-statement.
- FIG. 6 is a block diagram with more detail of an implementation of the interpret-statement.
- FIG. 7 includes FIGS. 7 A- 7 E .
- FIG. 7 A shows a table with three weighted patterns.
- FIG. 7 B explains normalized weighting of pattern 123456.
- FIGS. 7 C- 7 E score three token phrases against the same pattern 123456.
- FIG. 8 is a prior art example of programming natural language understanding using the Grammatical Framework language to describe abstract and concrete descriptions of a calculator.
- FIGS. 9 A- 9 D are excerpts from the code of a NLU vertical application that understands prefix and infix requests to a calculator and performs the requested calculations; they illustrate interpret blocks and interpret statements.
- FIGS. 10 A- 10 C are excerpts from the code of a NLU application that handles dates.
- FIG. 11 is an example of action statements that sets a returned weight.
- FIG. 12 is a sample table block that represents song titles.
- FIGS. 1 - 12 A detailed description of implementations of the technology disclosed is provided with reference to the FIGS. 1 - 12 .
- speech recognition is typically applied first to produce a sequence of words or a set of hypotheses.
- this speech recognition is referred to as a combination of acoustic recognition and language, or linguistic, recognition.
- Speech recognition output is sent to the NLU system to extract the meaning.
- the weakness of such a system is that errors in the speech recognition output can cause the NLU system to misunderstand the intended meaning.
- the technology disclosed includes an integrated approach that decodes both speech and meaning concurrently. If at any time interval, a partial hypothesis is not a prefix of a parsable phrase, it can be eliminated or pruned from a set of token recognition hypotheses for the speech recognition. For those hypotheses that are parsable prefixes, the partial score of the parse can be accumulated, influencing the more likely phrases from a meaning parser's perspective to have a higher score for speech recognition. This approach increases accuracy by eliminating meaningless phrases and by promoting more likely phrases. This approach also improves latency. Response time decreases due to the fact that both steps are executed concurrently as opposed to sequentially.
- the technology disclosed applies the run time meaning parser to new programming constructs that extend a programming language, such as a general-purpose programming language, using the interpret blocks and interpret statements.
- the interpret blocks process token lists (or phrases), produce scores for pattern matches and incomplete pattern matches, and return meanings associated with completed pattern matches.
- the interpret statements identify meaningful patterns that can be constructed from one or more words in a natural language, extended pattern tables, custom statistical language models and other interpret blocks.
- Interpret blocks also link interpret statements with action statements written in the general-purpose programming language which can be invoked by a completed pattern match to perform actions programmed using the power of the general-purpose programming language.
- the first example involves recognition of street addresses.
- This example illustrates defining patterns to process a token string that ambiguously expresses a street address.
- ambiguously expressed we mean that the same token string may match multiple patterns and may have multiple meanings, as in the following example.
- the patterns that the token string may match are expressed using interpret blocks and statements, extended pattern tables and action statements.
- the goal of this example is to be able to detect and parse street addresses and validate them by performing some logic.
- the user may or may not include the word “north” in the query, and if included, may put it in the wrong place as in “first street north” instead of “north first street”.
- interpret block that includes an interpret statement with an extended pattern table that, when the matches are returned, uses an auxiliary table to look up valid street addresses for particular streets:
- n_street US_STREET_ADDRESS_TABLE( ) ⁇ as ⁇
- the optional STREET_NUMBER( ) block captures a number in the address and returns its value as a street address number.
- This block can process input tokens that are as simple as a digit sequence, which, once defined can be easily used, or it can be extended to support more complex cases such as street numbers with dashes, fractions or letters of the alphabet.
- the modularity of this block supports incremental extension from simple to more complex cases.
- the next entity is a more detailed example of an extended pattern table US_STREET_ADDRESS_TABLE( ) that can represent all the variations of all the street addresses that the developer intends to support.
- This table is sometimes called an extended pattern table in contrast to bigram and trigram sequences in conventional statistical language models.
- An extended pattern table is used to parse the meaning of a token sequence, rather than predict the likelihood that a speaker or typist would utter or enter the token sequence.
- the table returns two values: an “id”, a unique number which points to a data structure that contains or refers to information about each street, such as a list of valid street number and the full name of the street as a string.
- the street name alone is provided here as an example of multiple return values.
- Other information such as the street's city, state and geo coordinates can also be part of the data structure that the id points to.
- N first street has the direction “north” before the street name.
- “north” can come after the street name, with a weight of 1/5, which again is normalized by the interpret pattern to have a probability of 1/(1+1/5), since sum of weights need to add up to 1.
- the “street” suffix can be skipped completely, but with a weight of 1/51. If a suffix is provided, it can be the correct suffix “street” with the weight of 1/(1.1).
- a modified regular expression style syntax can used as seen with weights prefixing speech recognition results (or text inputs) as quoted strings combined with symbols such as pipe (“
- weightings are automatically normalized and so the table expression to recognize any one of a, b, or c: (10 “a”
- this table includes “N First Street”, “1st street” and “21st avenue”, which all match the query example above. Assuming no other row of the table matches that query example, then the table entry that is pointed to by “N First Street”, returns 2 rows of the table for when “one twenty” is matched to the street number, and 1 row of the table when “one” is matched to the street number.
- the action statements in the programming language inside the interpret block can then evaluate various conditions, including the weight of the matched row, the range of street numbers in that row, the current location of the user and the distance from that location to the location of the matched row, to pick at least one result. See below discussing FIG. 11 , where action statements that adjust the weighting based on the distance between the user the matched location are discussed.
- n_number which is set to a street number, if one is present
- n_street which points to an entry in an extended pattern table of US streets.
- n_street is the head of a linked list of specific streets, each with a full street name and other properties.
- the code below determines the street with the best weight among the candidate streets. When performing this comparison, it takes the street number into account by calling an auxiliary function to check if a particular number is valid on a particular street.
- the program above can be extended to include additional logic such as the user location (see FIG. 11 discussion), skipping the city and choosing the right target based on population and location, and so on.
- the matching pattern can be made to return a result with fewer token matches by making the. “san jose”. segment optional, using square brackets as in: . [“san jose”].
- the table analysis will not return a completed phrase recognition if the token list omits the city name.
- FIG. 1 illustrates a block diagram of an example environment 100 in which the meaning parser and new programming constructs can be used for custom natural language understanding
- the environment 100 includes at least one client computing device 155 , 156 that includes a processor and at least one application running on the processor 157 .
- the environment also includes a communications network 135 that allows for communication between various components of the environment.
- developers prepare and submit application code 140 .
- Application code 140 can be stored for later processing or submitted directly to the application parser 130 .
- the application code may reference custom statistical language models 110 (SLMs) and extended pattern tables 115 . These SLMs and tables may be submitted with the application code 140 or may previously have been submitted by or made available to developers.
- SLMs statistical language models 110
- the network 135 includes the Internet.
- the network 135 also can utilize dedicated or private communication links that are not necessarily part of the Internet.
- the network 135 uses standard communication technologies, protocols, and/or inter-process communication technologies.
- the client computing devices 155 , the application parser engine 130 and the phrase interpreter engine 117 each include memory for storage of data and software applications, a processor for accessing data in executing applications, and components that facilitate communication over the network 135 .
- the computing devices 155 execute applications, such as web browsers (e.g., a web browser application 157 executing on the computing device 156 ), to allow developers to prepare and submit application code 140 and allow users to submit phrases to be interpreted by the phrase interpreter engine 117 .
- the computing devices 155 , 156 may be for example a workstation, desktop computer, laptop, a tablet computer, mobile phone, or any other type of computing device.
- the application parser engine 130 receives applications and parses them, producing a parse tree or an event stream. It produces application data structures 120 from the parsed application code 140 .
- An application data structure 120 may represent a single application 140 . Alternatively, multiple applications 140 may be used to prepare a single application data structure 120 .
- the application data structure can, for instance, be a tree, a state machine, or a network of valid tokens.
- the application data structure can be compiled or in an interpretable structure.
- the application data structure 120 can include nodes that reference the custom SLMs 110 and the extended pattern tables 115 . This data may be stored collectively or in copies on multiple computers and/or storage devices.
- the acoustic-language recognizer 128 can be a conventional acoustic or speech recognition component that outputs tokens. It can operate in one or more stages. In this application, an acoustic-language recognizer or processor 128 can potentially only include a single stage, acoustic recognition-only processing without application of separate linguistic analysis.
- the technology disclosed can be applied to coupling preliminary processing to meaning-specific patterns, even when the tokens from preliminary processing are phonemes or other tokens that are not full words. For instance, an acoustic stage can process input sound samples to produce phonemes. These phonemes can be passed to a language or linguistic stage that considers and scores sequences of phonemes. Language recognizers sometimes use diphone or triphone analysis to recognize likely sequences of phonemes. Language recognizers sometimes use statistical language models to recognize statistically likely sequences of words.
- Phrase interpreter engine 117 includes an acoustic-language recognizer 128 and the meaning parser 118 .
- the phrase interpreter engine 117 like the application parser engine 130 , is implemented using at least one hardware component.
- the engines are implemented in hardware, firmware, or software running on hardware.
- Software that is combined with hardware to carry out the actions of a phrase interpreter engine 117 can be stored on computer readable media such a rotating or non-rotating memory.
- the non-rotating memory can be volatile or non-volatile.
- computer readable media does not include a transitory electromagnetic signal that is not stored in a memory; computer readable media stores program instructions for execution.
- FIG. 2 in overview, includes an editor 210 used to generate at least one electronic record 211 that includes code with the programming constructs disclosed.
- the electronic record 211 is transmitted to a parser 221 , which may build a parse tree or emit a series of events.
- the parser output is used by an interpreter or compiler 231 to create executable pseudocode or object code.
- the runtime system 241 uses the pseudocode or object code to recognize natural language.
- FIG. 2 is a high-level sequence diagram of actions and associated components from application development through runtime.
- the components illustrated in FIG. 2 operate on computing devices that include a processor and memory coupled to the processor. While components are indicated by blocks, systems that implement the technology disclosed may include subdividing the indicated blocks into more components or combining multiple components into fewer blocks.
- computer-aided software engineering tools such as an integrated development environment (IDE)
- IDE integrated development environment
- Smart editors check syntax using edit time parsing to recognize keywords and the structures implied.
- An integrated development environment also may invoke an interpreter or compiler.
- Some IDEs also include runtime support for debugging a program. Debugging tools recognize the programming constructs disclosed. They allow a programmer to set breakpoints and monitor program execution. Accordingly, there is a range of environments that can implement the technology disclosed, from authoring through launched applications.
- a smart program editor 210 can recognize the programming constructs and keywords corresponding to the technology disclosed. It can color code or otherwise highlight the keywords. It may check the syntax of the programming constructs disclosed as a programmer types. It can create stub code for structures to be completed by the programmer. While smart editors have been used for many years, the programming constructs disclosed herein are new to natural language understanding programming, and offer new opportunities for developer support.
- Regular editors also can be used to author electronic records including program code.
- a regular editor When a regular editor is used, a pretty printer that recognizes the disclosed programming constructs can be used to format the code to make it more readable. Code formatting is often supported in CASE tools, IDEs and smart editors. Still, there are standalone code pretty printing products, not shown in FIG. 2 , that can apply the technology disclosed.
- a parser 221 receives one of more records 211 and converts them to machine recognized format.
- One machine recognized format is a parse tree.
- Another machine recognized format is a stream of events.
- An interpreter or compiler 231 uses the output of the parser.
- the interpreter typically uses the parser output directly in combination with a runtime 241 to execute or debug a program.
- An interpreter may persist an intermediate format for execution, debugging or optimized compilation.
- the intermediate format may be programming language independent.
- a compiler typically uses the output of the parser to compile object code or pseudocode.
- Object code can be optimized for particular platform.
- Pseudocode can be machine independent and can be run on a virtual machine, instead of directly on a physical machine.
- a preprocessor also may use output from the parser to expand the disclosed programming constructs into code in the programming language before its interpretation or compilation.
- a preprocessor may be integrated with a parser.
- a variety of devices 235 are indicated which may be targets for NLU development. These devices may include handheld devices, such as smart phones and tablets, mobile devices such as laptops and workstations or PC.
- NLU components can be deployed to servers 237 coupled in communication with other devices 235 .
- FIGS. 3 and 4 provide additional detail regarding the application parser number 130 and the phrase interpreter 117 .
- reference numbers are reused, such as reference 120 for the application data structure, they refer to the same component as previously described.
- the application code 140 , application parser 130 , application data structure 120 , extended pattern tables 115 , and custom statistical language models 110 are the same components as previously described.
- the application parser 130 parses application code 140 . In some implementations, it recognizes interpret blocks and interpret statements, as explained in the context of the street address example above and in the context of FIG. 9 below. Other implementations may perform these actions in different orders and/or perform different or additional actions than illustrated in FIG. 3 .
- the application parser recognizes patterns 331 in interpret statements and extended pattern tables.
- the extended pattern tables 115 may be stored separately from the application code 140 and reused in a variety of applications.
- the application parser 130 handles integration of tables 333 referred to in application code 140 with the data tables themselves, which may be stored separately 115 .
- the application parser 130 handles integration of custom statistical language models 335 referred to in the application code 140 with custom SLMs, which may be stored separately 110 .
- the custom SLMs 110 may be stored separately from the application code 140 and reused in a variety of applications.
- the application parser 130 Upon parsing the application code 140 and integrating it with the tables 115 and the custom SLMs 110 , the application parser 130 produces one or more application data structures 120 .
- the phrase interpreter 117 includes an acoustic-language recognizer 128 and meaning parser 118 , both of which are used to interpret spoken phrases.
- a meaning parser 118 can operate without an acoustic-language recognizer 128 .
- the meaning parser 118 is useful by itself and can operate in text-based environments that do not receive spoken input.
- the acoustic language recognizer 128 When processing spoken input, the acoustic language recognizer 128 produces token phrases and alternative token recognition hypotheses 437 . At each time interval during interpretation of spoken input, hundreds or even thousands of token recognition hypotheses 437 can be generated. To expand on this point, an interval such as every 10 or 100 milliseconds could be selected to test and generate token recognition hypotheses. At each interval, a thousand, three thousand, or an even larger number of hypotheses could be generated. In some implementations, enumeration of hypotheses could exhaust all combinatorial possibilities for hypotheses.
- the conventional acoustic-language recognizer scores the alternative token recognition hypotheses that it produces and selects a set of token recognition hypotheses for further processing.
- An acoustic recognizer applies an acoustic recognition score to alternative token sequences, selects the set of token sequences and sends them to a language or linguistic recognizer.
- the language recognizer applies a language model, such as a statistical language model, and scores the token sequence hypotheses. It returns the scores to the acoustic recognition stage, which combines the acoustic and language recognition scores.
- the acoustic-language recognizer 128 sends token recognition hypotheses 437 to the meaning parser 118 as token phrases, sequences, or lists 438 .
- the meaning parser 118 processes the tokens and returns values 438 .
- Complete and incomplete parses of tokens can be scored by the meaning parser 118 to return meaning recognition scores. Unrecognizable token phrases can be flagged as such in the returned values.
- Completed parses of token phrases that satisfy a recognized pattern can further return data and/or references to data structures that express the meaning of the token phrase.
- a token phrase being processed can be all or part of the token recognition hypothesis 437 .
- meaning parser 118 includes a token processor 455 , table handler 465 , an SLM handler 475 , and a scorer 485 . Some implementations may have different and/or additional modules than those shown in FIG. 4 . Moreover, the functionalities can be distributed among the modules in a different manner than described or illustrated.
- the token processor 455 receives tokens 438 in the hypotheses 437 . It processes these tokens against the application data structure 120 . As tables and statistical language models are encountered or invoked, the table handler 465 and SLM handler 475 are invoked.
- the table handler 465 handles extended patterns expressed as rows in the tables 115 . Additional details of these patterns and the processing of rows are described below in the context of FIG. 7 .
- the SLM handler 475 handles custom statistical language models 110 .
- Mixing the indication of custom SLMs into an extended pattern, whether in an interpret statement for the role of the table creates a context for invoking the SLM.
- This context favors custom SLMs over general SLMs. For instance, the subject line of an email will use different language constructs and different phrases in the body of an email. Accordingly, different custom SLMs would be used in patterns for subject lines and message text.
- a scorer 485 accumulates and normalizes a meaning recognition score during processing of a token phrase.
- the scorer 485 can generate scores for both partial and completed pattern recognition. Scoring of token sequences against patterns is the subject of FIGS. 7 B- 7 D , below.
- Meaning parser 118 further executes action statements contained within interpret statements. These action statements are discussed above in the context of the introductory example and below in the context of FIG. 9 .
- FIG. 5 is a block diagram of the programming language constructs interpret-block 511 and interpret-statement 515 .
- a program using these constructs is stored in one or more electronic records.
- the program includes one or more interpret-blocks 511 , 531 .
- the blocks include one or more variables 521 , 541 to be returned from the block and one or more interpret-statements 515 , 525 , 535 , 545 .
- Variables returned during execution of a block can be accessed by containing blocks.
- the variables are declared public, the values returned also can be accessed by subsequently invoked blocks that are not containing blocks.
- FIG. 6 is a block diagram with more detail of an implementation of the interpret-statement 615 .
- the interpret-statement includes a pattern 615 and an action 625 triggered by matching the pattern.
- the pattern is the modified regular expression of words in a target natural language and additional interpret-blocks, as previously described.
- the words in the natural language are terminal symbols and the additional interpret-blocks are non-terminal symbols. While a regular expression is a convenient and well-understood pattern formulation, other patterns also can be used. Patterns in the interpret-statements are used to match text or utterances. Multiple patterns in interpret-statements can match parts of a single input text or utterance.
- a parser may flag and regroup words or word patterns that will match multiple interpret-statements, and take advantage of this to optimize the application data structure 120 .
- the action statements include programming instructions in the extended programming language, such as a general-purpose programming language. These action statements include assigning values to the variables of the block-statement, which represent understanding of parts of the input text or utterance.
- the action statements may also modify the weight (score) of the parse, and eliminate parses using the special excludethis( ) statement.
- the excludethis( ) statement is a special statement that effectively sets the weight of a parse to 0. Since weights in the example are accumulated through multiplication, a weight of 0 should remove a partial parse from the list.
- FIG. 11 An example of weight modification and excludethis( ) is provided in FIG. 11 .
- the user can ask for nearby location, which is defined in another block called LOCATION( ). If the physical location of the returned location is not within 100 miles of the user's current location, the action statement calls the excludethis( ) statement which eliminates this parse.
- LOCATION( ) the excludethis( ) statement which eliminates this parse.
- the same approach could be extended to the location set by the user for the origin of the search. Otherwise the weight of the parse is adjusted by the value of the distance of the user from the location. This effectively gives preference to locations that are closer to the user.
- other constructs may be provided to provide similar exclusion functionality.
- FIG. 7 includes FIGS. 7 A- 7 E . It begins with a table showing three weighted patterns in the table and corresponding return references, or ids, 123456, 123457, 123458 and street names. While these extended patterns are illustrated as part of the table, similar patterns can be used to define the pattern of an interpret statement with the added capability to reference other interpret blocks, tables, and SLMs.
- FIG. 7 B explains normalized weighting of pattern 123456.
- FIGS. 7 C- 7 E score three token phrases against the same pattern 123456.
- the pattern for 123456 is expanded across three rows 723 , 725 , 727 to better show the nesting and weighting. Above each row are digits indicating a depth of nesting. These digits are above parentheses and square brackets. Immediately below each row is a summary of weighting juxtaposed with the rest of the pattern, e.g. “next row” and “above” designations. In an integrated development environment, nesting pairs might be indicated by matching colors of parentheses or brackets. Below each row are indications of weights assigned to segments of the pattern.
- Weights are indicated in two ways. Inside curly braces, alternatives are separated by the symbol
- a second type of weight is indicated by fractions without curly brackets, tracking optional pattern elements that appear in square brackets. For instance, in line 723 , the pattern element “north” is indicated as optional within square brackets.
- the weight 10 precedes the token. This indicates that it is 10 times as likely that the token “north” will precede “first” in a reference that means “N. 1st Avenue”, as it is likely to be omitted. When the token “north” appears in the token phrase, this term in the pattern is given a weight of 10/11. When the token is omitted, this term in the pattern is given a weight of 1/11.
- FIGS. 7 C- 7 E are scoring examples.
- the three rows in pattern 123456 are reproduced as rows 733 , 735 and 737 of FIG. 7 C ; rows 743 , 745 and 747 of FIG. 7 D ; and rows 753 , 755 and 757 of FIG. 7 E .
- Nesting levels appear above parentheses and brackets in the pattern. Resulting weights appear in curly braces below the pattern.
- the first half of the disjunction between lines 733 and 735 is matched. Accordingly, weights are assigned to line 733 and not to line 735 . As between the two alternatives, the total weight available is 5/6 for line 733 and 1/6 for line 735 . In curly braces, this is expressed as 1/(1+1/5). In the scored token string “north first street san jose”, the token “north” appears, so weight of 10/11 is assigned. The token “first” is mandatory for a pattern match and it appears with an implicit weight of one. One of the tokens “street”, “avenue”, or “road” appears, the first of which is 50 times as likely as an alternative, so a weight of 50/51 is assigned.
- the token string does not include the optional word “in”, which is weighted as unlikely to be used.
- the omission of this token effectively has a weight of 10/11.
- the token string also omits the state, which is equally likely to appear be omitted, so a weight of 1/2 is applied to the omission.
- the partial scores illustrated in this figure can be combined by multiplying them together.
- Each of the partial scores is between zero and one.
- the product of the scores also will be between zero and one.
- an average of the scores could be calculated, which also would be between zero and one.
- multiplicative and additive approaches can be used, for instance, multiplying together weights in a pattern segment (between two periods) and taking the average of weights across the pattern segments.
- a weight of 1/(1+1/10) is assigned.
- the token string does not include the optional word “in”, which is weighted as unlikely to be used.
- the omission of this token effectively has a weight of 10/11.
- the token string also omits the state, which is equally likely to appear be omitted, so a weight of 1/2 is applied to the omission.
- FIG. 7 E is similar to FIG. 7 C , but omitting “north” from the token list.
- the first half of the disjunction between lines 733 and 735 is matched because “north” is mandatory in the second line 735 .
- weights are assigned to line 733 and not to line 735 .
- the total weight available is 5/6 for line 733 and 1/6 for line 735 . In curly braces, this is expressed as 1/(1+1/5).
- the token “north” is omitted, so weight of 1/11 is assigned.
- the token “first” is mandatory for a pattern match and it appears, with an implicit weight of one.
- One of the tokens “street”, “avenue”, or “road” appears, the first of which is 50 times as likely as an alternative, so a weight of 50/51 is assigned. Since “street” is being matched in this case instead of the alternatives “avenue” or “road”, a weight of 1/(1+1/10) is assigned.
- the token string does not include the optional word “in”, with a weight of 10/11.
- the token string also omits the state, with a weight of 1/2 is applied to the omission.
- a sequence of input tokens can have more than one parse (or partial parse), with each parse having its own weight.
- the weights for all of the partial parses can be combined to represent the total weight (or probability) of a sequence of tokens.
- the weight of a sequence of tokens is useful and can be used, for example, as the score of the sequence of tokens in a speech recognition engine where multiple sequences of tokens are being considered.
- the total weight is the sum of all the weights, which makes statistical sense as weights are considered to be probabilities (the probability of a sequence of tokens is the sum of the probabilities of all its possible parses).
- the maximum weight can be used.
- the average of the weights can be calculated.
- FIG. 12 is a portion of a table block that represents song titles. This table may have thousands or even millions of rows but only 3 rows are shown in this example.
- the weight for each record can represent a measure of popularity, and if normalized to add up to one, can be considered the probability of the record.
- the system is capable of automatically normalizing weights to 1, as explained in the example of FIG. 7 , which saves a developer the burden of verifying correct sums of weights. If the user asks for “I just called to say I love you by stevie wonder”, then the popularity measure of 0.01 is used (in addition to the appropriate adjustment of 1/2 to skip the optional “by”). However, if the user asks for “I just called to say I love you”, then there are multiple matches in the table, and the weights can be added because the probability of the user asking for this song title should be the sum of the probabilities of all songs with this title.
- an interpret block is used with a table and two custom statistical language models to compose an email message.
- the table is a simple table of contacts.
- the SLMs are for subject and message.
- CONTACT( ) is a table pattern representing a list of valid contacts in a database that the user can send emails to.
- SUBJECT( ) represents a statistical language model
- MESSAGE also represents a statistical language model, and the developer can choose to use a different language model for each. This makes sense as the statistical properties of an email subject could be different from that of the body of an email.
- the system Given the query, “email ernie subject lunch meeting message i am running late”, the system matches “ernie” as the recipient (assuming Ernie is in the CONTACT( ) database), the subject becomes “lunch meeting” and the body of the email becomes “i am running late”. The system scores the subject line using the statistical language model represented by SUBJECT( ), and the body of the email using the statistical language model represented by MESSAGE( ). The system can do that because as it is parsing the query, it knows the state of its various parses. Now assume the following query:
- the code in FIG. 9 is an example of one implementation of the technology disclosed. This example applies natural language understanding to requests to a calculator to perform calculations.
- This code written as an extension of C++, is unmistakably more elegant than the prior art FIG. 8 from GF.
- the prior art GF code is written with multiple levels of abstraction in a special purpose programming language. Significant specialized expertise is required to even read the prior art code in FIG. 8 .
- one of ordinary skill in the art could well read the code in FIG. 9 and understand it with a general orientation, instead of a manual that is dozens or hundreds of pages long.
- the technology disclosed can be used to extend a programming language by the addition of a small number of constructs.
- the technology can serve as a core around which a programming language is designed.
- the technology disclosed is used to extend a mainstream, well-understood programming language such as C++.
- it may extend other programming languages such as Objective C, C#, Javascript, Java, Python, Ruby, or Perl. Extending a general purpose programming language takes advantage of an existing large pool of programmers who already are familiar with the language.
- the technology disclosed also could be incorporated into a special purpose programming language with benefits to those who understand or choose to learn the special purpose language.
- Interpret-statements such as 913 , 914 , 928 include both a pattern and an action to be performed (or action statements) when the pattern matches an input text or utterance.
- the patterns in this example are modified regular expressions of terminal and non-terminal symbols, with optional weights. For instance, in interpret-statement 928 of FIG. 5 b , the natural language word to match is “minus”, which is called a terminal symbol by those working in NLU.
- the actions triggered by various pattern matches are expressed in a general purpose programming language.
- the patterns can include non-terminals that invoke other blocks, pattern tables, and SLMs.
- FIG. 9 A-D are four pages excerpted from a longer working NLU vertical application for a calculator that can be expressed in nine pages of clear and readable code.
- the excerpts from this nine-page program extend a general purpose programming language with just two programming constructs and use the general purpose language to implement actions triggered by matched patterns.
- FIG. 10 the code samples provided apply the programming constructs disclosed.
- FIGS. 10 A- 10 C are four pages excerpted from code from a longer working NLU application that handles dates. Interpretation of natural language that expresses a date is more challenging than implementing a calculator, as described below, because dates can be expressed in so many ways.
- recurring dates are excluded 1013 ; today, tomorrow and the next day are handled 1014 , 1015 , 1016 ; a day of week and date of month are combined 1021 ; a day of week and ordinal number are combined 1026 ; a part of day, such as morning, is combined with an ordinal number 1031 ; a legal document date style is handled 1033 ; and a day of week is combined with “this” or “next” week 1036 .
- interpret blocks 1011 can cause several interpret blocks (.ter files) to be included in the code before the DATE( ) interpret block 1012 .
- Several interpret-statements 1014 , 1015 , 1016 , 1021 , 1026 , 1031 , 1033 , 1036 appear in block 1012 .
- the exclude-statement 1013 stops execution of the block without processing subsequent interpret-statements and without assigning values to the block variables 1012 , when the input includes phrases that indicate recurring dates, as this block does not handle recurring dates.
- exclude statements are a type of interpret statement.
- Interpret-statements 1014 , 1015 , 1016 handle the words “today”, “this”, “tomorrow” and “day after tomorrow”.
- Optional words that do not change the selected date include approximate times of day, such as “morning”, “evening”, “afternoon” etc. Some of these optional words may trigger assignment of a positive value to the variable “pm_hint”.
- Parts of the pattern in interpret-statement 1014 for instance, cause values to be assigned to n1 or n2. If values have been assigned to n1 or n2, the last line in 1014 resets pm_hint from 0 to 1.
- several qualitative block variables 1012 such as month_index_implicit, year_index_implicit, week_delay and pm_hint are given values.
- Interpret-statements 1021 , 1026 , 1031 , 1033 and 1036 include weights 1022 , 1027 , 1032 , 1034 , 1037 assigned when optional approximate times of day are part of the input. Four of the five interpret-statements use the same weight. The fifth 1033 , assigns a much smaller weight 1034 . These weights can be used by a runtime system to select among multiple interpret statements that may be triggered by a particular input pattern. In some implementations, the parser automatically normalizes weights returned by interpret-statements and parts of interpret-statements. Normalizing weights to total one another chosen value simplifies human understanding of output and debugging output when the example code runs. Normalizing weights to sum to one effectively produces a Bayesian probability distribution that can be used at runtime.
- the DATE( ) example in FIG. 10 illustrates the power of extending a general purpose programming language through the juxtaposition of patterns with actions.
- the general purpose programming language is used in this example and others to convey complex logic that could be difficult to express in a special purpose programming language.
- the immediate juxtaposition of patterns and actions in the interpret-statements makes it easy to see how language is being understood and what patterns are being matched. This contrasts with other approaches that rely on separate abstract syntax and concrete syntax or that express how to proceed depending on “slots” that have been filled.
- the action statements can inter-relate complex computations including mismatches across “slots”, e.g. invalid addresses.
- the main programming constructs in FIG. 9 are enumerations, interpret-blocks and interpret-statements.
- Enumerations 911 are programmed in the underlying general-purpose programming language, in this case in C++.
- a number of interpret-blocks are identified 912 , 917 , 921 , 926 , 941 . Many others are unlabeled.
- the interpret-block ARITH_QUERY( ) 912 for instance, includes three variable arguments response, *formula and valueStr. It also includes two interpret statements 913 , 914 .
- the interpret-block is distinguished by its name; interpret-statements are distinguished by their patterns. In other implementations, the interpret-statements also could be named.
- Interpret-statements 913 , 914 in ARITH_QUERY( ) 912 both have patterns that are non-terminal symbols, invoking other named interpret-blocks.
- Interpret statement 913 has slightly more complicated pattern, which concatenates results from two named interpret-blocks.
- the interpret-block ARITH_WHAT( ) has no variables and therefore does not return any values. This is because the sole interpret-statement 916 in the block is triggered by literals such as “tell me” and “what is”, which contribute to recognizing a natural language query, but which do not contribute to performing the desired calculation. From this example, one sees that interpret-blocks and interpret-statements can, in some cases, be configured as trivial filters that ignore parts of the input.
- the interpret-blocks ARITH_INFIX_TAIL( ) 926 and ARITH_INFIX( ) 521 combine to interpret operators in the input and execute the requested operations.
- One block includes multiple interpret statements 927 , 928 , 929 that each address a different operator, such as “plus” 927 or “minus” 928 . Enumerated operators are assigned to the variable “op”, which is one of the variables returned by ARITH_INFIX_TAIL( ).
- the other block ARITH_INFIX( ) 921 uses the value and formula returned by invoking ARITH_INFIX_TAIL( ) 926 as part of pattern in the single interpret-statement 922 contained in the block.
- This interpret statement includes a series of “if” and “else if” statements with the same effect as a case statement, testing the value of the variable “op” returned by ARITH_INFIX_TAIL( ) p26. For a “minus” operator 928 in the input stream, one of the else if clauses 923 matches the value of “op” and the value returned by ARITH_INFIX_TAIL( ) 926 is subtracted from the value returned by ARITH_PREFIX( ) 932 .
- FIG. 9 D Another example in FIG. 9 D is the interpret-block ARITH_PREFIX_UNARY( ) 941 , which handles unary operators.
- the first two interpret-statements 942 , 943 take into account parentheses around unitary operators.
- Additional interpret-statements 944 , 945 interpret various operators. Some of these interpret statements, which operate on patterns that begin with “the” could be reformulated to make “the” an optional part of a regular expression.
- FIG. 9 illustrates elegance of the interpret-block and interpret-statement programming constructs, implanted as an eminently readable extension of a general-purpose programming language.
- interpret-statements can be invoked from a single interpret block to combine a variety of applications with a common entry point. An example follows:
- each of the above commands could have been independently developed by different individuals and entities and tied together by the common entry point by yet another developer.
- each domain or vertical can be created or maintained by experts in that domain. For example, weather service providers can wok on the weather vertical, while navigation is created and maintained by other experts and so on.
- a method includes an automated method of accurately recognizing a speaker's meaning.
- This method includes producing and scoring partial transcriptions of a spoken phrase at intervals as the phrase is spoken using at least one acoustic-language recognition stage and a meaning parser stage.
- the acoustic-language stage generates a multiplicity of token recognition hypotheses that build on prior partial transcriptions and selects a set of the token recognition hypotheses using at least prior scoring of the prior partial transcriptions at earlier intervals and current acoustic-language scoring of the token recognition hypotheses at a present interval.
- the meaning parser stage concurrently processes particular token recognition hypotheses in the set of the token recognition hypotheses; determines whether a particular token recognition hypothesis has a parsable meaning; rejects unparsable hypotheses; and scores and returns at least one parsable meaning score for a particular token recognition hypothesis that has a parsable meaning.
- the acoustic-language recognition stage further stores for use at subsequent intervals combined scores of the token recognition hypotheses for current partial transcriptions, the combined scores incorporating at least the acoustic-language recognition scores and the parsable meaning scores.
- the acoustic-language recognition stage can prune the prior partial transcriptions used to build the token recognition hypotheses using the meaning parser stage rejections of the unparsable hypotheses.
- the method includes selecting at least one completed transcription of the spoken phrase that has been scored as recognizable by the acoustic-language recognition stage and as parsable by the meaning parser stage.
- the meaning parser stage can further take the actions of processing the token recognition hypothesis that includes at least one ambiguously expressed element and at least one dependent element that correlates with the ambiguously expressed element against an interpretation pattern and of processing and scoring the ambiguously expressed element against multiple rows of an extended pattern table that is invoked while processing the interpretation pattern, at least some of the rows declaratively expressing weighted alternative expressions of a particular ambiguously expressed element.
- Handling ambiguously expressed elements further include applying logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table using return values from the scored rows in combination with against at least information from a group consisting of (1) the dependent element from the token recognition hypothesis, (2) an optional element in the token recognition hypothesis, and (3) supplemental information not included in the token recognition hypothesis.
- One implementation can further include applying logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table, including comparing valid dependent values in an auxiliary table with the dependent element from the token recognition hypothesis.
- One implementation can further include applying logic expressed in a general programming language to process supplemental information not included in the token recognition hypothesis and rescore at least some of the scored rows.
- One implementation can further include applying logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table against optional elements in the token recognition hypothesis.
- One implementation can further include the meaning parser stage scoring the token recognition hypotheses against an interpretation pattern that includes at least one predicate condition and at least one statistical language model applied when the predicate condition is satisfied.
- a particular token recognition hypothesis includes an ambiguously expressed element and a dependent element that correlates with the ambiguously expressed element; the meaning parser stage further takes the actions of scoring the token recognition hypothesis against a plurality of interpretation patterns built from a model pattern, the model pattern implementing at least: (1) a table pattern that includes rows in an extended pattern table, at least some of the rows declaratively expressing weighted alternative expressions of a particular ambiguously expressed element; and (2) a statistical pattern that includes a predicate condition and a custom statistical language model applied when the predicate condition is satisfied.
- applying the table pattern includes scoring the token recognition hypothesis against multiple rows of the extended pattern table.
- applying the statistical pattern includes scoring the token recognition hypothesis against the custom statistical language model.
- Processing results of scoring the tech in recognition hypothesis against rows of the table can further include applying logic expressed in a general programming language to process and rescore at least some scored rows from the extended pattern table using at least information from a group consisting of: (1) the dependent element from the token recognition hypothesis; (2) an optional element in the token recognition hypothesis; and (3) supplemental information not included in the token recognition hypothesis.
- This processing also can include applying the logic expressed in the general programming language to process and rescore at least some of the scored rows from the extended pattern table against valid dependent values in an auxiliary table and the dependent element from the token recognition hypothesis.
- It can include applying the logic expressed in the general programming language to process and rescore at least some of the scored rows from the dependent value table against the optional element in the token recognition hypothesis.
- It can include applying the logic expressed in the general programming language to process the supplemental information not included in the token recognition hypothesis and rescore at least some of the scored rows.
- implementations may include a non-transitory computer readable storage medium storing instructions executable by a processor to perform a method as described above.
- implementations include a system including memory and one or more processors operable to execute instructions, stored in memory, to perform a method as described above.
- a method that, in some environments accurately recognizes an intended meaning of a complete phrase.
- This method includes invoking an interpretation pattern that expresses a complete phrase with a meaning as a sequence of elements that include one or more words from a natural language and a table element that invokes an extended pattern table and receiving a text string of tokens that express an intended meaning, wherein the token include a combination of at least one ambiguously expressed element and a dependent element that correlates with the ambiguously expressed element, and further including one or more supplemental elements.
- the method can further include applying logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table against at least information from a group consisting of (1) a dependent element from the text string, (2) an optional element in the text string, and (3) supplemental information not included in the text string.
- the method further includes applying logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table against valid dependent values in a dependent value table and a dependent element from the text string.
- the method can further include applying logic expressed in a general programming language to process supplemental information not included in the text string and rescore at least some of the scored rows.
- That it can further include applying logic expressed in a general programming language to process and rescore at least some of the scored rows from the dependent value table against optional elements in the text string; and selecting at least one intended meaning using at least the rescored rows of the extended pattern table.
- implementations of this an additional methods described below also may include non-transitory computer readable storage medium storing instructions executable by a processor to perform a method is described.
- implementations may include the system including memory and one or more processors operable to execute instructions, stored in the memory, to perform a method is described.
- the proviso in this paragraph is hereby applied the to the implementations in this section.
- an automated method of building a natural language understanding application includes receiving at least one electronic record containing programming code that interprets sequence of input tokens by extending a general purpose programming language with interpret-block and interpret-statement data structures.
- the interpret-block data structures include at least one of the interpret-statements and zero or more variables returned by the interpret-block.
- the interpret-statements include a pattern of one or more tokens, and zero or more action instructions.
- the action instructions perform logic not achieved by pattern matching and/or assign values to the variables of the interpret-block.
- the method further includes parsing the received program code to produce an executable representation of the interpret-block and interpret-statement data structures.
- This method mother implementations of the technology disclosed can each optionally include one or more the following features.
- At least one token of the interpret expression can be another interpret-block.
- the returned parameters from other interpret-blocks are made available to the action statements inside the interpret block.
- At least one token of the interpret expression can be a statistical language model. It also can be a wildcard. It can be a table of token expressions with fixed returned values for each row of the table and without any action statements. At least one sub-expression of the interpret expression is allowed to have repetitions. At least one of a minimum and a maximum number of repetitions of a sub-expression can be specified.
- the outgoing weights at each token can be normalized to add up to 1. The normalization of outgoing weights can be performed at sub nodes instead of tokens to reflect the way the expression is modularized.
- this method also can be practiced in a non-transitory computer readable storage medium or by a system.
- a method in another implementation, includes scoring a partial transcription of input. Practicing this method includes instantiating in memory at least one data structure derived from programming code that interprets token list using a general purpose programming language extended with interpret-block and interpret-statement data structures.
- the interpret-block data structures include at least one of the interpret-statements and one or more values returned by the interpret-block.
- the interpret-statements include patterns that are built from words in a target natural language, from at least one extended pattern table, and from references to additional interpret-blocks and action instructions in the general purpose programming language that are triggered by a match between parts of an input text and the patterns.
- the extended pattern table matches and scores at least part of the token list against multiple rows of in the extended pattern table, at least some of the rows declaratively expressing weighted alternative expressions of ambiguously expressed elements.
- the action instructions assign values to the variables of the interpret-block, which values attribute meaning to the token list.
- the method further includes receiving the token list and processing and scoring the token list against the data structure including scoring at least part of the token list against multiple rows of the extended pattern table, at least some of the rows declaratively expressing weighted alternative expressions of a particular ambiguously expressed element.
- the action instructions can further include logic expressed in the general programming language to process and rescore at least some of the scored rows from the extended pattern table using at least information from a group consisting of: (1) a dependent element in the token list that has meaning in a context set by an ambiguously expressed element in the token list; (2) an optional element in the token list; and (3) supplemental information receive in addition to the token list.
- the action instructions further can include logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table comparing valid dependent values in an auxiliary value table with the dependent element.
- the action instructions further can include logic expressed in a general programming language to process and rescore at least some of the scored rows from the extended pattern table using the optional element.
- the action instructions further include logic expressed in a general programming language to process the supplemental information not included in the token list and rescore at least some of the scored rows.
- this method also can be practiced in a non-transitory computer readable storage medium or by a system.
- method in another implementation, includes building a natural language understanding (abbreviated NLU) data structure.
- This method includes receiving at least one electronic record containing programming code that interprets an input text by extending a general purpose programming language with interpret-block and interpret-statement data structures.
- the interpret-block data structures include at least one of the interpret-statements and one or more variables returned by the interpret-block.
- the interpret-statements include patterns that are built from words in a target natural language, from at least one extended pattern table and from references to additional interpret-blocks and action instructions in the general purpose programming language that are triggered by a match between parts of an input text and the patterns.
- the extended pattern table matches and scores at least part of the input text against multiple rows of in the extended pattern table, at least some of the rows declaratively expressing weighted alternative expressions of ambiguously expressed elements.
- the action instructions assign values to the variables of the interpret-block, which values attribute meaning to the text.
- the method further includes parsing the received program code to produce a data structure representing the interpret-block and interpret-statement data structures.
- the pattern specified in the interpret-statement data structure can include a regular expression of the words and the additional interpret-blocks.
- the extended pattern table can be invoked by an antecedent event selected from a group at least consisting of: a match between part of the word hypothesis and at least one word in the natural language that is part of the pattern preceding the extended pattern table; and positioning of the extended pattern table as a first element of the pattern.
- the general purpose programming language can belong to a “C” programming language family.
- the set of the interpret-blocks collectively can define a vertical application of NLU.
- Values assigned to the variables of a particular interpret-block can be available to additional interpret-blocks and to a NLU processor at run time.
- Patterns in the interpret-statements in the set of interpret-blocks collectively can match substantially all of a vertical application vocabulary that is recognized by the vertical application of NLU.
- the method can further include receiving a plurality of sets of the interpret-blocks that define a plurality of vertical applications and parsing the plurality of sets of interpret-blocks.
- the interpret-block can further include at least one exclude-statement that contains an exclude pattern that is built from words in a target natural language and matching of the pattern in the exclude-statement causes an exit from the interpret-block without further processing of include-statements.
- the patterns of the include-statements include relative weights assignable to matches of patterns or partial patterns.
- this method also can be practiced as code stored on a non-transitory computer readable storage medium or on running a system.
- parser running on a processor that builds a representation of natural language understanding (abbreviated NLU).
- This parser includes, program instructions running on at least one processor that cause the processor to receive at least one electronic record containing programming code that interprets text or utterances by extending a general purpose programming language with interpret-block and interpret-statement data structures.
- the interpret-block data structures include at least one of the interpret-statements and one or move variables returned by the interpret-block.
- the interpret-statements include a pattern that is built from words in a target natural language or from references to additional interpret-blocks and action instructions in the general purpose programming language that are triggered by a match between parts of the text or utterances and the pattern.
- the action instructions assign values to the variables of the interpret-block, which values attribute meaning to the text or utterances.
- the parser parses the received program code to produce a parse tree that represents the interpret-block and interpret-statement data structures.
- the pattern specified in the interpret-statement data structure can include a regular expression of the words and the additional interpret-blocks.
- the general purpose programming language belongs to a “C” programming language family.
- the values assigned to the variables of a particular interpret-block are available to additional interpret-blocks and to a NLU processor runtime.
- a set of the interpret-blocks collectively can define a vertical application of NLU.
- the patterns in the interpret-statements in the set of interpret-blocks can collectively match substantially all of a vertical application vocabulary that is recognized by the vertical application of NLU.
- Operation of the parser can further include receiving a plurality of sets of the interpret-blocks that define a plurality of vertical applications and parsing the plurality of sets of interpret-blocks.
- the interpret-block can further include at least one exclude-statement that contains an exclude pattern that is built from words in a target natural language and matching of the pattern in the exclude-statement causes an exit from the interpret-block without further processing of include-statements.
- the patterns of the include-statements can further include relative weights assignable to matches of patterns or partial patterns.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Machine Translation (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
Description
-
- 120 first avenue san jose california
- 1 21st avenue san jose california
- 121st avenue san jose california
-
- interpret {[100 n_number=STREET_NUMBER( ).[1/200 “on”]].
-
- /*programming statements to perform logic involving auxiliary table*/
- }
-
- [[123456 “N First Street”]
- (
- ([10 “north”]. “first”. [50 (“street”|(1/10 (“avenue”|“road”)))])|
- (1/5 “first”. [50 (“street”|(1/10 (“avenue”|“road”)))]. “north”)
- ). [1/10 “in”]. “san jose”. [“california”|0.01 “c. a.”]
- ],
- [[123457 “1st Avenue”]
- (“first”. [50 (“avenue”|(1/10 (“street”|“road”)))])
- .[1/10 “in”]. “san jose”. [“california”|0.01 “c. a.”]
- ],
- [[123458 “21st Avenue”]
- (“twenty first”. [50 (“avenue”|(1/10 (“street”|“road”)))])
- .[1/10 “in”]. “san jose”. [“california”|0.01 “c. a.”]
- ],
- /*etc.*/
float best_weight = 0; |
float item_weight; |
unsigned best_id = 0; |
/* Iterate through the linked list starting at n_street */ |
for (size_t match_num = 0; match_num <= n_street->additional_matches ; ++match_num){ |
item_weight = n_street->weight; |
if (n_number){ |
/*check to see if the street number is valid for n_street->id. if not, penalize |
item_weight or exclude it */ |
if (!valid_number(n_number->value, n_street->id)) { |
//excludethis( ); | // complete exclusion |
item_weight *= 0.001; | // strong penalization |
} |
} |
/* keep track of the best id found so far */ |
if ((item_weight > best_weight) |
{ |
best_weight = item_weight; |
best_id=n_street->id; |
} |
n_street=n_street->next_match; |
} |
if (best_id!=0) { |
/* compose the full address based on best_id and n_number, and return as the |
full address of the block */ |
} else { |
excludethis( ); |
} |
-
- Subject: lunch meeting
- Message: message me when you get here
or part of the body: - Subject: lunch meeting message
- message: me when you get here
-
- interpret {n1=CALENDAR_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=PHONE_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=WEATHER_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=TRANSLATE_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=ALARM_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=ARITH_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=WEB_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=PLACE_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=MAP_COMMAND( )} as
- {
- command=n1→command;
- }
- {
- interpret {n1=CALENDAR_COMMAND( )} as
Claims (14)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/225,997 US11776533B2 (en) | 2012-07-23 | 2021-04-08 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
US18/375,906 US20240029721A1 (en) | 2012-07-23 | 2023-10-02 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261674833P | 2012-07-23 | 2012-07-23 | |
US201361798526P | 2013-03-15 | 2013-03-15 | |
US201313843290A | 2013-03-15 | 2013-03-15 | |
US16/209,854 US10996931B1 (en) | 2012-07-23 | 2018-12-04 | Integrated programming framework for speech and text understanding with block and statement structure |
US17/225,997 US11776533B2 (en) | 2012-07-23 | 2021-04-08 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/209,854 Continuation US10996931B1 (en) | 2012-07-23 | 2018-12-04 | Integrated programming framework for speech and text understanding with block and statement structure |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US18/375,906 Continuation US20240029721A1 (en) | 2012-07-23 | 2023-10-02 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
Publications (2)
Publication Number | Publication Date |
---|---|
US20210224043A1 US20210224043A1 (en) | 2021-07-22 |
US11776533B2 true US11776533B2 (en) | 2023-10-03 |
Family
ID=74882543
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/842,735 Active US10957310B1 (en) | 2012-07-23 | 2013-03-15 | Integrated programming framework for speech and text understanding with meaning parsing |
US16/209,854 Active US10996931B1 (en) | 2012-07-23 | 2018-12-04 | Integrated programming framework for speech and text understanding with block and statement structure |
US17/225,997 Active 2033-09-17 US11776533B2 (en) | 2012-07-23 | 2021-04-08 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
US18/375,906 Pending US20240029721A1 (en) | 2012-07-23 | 2023-10-02 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/842,735 Active US10957310B1 (en) | 2012-07-23 | 2013-03-15 | Integrated programming framework for speech and text understanding with meaning parsing |
US16/209,854 Active US10996931B1 (en) | 2012-07-23 | 2018-12-04 | Integrated programming framework for speech and text understanding with block and statement structure |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US18/375,906 Pending US20240029721A1 (en) | 2012-07-23 | 2023-10-02 | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement |
Country Status (1)
Country | Link |
---|---|
US (4) | US10957310B1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220180893A1 (en) * | 2020-12-08 | 2022-06-09 | Line Corporation | Method and system for generating multimedia content |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
AU2014214676A1 (en) | 2013-02-07 | 2015-08-27 | Apple Inc. | Voice trigger for a digital assistant |
US11295730B1 (en) | 2014-02-27 | 2022-04-05 | Soundhound, Inc. | Using phonetic variants in a local context to improve natural language understanding |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US12197817B2 (en) | 2016-06-11 | 2025-01-14 | Apple Inc. | Intelligent device arbitration and control |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK201770427A1 (en) | 2017-05-12 | 2018-12-20 | Apple Inc. | Low-latency intelligent automated assistant |
US20180336275A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Intelligent automated assistant for media exploration |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
JP7298692B2 (en) * | 2019-08-08 | 2023-06-27 | 日本電気株式会社 | Function estimation device, function estimation method and program |
CN111190583B (en) * | 2019-12-31 | 2021-10-22 | 华为技术有限公司 | Associated conflict block presenting method and equipment |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
US11698777B2 (en) * | 2020-09-22 | 2023-07-11 | Jpmorgan Chase Bank, N.A. | Method and apparatus for automatically generating an application code and framework |
US11984124B2 (en) * | 2020-11-13 | 2024-05-14 | Apple Inc. | Speculative task flow execution |
CN113220888B (en) * | 2021-06-01 | 2022-12-13 | 上海交通大学 | Method and system for extracting case clue elements based on Ernie model |
CN113836936B (en) * | 2021-09-22 | 2023-10-13 | 珠海格力电器股份有限公司 | Semantic analysis method, system, device, equipment and storage medium |
US12124804B2 (en) | 2022-03-09 | 2024-10-22 | Soundhound, Inc. | Ordering from a menu using natural language |
Citations (306)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3919479A (en) | 1972-09-21 | 1975-11-11 | First National Bank Of Boston | Broadcast signal identification system |
US4450531A (en) | 1982-09-10 | 1984-05-22 | Ensco, Inc. | Broadcast signal recognition system and method |
US4697209A (en) | 1984-04-26 | 1987-09-29 | A. C. Nielsen Company | Methods and apparatus for automatically identifying programs viewed or recorded |
US4739398A (en) | 1986-05-02 | 1988-04-19 | Control Data Corporation | Method, apparatus and system for recognizing broadcast segments |
US4843562A (en) | 1987-06-24 | 1989-06-27 | Broadcast Data Systems Limited Partnership | Broadcast information classification system and method |
US4918730A (en) | 1987-06-24 | 1990-04-17 | Media Control-Musik-Medien-Analysen Gesellschaft Mit Beschrankter Haftung | Process and circuit arrangement for the automatic recognition of signal sequences |
US4928249A (en) | 1987-02-27 | 1990-05-22 | Societe Anonyme Dite : Smh Alcatel | Operating system for an electronic franking machine |
US4959850A (en) | 1987-05-29 | 1990-09-25 | Kabushiki Kaisha Toshiba | Radio telephone apparatus |
US5019899A (en) | 1988-11-01 | 1991-05-28 | Control Data Corporation | Electronic data encoding and recognition system |
US5033087A (en) | 1989-03-14 | 1991-07-16 | International Business Machines Corp. | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system |
US5054074A (en) | 1989-03-02 | 1991-10-01 | International Business Machines Corporation | Optimized speech recognition system and method |
US5164915A (en) | 1990-09-26 | 1992-11-17 | Information Storage Devices, Inc. | Cascading analog record/playback devices |
US5436653A (en) | 1992-04-30 | 1995-07-25 | The Arbitron Company | Method and system for recognition of broadcast segments |
US5437050A (en) | 1992-11-09 | 1995-07-25 | Lamb; Robert G. | Method and apparatus for recognizing broadcast information using multi-frequency magnitude detection |
US5457768A (en) | 1991-08-13 | 1995-10-10 | Kabushiki Kaisha Toshiba | Speech recognition apparatus using syntactic and semantic analysis |
US5511000A (en) | 1993-11-18 | 1996-04-23 | Kaloi; Dennis M. | Electronic solid-state record/playback device and system |
US5542138A (en) | 1995-02-06 | 1996-08-06 | Williams; Terry N. | Bedside control unit for a hospital bed |
US5577249A (en) | 1992-07-31 | 1996-11-19 | International Business Machines Corporation | Method for finding a reference token sequence in an original token string within a database of token strings using appended non-contiguous substrings |
US5581658A (en) | 1993-12-14 | 1996-12-03 | Infobase Systems, Inc. | Adaptive system for broadcast program identification and reporting |
US5634084A (en) | 1995-01-20 | 1997-05-27 | Centigram Communications Corporation | Abbreviation and acronym/initialism expansion procedures for a text to speech reader |
US5664270A (en) | 1994-07-19 | 1997-09-09 | Kinetic Concepts, Inc. | Patient interface system |
US5687279A (en) | 1994-12-23 | 1997-11-11 | Intel Corporation | Retro-storing analog information in a digital storage circuit |
US5708477A (en) | 1994-12-01 | 1998-01-13 | Forbes; Scott Jared | Video signal identifier for controlling a VCR and television based on the occurrence of commercials |
US5845306A (en) | 1994-06-01 | 1998-12-01 | Mitsubishi Electric Information Technology Center America, Inc. | Context based system for accessing dictionary entries |
US5862260A (en) | 1993-11-18 | 1999-01-19 | Digimarc Corporation | Methods for surveying dissemination of proprietary empirical data |
US5874686A (en) | 1995-10-31 | 1999-02-23 | Ghias; Asif U. | Apparatus and method for searching a melody |
US5880386A (en) | 1995-11-30 | 1999-03-09 | Yamaha Corporation | Musical information processing system with automatic data transfer |
WO1999018518A2 (en) | 1997-10-03 | 1999-04-15 | Peter Polash | Internet based musical indexing system for radio |
US5907815A (en) | 1995-12-07 | 1999-05-25 | Texas Instruments Incorporated | Portable computer stored removable mobile telephone |
US5918223A (en) | 1996-07-22 | 1999-06-29 | Muscle Fish | Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information |
US5956683A (en) | 1993-12-22 | 1999-09-21 | Qualcomm Incorporated | Distributed voice recognition system |
EP0944033A1 (en) | 1998-03-19 | 1999-09-22 | Tomonari Sonoda | Melody retrieval system and method |
US5963957A (en) | 1997-04-28 | 1999-10-05 | Philips Electronics North America Corporation | Bibliographic music data base with normalized musical themes |
JPH11272274A (en) | 1998-03-19 | 1999-10-08 | Tomoya Sonoda | Method for retrieving piece of music by use of singing voice |
US5969283A (en) | 1998-06-17 | 1999-10-19 | Looney Productions, Llc | Music organizer and entertainment center |
US5974409A (en) | 1995-08-23 | 1999-10-26 | Microsoft Corporation | System and method for locating information in an on-line network |
US5991737A (en) | 1996-03-11 | 1999-11-23 | Connexus Corporation | Automated consumer response to publicly broadcast information |
US6049710A (en) | 1997-06-19 | 2000-04-11 | Kimberley Nanette Engen | Wireless prepaid telephone system with dispensable instruments |
US6067516A (en) | 1997-05-09 | 2000-05-23 | Siemens Information | Speech and text messaging system with distributed speech recognition and speaker database transfers |
JP2000187671A (en) | 1998-12-21 | 2000-07-04 | Tomoya Sonoda | Music retrieval system with singing voice using network and singing voice input terminal equipment to be used at the time of retrieval |
US6092039A (en) | 1997-10-31 | 2000-07-18 | International Business Machines Corporation | Symbiotic automatic speech recognition and vocoder |
US6098042A (en) | 1998-01-30 | 2000-08-01 | International Business Machines Corporation | Homograph filter for speech synthesis system |
US6108626A (en) | 1995-10-27 | 2000-08-22 | Cselt-Centro Studi E Laboratori Telecomunicazioni S.P.A. | Object oriented audio coding |
US6122403A (en) | 1995-07-27 | 2000-09-19 | Digimarc Corporation | Computer system linked by using information in data objects |
US6163767A (en) | 1997-09-19 | 2000-12-19 | International Business Machines Corporation | Speech recognition method and system for recognizing single or un-correlated Chinese characters |
US6182128B1 (en) | 1998-03-05 | 2001-01-30 | Touchmusic Entertainment Llc | Real-time music distribution systems |
US6188985B1 (en) | 1997-01-06 | 2001-02-13 | Texas Instruments Incorporated | Wireless voice-activated device for control of a processor-based host system |
US6201176B1 (en) | 1998-05-07 | 2001-03-13 | Canon Kabushiki Kaisha | System and method for querying a music database |
US6233682B1 (en) | 1999-01-22 | 2001-05-15 | Bernhard Fritsch | Distribution of musical products by a web site vendor over the internet |
US6246986B1 (en) | 1998-12-31 | 2001-06-12 | At&T Corp. | User barge-in enablement in large vocabulary speech recognition systems |
US20010005823A1 (en) | 1999-12-24 | 2001-06-28 | Uwe Fischer | Method and system for generating a characteristic identifier for digital data and for detecting identical digital data |
US20010014891A1 (en) | 1996-05-24 | 2001-08-16 | Eric M. Hoffert | Display of media previews |
US6292767B1 (en) | 1995-07-18 | 2001-09-18 | Nuance Communications | Method and system for building and running natural language understanding systems |
US6314577B1 (en) | 1994-10-27 | 2001-11-06 | Michael H. Pocock | Apparatus and method to generate and access broadcast information |
US20010049601A1 (en) | 2000-03-24 | 2001-12-06 | John Kroeker | Phonetic data processing system and method |
US20010053974A1 (en) | 2000-03-14 | 2001-12-20 | Helmut Lucke | Speech recognition apparatus, speech recognition method, and recording medium |
US6345256B1 (en) | 1998-08-13 | 2002-02-05 | International Business Machines Corporation | Automated method and apparatus to package digital content for electronic distribution using the identity of the source content |
US20020023020A1 (en) | 1999-09-21 | 2002-02-21 | Kenyon Stephen C. | Audio identification system and method |
US6363349B1 (en) | 1999-05-28 | 2002-03-26 | Motorola, Inc. | Method and apparatus for performing distributed speech processing in a communication system |
US20020042707A1 (en) | 2000-06-19 | 2002-04-11 | Gang Zhao | Grammar-packaged parsing |
US20020048350A1 (en) | 1995-05-26 | 2002-04-25 | Michael S. Phillips | Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system |
US20020049037A1 (en) | 2000-09-13 | 2002-04-25 | Christensen Kelly M. | System and method for ordering and delivering media content |
US6385434B1 (en) | 1998-09-16 | 2002-05-07 | Motorola, Inc. | Wireless access unit utilizing adaptive spectrum exploitation |
US20020072982A1 (en) | 2000-12-12 | 2002-06-13 | Shazam Entertainment Ltd. | Method and system for interacting with a user in an experiential environment |
US6408272B1 (en) | 1999-04-12 | 2002-06-18 | General Magic, Inc. | Distributed voice user interface |
US20020083060A1 (en) | 2000-07-31 | 2002-06-27 | Wang Avery Li-Chun | System and methods for recognizing sound and music signals in high noise and distortion |
US6434520B1 (en) | 1999-04-16 | 2002-08-13 | International Business Machines Corporation | System and method for indexing and querying audio archives |
US20020111806A1 (en) | 2001-02-13 | 2002-08-15 | International Business Machines Corporation | Dynamic language model mixtures with history-based buckets |
US20020116191A1 (en) | 2000-12-26 | 2002-08-22 | International Business Machines Corporation | Augmentation of alternate word lists by acoustic confusability criterion |
US6453252B1 (en) | 2000-05-15 | 2002-09-17 | Creative Technology Ltd. | Process for identifying audio content |
US20020138265A1 (en) | 2000-05-02 | 2002-09-26 | Daniell Stevens | Error correction in speech recognition |
US20020138630A1 (en) | 2000-12-27 | 2002-09-26 | Solomon Barry M. | Music scheduling algorithm |
US20020156627A1 (en) | 2001-02-20 | 2002-10-24 | International Business Machines Corporation | Speech recognition apparatus and computer system therefor, speech recognition method and program and recording medium therefor |
US20020174431A1 (en) | 2001-05-15 | 2002-11-21 | John Bowman | Method and system for receiving music related information via an internet connection |
US6487532B1 (en) | 1997-09-24 | 2002-11-26 | Scansoft, Inc. | Apparatus and method for distinguishing similar-sounding utterances speech recognition |
US20020181671A1 (en) | 2001-06-01 | 2002-12-05 | Logan James D. | Cellular telephone with audio recording subsystem |
US20020193895A1 (en) | 2001-06-18 | 2002-12-19 | Ziqiang Qian | Enhanced encoder for synchronizing multimedia files into an audio bit stream |
US20020198719A1 (en) | 2000-12-04 | 2002-12-26 | International Business Machines Corporation | Reusable voiceXML dialog components, subdialogs and beans |
US20020198705A1 (en) | 2001-05-30 | 2002-12-26 | Burnett Gregory C. | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US20020198713A1 (en) | 1999-01-29 | 2002-12-26 | Franz Alexander M. | Method and apparatus for perfoming spoken language translation |
US20020198789A1 (en) | 2001-06-22 | 2002-12-26 | Sony Corp. And Sony Music Entertainment, Inc. | Apparatus and method for identifying and purchasing music |
US20030004717A1 (en) | 2001-03-22 | 2003-01-02 | Nikko Strom | Histogram grammar weighting and error corrective training of grammar weights |
US6504089B1 (en) | 1997-12-24 | 2003-01-07 | Canon Kabushiki Kaisha | System for and method of searching music data, and recording medium for use therewith |
US6505160B1 (en) | 1995-07-27 | 2003-01-07 | Digimarc Corporation | Connected audio and other media objects |
US20030009335A1 (en) | 2001-07-05 | 2003-01-09 | Johan Schalkwyk | Speech recognition with dynamic grammars |
US6507727B1 (en) | 2000-10-13 | 2003-01-14 | Robert F. Henrick | Purchase and delivery of digital content using multiple devices and data networks |
US6510325B1 (en) | 1996-04-19 | 2003-01-21 | Mack, Ii Gawins A. | Convertible portable telephone |
US20030023437A1 (en) | 2001-01-27 | 2003-01-30 | Pascale Fung | System and method for context-based spontaneous speech recognition |
US6519564B1 (en) | 1999-07-01 | 2003-02-11 | Koninklijke Philips Electronics N.V. | Content-driven speech-or audio-browser |
US20030040901A1 (en) | 2001-08-21 | 2003-02-27 | Wang Yeyi | Method and apparatus for dynamic grammars and focused semantic parsing |
US6535849B1 (en) | 2000-01-18 | 2003-03-18 | Scansoft, Inc. | Method and system for generating semi-literal transcripts for speech recognition systems |
US6542869B1 (en) | 2000-05-11 | 2003-04-01 | Fuji Xerox Co., Ltd. | Method for automatic analysis of audio including music and speech |
US20030078928A1 (en) | 2001-10-23 | 2003-04-24 | Dorosario Alden | Network wide ad targeting |
US20030083863A1 (en) | 2000-09-08 | 2003-05-01 | Ringger Eric K. | Augmented-word language model |
US20030106413A1 (en) | 2001-12-06 | 2003-06-12 | Ramin Samadani | System and method for music identification |
US20030110035A1 (en) | 2001-12-12 | 2003-06-12 | Compaq Information Technologies Group, L.P. | Systems and methods for combining subword detection and word detection for processing a spoken input |
US20030125945A1 (en) | 2001-12-14 | 2003-07-03 | Sean Doyle | Automatically improving a voice recognition system |
US6594628B1 (en) | 1995-09-21 | 2003-07-15 | Qualcomm, Incorporated | Distributed voice recognition system |
WO2003061285A2 (en) | 2001-12-24 | 2003-07-24 | Scientific Generics Limited | Captioning system |
US6611607B1 (en) | 1993-11-18 | 2003-08-26 | Digimarc Corporation | Integrating digital watermarks in multimedia content |
US20030163320A1 (en) | 2001-03-09 | 2003-08-28 | Nobuhide Yamazaki | Voice synthesis device |
US6614914B1 (en) | 1995-05-08 | 2003-09-02 | Digimarc Corporation | Watermark embedder and reader |
US20030167166A1 (en) | 1999-09-04 | 2003-09-04 | International Business Machines Corporation | Speech recognition system |
US20030187649A1 (en) | 2002-03-27 | 2003-10-02 | Compaq Information Technologies Group, L.P. | Method to expand inputs for word or document searching |
US6631346B1 (en) | 1999-04-07 | 2003-10-07 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for natural language parsing using multiple passes and tags |
US20030191645A1 (en) | 2002-04-05 | 2003-10-09 | Guojun Zhou | Statistical pronunciation model for text to speech |
US6633845B1 (en) | 2000-04-07 | 2003-10-14 | Hewlett-Packard Development Company, L.P. | Music summarization system and method |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US20030192424A1 (en) | 2002-04-11 | 2003-10-16 | Yuji Koike | Lyric display method, lyric display computer program and lyric display apparatus |
US6640306B1 (en) | 1997-08-15 | 2003-10-28 | Sony Corporation | Data transmitter, data reproduction method, terminal, and method for transferring data transmitter |
EP1367590A2 (en) | 2002-05-31 | 2003-12-03 | Pioneer Corporation | Music recording/reproducing apparatus, list generation method, and information recording medium |
US20030233225A1 (en) * | 1999-08-24 | 2003-12-18 | Virtual Research Associates, Inc. | Natural language sentence parser |
US20040002858A1 (en) | 2002-06-27 | 2004-01-01 | Hagai Attias | Microphone array signal enhancement using mixture models |
US20040019497A1 (en) | 2001-12-04 | 2004-01-29 | Volk Andrew R. | Method and system for providing listener-requested music over a network |
US6708150B1 (en) | 1999-09-09 | 2004-03-16 | Zanavi Informatics Corporation | Speech recognition apparatus and speech recognition navigation apparatus |
US20040193420A1 (en) | 2002-07-15 | 2004-09-30 | Kennewick Robert A. | Mobile systems and methods for responding to natural language speech utterance |
US6804645B1 (en) | 1996-04-02 | 2004-10-12 | Siemens Aktiengesellschaft | Dynamic phoneme dictionary for speech recognition |
WO2004091307A2 (en) | 2003-04-08 | 2004-10-28 | Advanced Bionutriton Corporation | Feed additives against diseasse infection in terrestrial and aquatic animals |
US20040231498A1 (en) | 2003-02-14 | 2004-11-25 | Tao Li | Music feature extraction using wavelet coefficient histograms |
US6834308B1 (en) | 2000-02-17 | 2004-12-21 | Audible Magic Corporation | Method and apparatus for identifying media content presented on a media playing device |
US20050010412A1 (en) | 2003-07-07 | 2005-01-13 | Hagai Aronowitz | Phoneme lattice construction and its application to speech recognition and keyword spotting |
US20050016360A1 (en) | 2003-07-24 | 2005-01-27 | Tong Zhang | System and method for automatic classification of music |
US6850288B2 (en) | 2000-07-04 | 2005-02-01 | Matsushita Electric Industrial Co., Ltd. | Apparatus for providing programming |
US20050027699A1 (en) | 2003-08-01 | 2005-02-03 | Amr Awadallah | Listings optimization using a plurality of data sources |
US20050033574A1 (en) | 2003-08-06 | 2005-02-10 | Samsung Electronics Co., Ltd. | Method and apparatus handling speech recognition errors in spoken dialogue systems |
US20050060685A1 (en) | 2003-09-11 | 2005-03-17 | Ingo Franz | Program generator |
US6879950B1 (en) | 2000-10-19 | 2005-04-12 | Motorola, Inc. | System and method of decoding a packed representation of multiple parses |
US20050086059A1 (en) | 1999-11-12 | 2005-04-21 | Bennett Ian M. | Partial speech processing device & method for use in distributed systems |
US20050125232A1 (en) | 2003-10-31 | 2005-06-09 | Gadd I. M. | Automated speech-enabled application creation method and apparatus |
US20050137939A1 (en) | 2003-12-19 | 2005-06-23 | Palo Alto Research Center Incorporated | Server-based keyword advertisement management |
US20050143970A1 (en) | 2003-09-11 | 2005-06-30 | Voice Signal Technologies, Inc. | Pronunciation discovery for spoken words |
US20050144064A1 (en) | 2003-12-19 | 2005-06-30 | Palo Alto Research Center Incorporated | Keyword advertisement management |
US20050144065A1 (en) | 2003-12-19 | 2005-06-30 | Palo Alto Research Center Incorporated | Keyword advertisement management with coordinated bidding among advertisers |
US6931451B1 (en) | 1996-10-03 | 2005-08-16 | Gotuit Media Corp. | Systems and methods for modifying broadcast programming |
US6941275B1 (en) | 1999-10-07 | 2005-09-06 | Remi Swierczek | Music identification system |
US20050256715A1 (en) | 2002-10-08 | 2005-11-17 | Yoshiyuki Okimoto | Language model generation and accumulation device, speech recognition device, language model creation method, and speech recognition method |
US20050254366A1 (en) | 2004-05-14 | 2005-11-17 | Renaud Amar | Method and apparatus for selecting an audio track based upon audio excerpts |
US6967275B2 (en) | 2002-06-25 | 2005-11-22 | Irobot Corporation | Song-matching system and method |
US20050273326A1 (en) | 2004-06-02 | 2005-12-08 | Stmicroelectronics Asia Pacific Pte. Ltd. | Energy-based audio pattern recognition |
US20060004572A1 (en) | 2004-06-30 | 2006-01-05 | Microsoft Corporation | Homonym processing in the context of voice-activated command systems |
US6996529B1 (en) | 1999-03-15 | 2006-02-07 | British Telecommunications Public Limited Company | Speech synthesis with prosodic phrase boundary information |
US20060059225A1 (en) | 2004-09-14 | 2006-03-16 | A9.Com, Inc. | Methods and apparatus for automatic generation of recommended links |
US7017208B2 (en) | 1995-08-04 | 2006-03-28 | Hill-Rom Services, Inc. | Hospital bed |
US20060069547A1 (en) | 2004-09-15 | 2006-03-30 | Microsoft Corporation | Creating a speech recognition grammar for alphanumeric concepts |
US20060095250A1 (en) | 2004-11-03 | 2006-05-04 | Microsoft Corporation | Parser for natural language processing |
US7058376B2 (en) | 1999-01-27 | 2006-06-06 | Logan James D | Radio receiving, recording and playback system |
US20060129396A1 (en) | 2004-12-09 | 2006-06-15 | Microsoft Corporation | Method and apparatus for automatic grammar generation from data entries |
US20060155694A1 (en) | 2004-12-29 | 2006-07-13 | Chowdhury Abdur R | Query routing |
US7085716B1 (en) | 2000-10-26 | 2006-08-01 | Nuance Communications, Inc. | Speech recognition using word-in-phrase command |
US20060169126A1 (en) | 2002-09-18 | 2006-08-03 | Takehiko Ishiwata | Music classification device, music classification method, and program |
US7089541B2 (en) * | 2001-11-30 | 2006-08-08 | Sun Microsystems, Inc. | Modular parser architecture with mini parsers |
US20060189298A1 (en) | 2003-03-06 | 2006-08-24 | Maurizio Marcelli | Method and software program product for mutual authentication in a communications network |
US20060200350A1 (en) | 2004-12-22 | 2006-09-07 | David Attwater | Multi dimensional confidence |
US20060224384A1 (en) | 2005-03-31 | 2006-10-05 | International Business Machines Corporation | System and method for automatic speech recognition |
US20060242017A1 (en) | 2005-03-09 | 2006-10-26 | Medio Systems, Inc. | Method and system of bidding for advertisement placement on computing devices |
US20060277052A1 (en) | 2005-06-01 | 2006-12-07 | Microsoft Corporation | Variable speed playback of digital audio |
US20060277030A1 (en) | 2005-06-06 | 2006-12-07 | Mark Bedworth | System, Method, and Technique for Identifying a Spoken Utterance as a Member of a List of Known Items Allowing for Variations in the Form of the Utterance |
US20060282266A1 (en) | 2005-06-13 | 2006-12-14 | Microsoft Corporation | Static analysis of grammars |
US20070010195A1 (en) | 2005-07-08 | 2007-01-11 | Cingular Wireless Llc | Mobile multimedia services ecosystem |
US20070016404A1 (en) | 2005-07-15 | 2007-01-18 | Samsung Electronics Co., Ltd. | Method and apparatus to extract important spectral component from audio signal and low bit-rate audio signal coding and/or decoding method and apparatus using the same |
US20070038453A1 (en) | 2005-08-09 | 2007-02-15 | Kabushiki Kaisha Toshiba | Speech recognition system |
US20070055500A1 (en) | 2005-09-01 | 2007-03-08 | Sergiy Bilobrov | Extraction and matching of characteristic fingerprints from audio signals |
US7190971B1 (en) | 1997-07-29 | 2007-03-13 | Sony Corporation | Information processing apparatus and method, information processing system, and transmission medium |
US7206820B1 (en) | 2000-03-18 | 2007-04-17 | Digimarc Corporation | System for linking from object to remote resource |
US7209892B1 (en) | 1998-12-24 | 2007-04-24 | Universal Music Group, Inc. | Electronic music/media distribution system |
US7225132B2 (en) | 2000-03-14 | 2007-05-29 | British Telecommunications Plc | Method for assigning an identification code |
US20070120689A1 (en) | 2001-08-03 | 2007-05-31 | Zerhusen Robert M | Patient point-of-care computer system |
US7233321B1 (en) | 1998-12-15 | 2007-06-19 | Intel Corporation | Pointing device with integrated audio input |
US20070156392A1 (en) | 2005-12-30 | 2007-07-05 | International Business Machines Corporation | Method and system for automatically building natural language understanding models |
US20070168413A1 (en) | 2003-12-05 | 2007-07-19 | Sony Deutschland Gmbh | Visualization and control techniques for multimedia digital content |
US20070168409A1 (en) | 2004-02-26 | 2007-07-19 | Kwan Cheung | Method and apparatus for automatic detection and identification of broadcast audio and video signals |
US7257536B1 (en) | 1999-11-23 | 2007-08-14 | Radiant Systems, Inc. | Audio request interaction system |
US20070204319A1 (en) | 1996-12-05 | 2007-08-30 | Subutai Ahmad | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US7266343B1 (en) | 1999-09-29 | 2007-09-04 | Nokia Corporation | Method of and system for providing identification of a piece of information |
US7266495B1 (en) | 2003-09-12 | 2007-09-04 | Nuance Communications, Inc. | Method and system for learning linguistically valid word pronunciations from acoustic data |
US20070208569A1 (en) | 2006-03-03 | 2007-09-06 | Balan Subramanian | Communicating across voice and text channels with emotion preservation |
US20070239676A1 (en) | 2006-03-29 | 2007-10-11 | Stonehocker Timothy P | Method and system for providing focused search results |
US20070260456A1 (en) | 2006-05-02 | 2007-11-08 | Xerox Corporation | Voice message converter |
US20070260634A1 (en) | 2006-05-04 | 2007-11-08 | Nokia Corporation | Apparatus, system, method, and computer program product for synchronizing the presentation of media content |
US20070288444A1 (en) | 2003-05-06 | 2007-12-13 | International Business Machines Corporation | Web-based customer service interface |
WO2008004181A2 (en) | 2006-07-06 | 2008-01-10 | Koninklijke Philips Electronics N.V. | Methods and devices for managing a play-list of digital content |
US7323629B2 (en) | 2003-07-16 | 2008-01-29 | Univ Iowa State Res Found Inc | Real time music recognition and display system |
US20080026355A1 (en) | 2006-07-27 | 2008-01-31 | Sony Ericsson Mobile Communications Ab | Song lyrics download for karaoke applications |
US20080022844A1 (en) | 2005-08-16 | 2008-01-31 | Poliner Graham E | Methods, systems, and media for music classification |
US7328153B2 (en) | 2001-07-20 | 2008-02-05 | Gracenote, Inc. | Automatic identification of sound recordings |
US20080046247A1 (en) | 2006-08-21 | 2008-02-21 | Gakuto Kurata | System And Method For Supporting Text-To-Speech |
US20080059188A1 (en) | 1999-10-19 | 2008-03-06 | Sony Corporation | Natural Language Interface Control System |
US20080059185A1 (en) | 2006-08-25 | 2008-03-06 | Hoon Chung | Speech recognition system for mobile terminal |
US20080071520A1 (en) | 2006-09-14 | 2008-03-20 | David Lee Sanford | Method and system for improving the word-recognition rate of speech recognition software |
US20080082510A1 (en) | 2006-10-03 | 2008-04-03 | Shazam Entertainment Ltd | Method for High-Throughput Identification of Distributed Broadcast Content |
US7373209B2 (en) | 2001-03-22 | 2008-05-13 | Matsushita Electric Industrial Co., Ltd. | Sound features extracting apparatus, sound data registering apparatus, sound data retrieving apparatus, and methods and programs for implementing the same |
US7379875B2 (en) | 2003-10-24 | 2008-05-27 | Microsoft Corporation | Systems and methods for generating audio thumbnails |
US20080126089A1 (en) | 2002-10-31 | 2008-05-29 | Harry Printz | Efficient Empirical Determination, Computation, and Use of Acoustic Confusability Measures |
US20080134264A1 (en) | 2006-11-30 | 2008-06-05 | Motorola, Inc. | Method and apparatus for interactivity with broadcast media |
US20080148224A1 (en) | 2004-10-21 | 2008-06-19 | International Business Machines Corporation | Method and Apparatus for Automatically Converting Numeric Data to a Processor Efficient Format for Performing Arithmetic Operations |
US20080154951A1 (en) | 2006-12-22 | 2008-06-26 | Yahoo! Inc. | Link Retrofitting of Digital Media Objects |
US20080172224A1 (en) | 2007-01-11 | 2008-07-17 | Microsoft Corporation | Position-dependent phonetic models for reliable pronunciation identification |
US20080215319A1 (en) | 2007-03-01 | 2008-09-04 | Microsoft Corporation | Query by humming for ringtone search and download |
US20080228496A1 (en) | 2007-03-15 | 2008-09-18 | Microsoft Corporation | Speech-centric multimodal user interface design in mobile technology |
US20080235017A1 (en) | 2007-03-22 | 2008-09-25 | Honda Motor Co., Ltd. | Voice interaction device, voice interaction method, and voice interaction program |
US20080235872A1 (en) | 2007-03-30 | 2008-10-02 | Newkirk David C | User interface for hospital bed |
US20080249982A1 (en) | 2005-11-01 | 2008-10-09 | Ohigo, Inc. | Audio search system |
US20080255937A1 (en) | 2007-04-10 | 2008-10-16 | Yahoo! Inc. | System for optimizing the performance of online advertisements using a network of users and advertisers |
US20080256115A1 (en) | 2007-04-11 | 2008-10-16 | Oleg Beletski | Systems, apparatuses and methods for identifying transitions of content |
US7444353B1 (en) | 2000-01-31 | 2008-10-28 | Chen Alexander C | Apparatus for delivering music and information |
US20080270129A1 (en) | 2005-02-17 | 2008-10-30 | Loquendo S.P.A. | Method and System for Automatically Providing Linguistic Formulations that are Outside a Recognition Domain of an Automatic Speech Recognition System |
US7464065B2 (en) | 2005-11-21 | 2008-12-09 | International Business Machines Corporation | Object specific language extension interface for a multi-level data structure |
US20090013255A1 (en) | 2006-12-30 | 2009-01-08 | Matthew John Yuschik | Method and System for Supporting Graphical User Interfaces |
US20090030686A1 (en) | 2007-07-27 | 2009-01-29 | Fuliang Weng | Method and system for computing or determining confidence scores for parse trees at all levels |
US20090037174A1 (en) | 2007-07-31 | 2009-02-05 | Microsoft Corporation | Understanding spoken location information based on intersections |
US20090031882A1 (en) | 2004-07-09 | 2009-02-05 | Sony Deutschland Gmbh | Method for Classifying Music |
US20090064029A1 (en) | 2006-11-27 | 2009-03-05 | Brightqube, Inc. | Methods of Creating and Displaying Images in a Dynamic Mosaic |
US20090063147A1 (en) | 2002-06-28 | 2009-03-05 | Conceptual Speech Llc | Phonetic, syntactic and conceptual analysis driven speech recognition system and method |
US20090112593A1 (en) | 2007-10-24 | 2009-04-30 | Harman Becker Automotive Systems Gmbh | System for recognizing speech for searching a database |
US20090119097A1 (en) | 2007-11-02 | 2009-05-07 | Melodis Inc. | Pitch selection modules in a system for automatic transcription of sung or hummed melodies |
US20090125306A1 (en) | 2007-09-19 | 2009-05-14 | International Business Machines Corporation | Method, system and computer program for enhanced speech recognition of digits input strings |
US20090150341A1 (en) | 2007-12-05 | 2009-06-11 | Microsoft Corporation | Generation of alternative phrasings for short descriptions |
US7562392B1 (en) | 1999-05-19 | 2009-07-14 | Digimarc Corporation | Methods of interacting with audio and ambient music |
US7567899B2 (en) | 2004-12-30 | 2009-07-28 | All Media Guide, Llc | Methods and apparatus for audio recognition |
US7580832B2 (en) | 2004-07-26 | 2009-08-25 | M2Any Gmbh | Apparatus and method for robust classification of audio signals, and method for establishing and operating an audio-signal database, as well as computer program |
US20090216525A1 (en) | 2008-02-22 | 2009-08-27 | Vocera Communications, Inc. | System and method for treating homonyms in a speech recognition system |
US20090228799A1 (en) | 2008-02-29 | 2009-09-10 | Sony Corporation | Method for visualizing audio data |
US20090240488A1 (en) | 2008-03-19 | 2009-09-24 | Yap, Inc. | Corrective feedback loop for automated speech recognition |
US20090240499A1 (en) | 2008-03-19 | 2009-09-24 | Zohar Dvir | Large vocabulary quick learning speech recognition system |
US7606708B2 (en) | 2005-02-01 | 2009-10-20 | Samsung Electronics Co., Ltd. | Apparatus, method, and medium for generating grammar network for use in speech recognition and dialogue speech recognition |
US20090271199A1 (en) | 2008-04-24 | 2009-10-29 | International Business Machines | Records Disambiguation In A Multimodal Application Operating On A Multimodal Device |
US20100014828A1 (en) | 2008-07-17 | 2010-01-21 | Indata Corporation | Video management system and method |
US20100042414A1 (en) | 2008-08-18 | 2010-02-18 | At&T Intellectual Property I, L.P. | System and method for improving name dialer performance |
WO2010018586A2 (en) | 2008-08-14 | 2010-02-18 | Tunewiki Inc | A method and a system for real time music playback syncronization, dedicated players, locating audio content, following most listened-to lists and phrase searching for sing-along |
US20100049514A1 (en) | 2005-08-31 | 2010-02-25 | Voicebox Technologies, Inc. | Dynamic speech sharpening |
US7698136B1 (en) | 2003-01-28 | 2010-04-13 | Voxify, Inc. | Methods and apparatus for flexible speech recognition |
US20100100384A1 (en) | 2008-10-21 | 2010-04-22 | Microsoft Corporation | Speech Recognition System with Display Information |
US20100124892A1 (en) | 2008-11-19 | 2010-05-20 | Concert Technology Corporation | System and method for internet radio station program discovery |
US7743092B2 (en) | 1999-11-12 | 2010-06-22 | Wood Lawson A | Method for recognizing and distributing music |
US20100158488A1 (en) | 2001-07-31 | 2010-06-24 | Gracenote, Inc. | Multiple step identification of recordings |
US7756874B2 (en) | 2000-07-06 | 2010-07-13 | Microsoft Corporation | System and methods for providing automatic classification of media entities according to consonance properties |
US7765097B1 (en) | 2006-03-20 | 2010-07-27 | Intuit Inc. | Automatic code generation via natural language processing |
US20100205166A1 (en) | 1999-11-10 | 2010-08-12 | Boulter Jeffrey R | Internet radio and broadcast method |
US20100211693A1 (en) | 2010-05-04 | 2010-08-19 | Aaron Steven Master | Systems and Methods for Sound Recognition |
US20100241418A1 (en) | 2009-03-23 | 2010-09-23 | Sony Corporation | Voice recognition device and voice recognition method, language model generating device and language model generating method, and computer program |
US20100250497A1 (en) | 2007-01-05 | 2010-09-30 | Redlich Ron M | Electromagnetic pulse (EMP) hardened information infrastructure with extractor, cloud dispersal, secure storage, content analysis and classification and method therefor |
US20100286979A1 (en) | 2007-08-01 | 2010-11-11 | Ginger Software, Inc. | Automatic context sensitive language correction and enhancement using an internet corpus |
US20100312782A1 (en) | 2009-06-05 | 2010-12-09 | Microsoft Corporation | Presenting search results according to query domains |
US7853664B1 (en) | 2000-07-31 | 2010-12-14 | Landmark Digital Services Llc | Method and system for purchasing pre-recorded music |
US20110035219A1 (en) | 2009-08-04 | 2011-02-10 | Autonomy Corporation Ltd. | Automatic spoken language identification based on phoneme sequence patterns |
US20110046951A1 (en) | 2009-08-21 | 2011-02-24 | David Suendermann | System and method for building optimal state-dependent statistical utterance classifiers in spoken dialog systems |
US7904297B2 (en) | 2005-05-31 | 2011-03-08 | Robert Bosch Gmbh | Dialogue management using scripts and combined confidence scores |
US7908135B2 (en) | 2006-05-31 | 2011-03-15 | Victor Company Of Japan, Ltd. | Music-piece classification based on sustain regions |
US20110071819A1 (en) | 2009-09-22 | 2011-03-24 | Tanya Miller | Apparatus, system, and method for natural language processing |
US20110082688A1 (en) | 2009-10-01 | 2011-04-07 | Samsung Electronics Co., Ltd. | Apparatus and Method for Analyzing Intention |
US20110131043A1 (en) | 2007-12-25 | 2011-06-02 | Fumihiro Adachi | Voice recognition system, voice recognition method, and program for voice recognition |
US20110173208A1 (en) | 2010-01-13 | 2011-07-14 | Rovi Technologies Corporation | Rolling audio recognition |
US20110213475A1 (en) | 2009-08-28 | 2011-09-01 | Tilman Herberger | System and method for interactive visualization of music properties |
US8013230B2 (en) | 2007-12-17 | 2011-09-06 | Sony Corporation | Method for music structure analysis |
US20110244784A1 (en) | 2004-02-19 | 2011-10-06 | Landmark Digital Services Llc | Method and apparatus for identification of broadcast source |
US20110276334A1 (en) | 2000-12-12 | 2011-11-10 | Avery Li-Chun Wang | Methods and Systems for Synchronizing Media |
US8073684B2 (en) | 2003-04-25 | 2011-12-06 | Texas Instruments Incorporated | Apparatus and method for automatic classification/identification of similar compressed audio files |
US8099281B2 (en) | 2005-06-06 | 2012-01-17 | Nunance Communications, Inc. | System and method for word-sense disambiguation by recursive partitioning |
US20120029670A1 (en) | 2010-07-29 | 2012-02-02 | Soundhound, Inc. | System and methods for continuous audio matching |
US20120035924A1 (en) | 2010-08-06 | 2012-02-09 | Google Inc. | Disambiguating input based on context |
US20120046936A1 (en) | 2009-04-07 | 2012-02-23 | Lemi Technology, Llc | System and method for distributed audience feedback on semantic analysis of media content |
US20120065960A1 (en) | 2010-09-14 | 2012-03-15 | International Business Machines Corporation | Generating parser combination by combining language processing parsers |
US20120089400A1 (en) | 2010-10-06 | 2012-04-12 | Caroline Gilles Henton | Systems and methods for using homophone lexicons in english text-to-speech |
US20120216178A1 (en) | 2008-12-10 | 2012-08-23 | International Business Machines Corporation | Compiler generator |
US8296179B1 (en) | 2007-05-02 | 2012-10-23 | Monster Worldwide, Inc. | Targeted advertisement placement based on explicit and implicit criteria matching |
US20120303371A1 (en) | 2011-05-23 | 2012-11-29 | Nuance Communications, Inc. | Methods and apparatus for acoustic disambiguation |
US20120323557A1 (en) | 2011-06-19 | 2012-12-20 | Detlef Koll | Speech Recognition Using Context-Aware Recognition Models |
US20130006631A1 (en) | 2011-06-28 | 2013-01-03 | Utah State University | Turbo Processing of Speech Recognition |
US8358966B2 (en) | 2004-08-31 | 2013-01-22 | Astro West Llc | Detecting and measuring exposure to media content items |
US20130041647A1 (en) | 2011-08-11 | 2013-02-14 | Apple Inc. | Method for disambiguating multiple readings in language conversion |
US20130055223A1 (en) | 2011-08-25 | 2013-02-28 | Myezapp Inc. | Compiler with Error Handling |
US20130052939A1 (en) | 2011-08-30 | 2013-02-28 | Clear Channel Management Services, Inc. | Broadcast Source Identification Based on Matching Broadcast Signal Fingerprints |
US20130096911A1 (en) | 2010-04-21 | 2013-04-18 | Universite Catholique De Louvain | Normalisation of noisy typewritten texts |
US20130111440A1 (en) * | 2011-10-28 | 2013-05-02 | Michael Forster | Methods, Apparatuses, and Computer-Readable Media for Computing Checksums for Effective Caching in Continuous Distributed Builds |
US8447608B1 (en) | 2008-12-10 | 2013-05-21 | Adobe Systems Incorporated | Custom language models for audio content |
US20130151250A1 (en) | 2011-12-08 | 2013-06-13 | Lenovo (Singapore) Pte. Ltd | Hybrid speech recognition |
WO2013177213A2 (en) | 2012-05-24 | 2013-11-28 | Soundhound, Inc. | Systems and methods for enabling natural language processing |
US20140019483A1 (en) | 2010-07-29 | 2014-01-16 | Soundhound, Inc. | Systems and Methods for Generating and Using Shared Natural Language Libraries |
US20140032220A1 (en) | 2012-07-27 | 2014-01-30 | Solomon Z. Lerner | Method and Apparatus for Responding to a Query at a Dialog System |
US20140039895A1 (en) | 2012-08-03 | 2014-02-06 | Veveo, Inc. | Method for using pauses detected in speech input to assist in interpreting the input during conversational interaction for information retrieval |
US20140067394A1 (en) | 2012-08-28 | 2014-03-06 | King Abdulaziz City For Science And Technology | System and method for decoding speech |
US20140074470A1 (en) | 2012-09-11 | 2014-03-13 | Google Inc. | Phonetic pronunciation |
US8762156B2 (en) | 2011-09-28 | 2014-06-24 | Apple Inc. | Speech recognition repair using contextual information |
US20140205974A1 (en) | 2011-06-30 | 2014-07-24 | Rosetta Stone, Ltd. | Statistical machine translation framework for modeling phonological errors in computer assisted pronunciation training system |
US8843369B1 (en) | 2013-12-27 | 2014-09-23 | Google Inc. | Speech endpointing based on voice profile |
US20140297252A1 (en) | 2012-12-06 | 2014-10-02 | Raytheon Bbn Technologies Corp. | Active error detection and resolution for linguistic translation |
US20140316785A1 (en) | 1999-11-12 | 2014-10-23 | Nuance Communications, Inc. | Speech recognition system interactive agent |
US20140324433A1 (en) | 2013-04-26 | 2014-10-30 | Wistron Corporation | Method and device for learning language and computer readable recording medium |
US20140324427A1 (en) | 2003-05-15 | 2014-10-30 | At&T Intellectual Property Ii, L.P. | System and dialog manager developed using modular spoken-dialog components |
US20140358533A1 (en) | 2013-05-30 | 2014-12-04 | International Business Machines Corporation | Pronunciation accuracy in speech recognition |
US8924212B1 (en) | 2005-08-26 | 2014-12-30 | At&T Intellectual Property Ii, L.P. | System and method for robust access and entry to large structured data using voice form-filling |
US20150039317A1 (en) | 2013-07-31 | 2015-02-05 | Microsoft Corporation | System with multiple simultaneous speech recognizers |
WO2015017746A1 (en) | 2013-08-02 | 2015-02-05 | Herdman Rodrick A | Device for mounting and securing a handgun |
US20150106082A1 (en) | 2013-10-16 | 2015-04-16 | Interactive Intelligence Group, Inc. | System and Method for Learning Alternate Pronunciations for Speech Recognition |
US20150112679A1 (en) | 2013-10-18 | 2015-04-23 | Via Technologies, Inc. | Method for building language model, speech recognition method and electronic apparatus |
US20150161985A1 (en) | 2013-12-09 | 2015-06-11 | Google Inc. | Pronunciation verification |
US20160148615A1 (en) | 2014-11-26 | 2016-05-26 | Samsung Electronics Co., Ltd. | Method and electronic device for voice recognition |
US20160232894A1 (en) | 2013-10-08 | 2016-08-11 | Samsung Electronics Co., Ltd. | Method and apparatus for performing voice recognition on basis of device information |
US9646628B1 (en) | 2015-06-26 | 2017-05-09 | Amazon Technologies, Inc. | Noise cancellation for open microphone mode |
US20170178623A1 (en) | 2015-12-22 | 2017-06-22 | Oren Shamir | Technologies for end-of-sentence detection using syntactic coherence |
US9697827B1 (en) | 2012-12-11 | 2017-07-04 | Amazon Technologies, Inc. | Error reduction in speech processing |
US20180061399A1 (en) | 2016-08-30 | 2018-03-01 | Lenovo Enterprise Solutions (Singapore) Pte. Ltd. | Spoken utterance stop event other than pause or cessation in spoken utterances stream |
US20180308489A1 (en) | 2017-04-24 | 2018-10-25 | Samsung Electronics Co., Ltd. | Electronic device for providing speech recognition service and method thereof |
US20180330723A1 (en) | 2017-05-12 | 2018-11-15 | Apple Inc. | Low-latency intelligent automated assistant |
US10224030B1 (en) | 2013-03-14 | 2019-03-05 | Amazon Technologies, Inc. | Dynamic gazetteers for personalized entity recognition |
US20220374708A1 (en) * | 2021-05-18 | 2022-11-24 | Keepcon Corporation | System and method for content automated classification |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7092888B1 (en) * | 2001-10-26 | 2006-08-15 | Verizon Corporate Services Group Inc. | Unsupervised training in natural language call routing |
US20040077519A1 (en) | 2002-06-28 | 2004-04-22 | The Procter & Gamble Co. | Ionic liquid based products and method of using the same |
US7379867B2 (en) * | 2003-06-03 | 2008-05-27 | Microsoft Corporation | Discriminative training of language models for text and speech classification |
US7440895B1 (en) * | 2003-12-01 | 2008-10-21 | Lumenvox, Llc. | System and method for tuning and testing in a speech recognition system |
US20100312784A1 (en) | 2009-06-08 | 2010-12-09 | Microsoft Corporation | Notification-based cache invalidation for complex database queries |
US20120035942A1 (en) | 2010-08-06 | 2012-02-09 | Sven Graupner | Managing business relationships using a third-party service |
-
2013
- 2013-03-15 US US13/842,735 patent/US10957310B1/en active Active
-
2018
- 2018-12-04 US US16/209,854 patent/US10996931B1/en active Active
-
2021
- 2021-04-08 US US17/225,997 patent/US11776533B2/en active Active
-
2023
- 2023-10-02 US US18/375,906 patent/US20240029721A1/en active Pending
Patent Citations (333)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3919479A (en) | 1972-09-21 | 1975-11-11 | First National Bank Of Boston | Broadcast signal identification system |
US4450531A (en) | 1982-09-10 | 1984-05-22 | Ensco, Inc. | Broadcast signal recognition system and method |
US4697209A (en) | 1984-04-26 | 1987-09-29 | A. C. Nielsen Company | Methods and apparatus for automatically identifying programs viewed or recorded |
US4739398A (en) | 1986-05-02 | 1988-04-19 | Control Data Corporation | Method, apparatus and system for recognizing broadcast segments |
US4928249A (en) | 1987-02-27 | 1990-05-22 | Societe Anonyme Dite : Smh Alcatel | Operating system for an electronic franking machine |
US4959850A (en) | 1987-05-29 | 1990-09-25 | Kabushiki Kaisha Toshiba | Radio telephone apparatus |
US4843562A (en) | 1987-06-24 | 1989-06-27 | Broadcast Data Systems Limited Partnership | Broadcast information classification system and method |
US4918730A (en) | 1987-06-24 | 1990-04-17 | Media Control-Musik-Medien-Analysen Gesellschaft Mit Beschrankter Haftung | Process and circuit arrangement for the automatic recognition of signal sequences |
US5019899A (en) | 1988-11-01 | 1991-05-28 | Control Data Corporation | Electronic data encoding and recognition system |
US5054074A (en) | 1989-03-02 | 1991-10-01 | International Business Machines Corporation | Optimized speech recognition system and method |
US5033087A (en) | 1989-03-14 | 1991-07-16 | International Business Machines Corp. | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system |
US5164915A (en) | 1990-09-26 | 1992-11-17 | Information Storage Devices, Inc. | Cascading analog record/playback devices |
US5457768A (en) | 1991-08-13 | 1995-10-10 | Kabushiki Kaisha Toshiba | Speech recognition apparatus using syntactic and semantic analysis |
US5436653A (en) | 1992-04-30 | 1995-07-25 | The Arbitron Company | Method and system for recognition of broadcast segments |
US5577249A (en) | 1992-07-31 | 1996-11-19 | International Business Machines Corporation | Method for finding a reference token sequence in an original token string within a database of token strings using appended non-contiguous substrings |
US5437050A (en) | 1992-11-09 | 1995-07-25 | Lamb; Robert G. | Method and apparatus for recognizing broadcast information using multi-frequency magnitude detection |
US6611607B1 (en) | 1993-11-18 | 2003-08-26 | Digimarc Corporation | Integrating digital watermarks in multimedia content |
US5511000A (en) | 1993-11-18 | 1996-04-23 | Kaloi; Dennis M. | Electronic solid-state record/playback device and system |
US5862260A (en) | 1993-11-18 | 1999-01-19 | Digimarc Corporation | Methods for surveying dissemination of proprietary empirical data |
US5581658A (en) | 1993-12-14 | 1996-12-03 | Infobase Systems, Inc. | Adaptive system for broadcast program identification and reporting |
US5956683A (en) | 1993-12-22 | 1999-09-21 | Qualcomm Incorporated | Distributed voice recognition system |
US5845306A (en) | 1994-06-01 | 1998-12-01 | Mitsubishi Electric Information Technology Center America, Inc. | Context based system for accessing dictionary entries |
US5664270A (en) | 1994-07-19 | 1997-09-09 | Kinetic Concepts, Inc. | Patient interface system |
US6314577B1 (en) | 1994-10-27 | 2001-11-06 | Michael H. Pocock | Apparatus and method to generate and access broadcast information |
US5708477A (en) | 1994-12-01 | 1998-01-13 | Forbes; Scott Jared | Video signal identifier for controlling a VCR and television based on the occurrence of commercials |
US5687279A (en) | 1994-12-23 | 1997-11-11 | Intel Corporation | Retro-storing analog information in a digital storage circuit |
US5634084A (en) | 1995-01-20 | 1997-05-27 | Centigram Communications Corporation | Abbreviation and acronym/initialism expansion procedures for a text to speech reader |
US5542138A (en) | 1995-02-06 | 1996-08-06 | Williams; Terry N. | Bedside control unit for a hospital bed |
US6614914B1 (en) | 1995-05-08 | 2003-09-02 | Digimarc Corporation | Watermark embedder and reader |
US20020048350A1 (en) | 1995-05-26 | 2002-04-25 | Michael S. Phillips | Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system |
US6292767B1 (en) | 1995-07-18 | 2001-09-18 | Nuance Communications | Method and system for building and running natural language understanding systems |
US6629066B1 (en) | 1995-07-18 | 2003-09-30 | Nuance Communications | Method and system for building and running natural language understanding systems |
US6122403A (en) | 1995-07-27 | 2000-09-19 | Digimarc Corporation | Computer system linked by using information in data objects |
US6505160B1 (en) | 1995-07-27 | 2003-01-07 | Digimarc Corporation | Connected audio and other media objects |
US7017208B2 (en) | 1995-08-04 | 2006-03-28 | Hill-Rom Services, Inc. | Hospital bed |
US5974409A (en) | 1995-08-23 | 1999-10-26 | Microsoft Corporation | System and method for locating information in an on-line network |
US6594628B1 (en) | 1995-09-21 | 2003-07-15 | Qualcomm, Incorporated | Distributed voice recognition system |
US6108626A (en) | 1995-10-27 | 2000-08-22 | Cselt-Centro Studi E Laboratori Telecomunicazioni S.P.A. | Object oriented audio coding |
US5874686A (en) | 1995-10-31 | 1999-02-23 | Ghias; Asif U. | Apparatus and method for searching a melody |
US5880386A (en) | 1995-11-30 | 1999-03-09 | Yamaha Corporation | Musical information processing system with automatic data transfer |
US5907815A (en) | 1995-12-07 | 1999-05-25 | Texas Instruments Incorporated | Portable computer stored removable mobile telephone |
US5991737A (en) | 1996-03-11 | 1999-11-23 | Connexus Corporation | Automated consumer response to publicly broadcast information |
US6804645B1 (en) | 1996-04-02 | 2004-10-12 | Siemens Aktiengesellschaft | Dynamic phoneme dictionary for speech recognition |
US6510325B1 (en) | 1996-04-19 | 2003-01-21 | Mack, Ii Gawins A. | Convertible portable telephone |
US20010014891A1 (en) | 1996-05-24 | 2001-08-16 | Eric M. Hoffert | Display of media previews |
US5918223A (en) | 1996-07-22 | 1999-06-29 | Muscle Fish | Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information |
US6931451B1 (en) | 1996-10-03 | 2005-08-16 | Gotuit Media Corp. | Systems and methods for modifying broadcast programming |
US20070204319A1 (en) | 1996-12-05 | 2007-08-30 | Subutai Ahmad | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US6188985B1 (en) | 1997-01-06 | 2001-02-13 | Texas Instruments Incorporated | Wireless voice-activated device for control of a processor-based host system |
US5963957A (en) | 1997-04-28 | 1999-10-05 | Philips Electronics North America Corporation | Bibliographic music data base with normalized musical themes |
US6067516A (en) | 1997-05-09 | 2000-05-23 | Siemens Information | Speech and text messaging system with distributed speech recognition and speaker database transfers |
US6405029B1 (en) | 1997-06-19 | 2002-06-11 | Byard G. Nilsson | Wireless prepaid telephone system with dispensable instruments |
US6049710A (en) | 1997-06-19 | 2000-04-11 | Kimberley Nanette Engen | Wireless prepaid telephone system with dispensable instruments |
US7190971B1 (en) | 1997-07-29 | 2007-03-13 | Sony Corporation | Information processing apparatus and method, information processing system, and transmission medium |
US6640306B1 (en) | 1997-08-15 | 2003-10-28 | Sony Corporation | Data transmitter, data reproduction method, terminal, and method for transferring data transmitter |
US6163767A (en) | 1997-09-19 | 2000-12-19 | International Business Machines Corporation | Speech recognition method and system for recognizing single or un-correlated Chinese characters |
US6487532B1 (en) | 1997-09-24 | 2002-11-26 | Scansoft, Inc. | Apparatus and method for distinguishing similar-sounding utterances speech recognition |
WO1999018518A2 (en) | 1997-10-03 | 1999-04-15 | Peter Polash | Internet based musical indexing system for radio |
US6092039A (en) | 1997-10-31 | 2000-07-18 | International Business Machines Corporation | Symbiotic automatic speech recognition and vocoder |
US6504089B1 (en) | 1997-12-24 | 2003-01-07 | Canon Kabushiki Kaisha | System for and method of searching music data, and recording medium for use therewith |
US6098042A (en) | 1998-01-30 | 2000-08-01 | International Business Machines Corporation | Homograph filter for speech synthesis system |
US6182128B1 (en) | 1998-03-05 | 2001-01-30 | Touchmusic Entertainment Llc | Real-time music distribution systems |
JPH11272274A (en) | 1998-03-19 | 1999-10-08 | Tomoya Sonoda | Method for retrieving piece of music by use of singing voice |
US6121530A (en) | 1998-03-19 | 2000-09-19 | Sonoda; Tomonari | World Wide Web-based melody retrieval system with thresholds determined by using distribution of pitch and span of notes |
EP0944033A1 (en) | 1998-03-19 | 1999-09-22 | Tomonari Sonoda | Melody retrieval system and method |
US6201176B1 (en) | 1998-05-07 | 2001-03-13 | Canon Kabushiki Kaisha | System and method for querying a music database |
US5969283A (en) | 1998-06-17 | 1999-10-19 | Looney Productions, Llc | Music organizer and entertainment center |
US6345256B1 (en) | 1998-08-13 | 2002-02-05 | International Business Machines Corporation | Automated method and apparatus to package digital content for electronic distribution using the identity of the source content |
US6385434B1 (en) | 1998-09-16 | 2002-05-07 | Motorola, Inc. | Wireless access unit utilizing adaptive spectrum exploitation |
US7233321B1 (en) | 1998-12-15 | 2007-06-19 | Intel Corporation | Pointing device with integrated audio input |
JP2000187671A (en) | 1998-12-21 | 2000-07-04 | Tomoya Sonoda | Music retrieval system with singing voice using network and singing voice input terminal equipment to be used at the time of retrieval |
US7209892B1 (en) | 1998-12-24 | 2007-04-24 | Universal Music Group, Inc. | Electronic music/media distribution system |
US6246986B1 (en) | 1998-12-31 | 2001-06-12 | At&T Corp. | User barge-in enablement in large vocabulary speech recognition systems |
US6233682B1 (en) | 1999-01-22 | 2001-05-15 | Bernhard Fritsch | Distribution of musical products by a web site vendor over the internet |
US7058376B2 (en) | 1999-01-27 | 2006-06-06 | Logan James D | Radio receiving, recording and playback system |
US20020198713A1 (en) | 1999-01-29 | 2002-12-26 | Franz Alexander M. | Method and apparatus for perfoming spoken language translation |
US6996529B1 (en) | 1999-03-15 | 2006-02-07 | British Telecommunications Public Limited Company | Speech synthesis with prosodic phrase boundary information |
US6631346B1 (en) | 1999-04-07 | 2003-10-07 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for natural language parsing using multiple passes and tags |
US6408272B1 (en) | 1999-04-12 | 2002-06-18 | General Magic, Inc. | Distributed voice user interface |
US6434520B1 (en) | 1999-04-16 | 2002-08-13 | International Business Machines Corporation | System and method for indexing and querying audio archives |
US7562392B1 (en) | 1999-05-19 | 2009-07-14 | Digimarc Corporation | Methods of interacting with audio and ambient music |
US6363349B1 (en) | 1999-05-28 | 2002-03-26 | Motorola, Inc. | Method and apparatus for performing distributed speech processing in a communication system |
US20030050784A1 (en) | 1999-07-01 | 2003-03-13 | Hoffberg Mark B. | Content-driven speech- or audio-browser |
US6519564B1 (en) | 1999-07-01 | 2003-02-11 | Koninklijke Philips Electronics N.V. | Content-driven speech-or audio-browser |
US20030233225A1 (en) * | 1999-08-24 | 2003-12-18 | Virtual Research Associates, Inc. | Natural language sentence parser |
US20030167166A1 (en) | 1999-09-04 | 2003-09-04 | International Business Machines Corporation | Speech recognition system |
US6708150B1 (en) | 1999-09-09 | 2004-03-16 | Zanavi Informatics Corporation | Speech recognition apparatus and speech recognition navigation apparatus |
US7783489B2 (en) | 1999-09-21 | 2010-08-24 | Iceberg Industries Llc | Audio identification system and method |
US20020023020A1 (en) | 1999-09-21 | 2002-02-21 | Kenyon Stephen C. | Audio identification system and method |
US7174293B2 (en) | 1999-09-21 | 2007-02-06 | Iceberg Industries Llc | Audio identification system and method |
US7266343B1 (en) | 1999-09-29 | 2007-09-04 | Nokia Corporation | Method of and system for providing identification of a piece of information |
US6941275B1 (en) | 1999-10-07 | 2005-09-06 | Remi Swierczek | Music identification system |
US20080059188A1 (en) | 1999-10-19 | 2008-03-06 | Sony Corporation | Natural Language Interface Control System |
US20100205166A1 (en) | 1999-11-10 | 2010-08-12 | Boulter Jeffrey R | Internet radio and broadcast method |
US20140316785A1 (en) | 1999-11-12 | 2014-10-23 | Nuance Communications, Inc. | Speech recognition system interactive agent |
US7743092B2 (en) | 1999-11-12 | 2010-06-22 | Wood Lawson A | Method for recognizing and distributing music |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US20100235341A1 (en) | 1999-11-12 | 2010-09-16 | Phoenix Solutions, Inc. | Methods and Systems for Searching Using Spoken Input and User Context Information |
US20050086059A1 (en) | 1999-11-12 | 2005-04-21 | Bennett Ian M. | Partial speech processing device & method for use in distributed systems |
US7257536B1 (en) | 1999-11-23 | 2007-08-14 | Radiant Systems, Inc. | Audio request interaction system |
US20010005823A1 (en) | 1999-12-24 | 2001-06-28 | Uwe Fischer | Method and system for generating a characteristic identifier for digital data and for detecting identical digital data |
US6535849B1 (en) | 2000-01-18 | 2003-03-18 | Scansoft, Inc. | Method and system for generating semi-literal transcripts for speech recognition systems |
US7444353B1 (en) | 2000-01-31 | 2008-10-28 | Chen Alexander C | Apparatus for delivering music and information |
US6834308B1 (en) | 2000-02-17 | 2004-12-21 | Audible Magic Corporation | Method and apparatus for identifying media content presented on a media playing device |
US7225132B2 (en) | 2000-03-14 | 2007-05-29 | British Telecommunications Plc | Method for assigning an identification code |
US20040167779A1 (en) | 2000-03-14 | 2004-08-26 | Sony Corporation | Speech recognition apparatus, speech recognition method, and recording medium |
US20010053974A1 (en) | 2000-03-14 | 2001-12-20 | Helmut Lucke | Speech recognition apparatus, speech recognition method, and recording medium |
US7206820B1 (en) | 2000-03-18 | 2007-04-17 | Digimarc Corporation | System for linking from object to remote resource |
US20010049601A1 (en) | 2000-03-24 | 2001-12-06 | John Kroeker | Phonetic data processing system and method |
US6633845B1 (en) | 2000-04-07 | 2003-10-14 | Hewlett-Packard Development Company, L.P. | Music summarization system and method |
US20020138265A1 (en) | 2000-05-02 | 2002-09-26 | Daniell Stevens | Error correction in speech recognition |
US6542869B1 (en) | 2000-05-11 | 2003-04-01 | Fuji Xerox Co., Ltd. | Method for automatic analysis of audio including music and speech |
US6453252B1 (en) | 2000-05-15 | 2002-09-17 | Creative Technology Ltd. | Process for identifying audio content |
US20020042707A1 (en) | 2000-06-19 | 2002-04-11 | Gang Zhao | Grammar-packaged parsing |
US6850288B2 (en) | 2000-07-04 | 2005-02-01 | Matsushita Electric Industrial Co., Ltd. | Apparatus for providing programming |
US7756874B2 (en) | 2000-07-06 | 2010-07-13 | Microsoft Corporation | System and methods for providing automatic classification of media entities according to consonance properties |
US20020083060A1 (en) | 2000-07-31 | 2002-06-27 | Wang Avery Li-Chun | System and methods for recognizing sound and music signals in high noise and distortion |
US7853664B1 (en) | 2000-07-31 | 2010-12-14 | Landmark Digital Services Llc | Method and system for purchasing pre-recorded music |
US20060122839A1 (en) | 2000-07-31 | 2006-06-08 | Avery Li-Chun Wang | System and methods for recognizing sound and music signals in high noise and distortion |
US6990453B2 (en) | 2000-07-31 | 2006-01-24 | Landmark Digital Services Llc | System and methods for recognizing sound and music signals in high noise and distortion |
US20030083863A1 (en) | 2000-09-08 | 2003-05-01 | Ringger Eric K. | Augmented-word language model |
US20020049037A1 (en) | 2000-09-13 | 2002-04-25 | Christensen Kelly M. | System and method for ordering and delivering media content |
US6507727B1 (en) | 2000-10-13 | 2003-01-14 | Robert F. Henrick | Purchase and delivery of digital content using multiple devices and data networks |
US6879950B1 (en) | 2000-10-19 | 2005-04-12 | Motorola, Inc. | System and method of decoding a packed representation of multiple parses |
US7085716B1 (en) | 2000-10-26 | 2006-08-01 | Nuance Communications, Inc. | Speech recognition using word-in-phrase command |
US20020198719A1 (en) | 2000-12-04 | 2002-12-26 | International Business Machines Corporation | Reusable voiceXML dialog components, subdialogs and beans |
US20020072982A1 (en) | 2000-12-12 | 2002-06-13 | Shazam Entertainment Ltd. | Method and system for interacting with a user in an experiential environment |
US20110276334A1 (en) | 2000-12-12 | 2011-11-10 | Avery Li-Chun Wang | Methods and Systems for Synchronizing Media |
US20020116191A1 (en) | 2000-12-26 | 2002-08-22 | International Business Machines Corporation | Augmentation of alternate word lists by acoustic confusability criterion |
US20020138630A1 (en) | 2000-12-27 | 2002-09-26 | Solomon Barry M. | Music scheduling algorithm |
US20030023437A1 (en) | 2001-01-27 | 2003-01-30 | Pascale Fung | System and method for context-based spontaneous speech recognition |
US20020111806A1 (en) | 2001-02-13 | 2002-08-15 | International Business Machines Corporation | Dynamic language model mixtures with history-based buckets |
US20020156627A1 (en) | 2001-02-20 | 2002-10-24 | International Business Machines Corporation | Speech recognition apparatus and computer system therefor, speech recognition method and program and recording medium therefor |
US20030163320A1 (en) | 2001-03-09 | 2003-08-28 | Nobuhide Yamazaki | Voice synthesis device |
US20030004717A1 (en) | 2001-03-22 | 2003-01-02 | Nikko Strom | Histogram grammar weighting and error corrective training of grammar weights |
US7373209B2 (en) | 2001-03-22 | 2008-05-13 | Matsushita Electric Industrial Co., Ltd. | Sound features extracting apparatus, sound data registering apparatus, sound data retrieving apparatus, and methods and programs for implementing the same |
US20020174431A1 (en) | 2001-05-15 | 2002-11-21 | John Bowman | Method and system for receiving music related information via an internet connection |
US20020198705A1 (en) | 2001-05-30 | 2002-12-26 | Burnett Gregory C. | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US20020181671A1 (en) | 2001-06-01 | 2002-12-05 | Logan James D. | Cellular telephone with audio recording subsystem |
US20020193895A1 (en) | 2001-06-18 | 2002-12-19 | Ziqiang Qian | Enhanced encoder for synchronizing multimedia files into an audio bit stream |
US20020198789A1 (en) | 2001-06-22 | 2002-12-26 | Sony Corp. And Sony Music Entertainment, Inc. | Apparatus and method for identifying and purchasing music |
US20030009335A1 (en) | 2001-07-05 | 2003-01-09 | Johan Schalkwyk | Speech recognition with dynamic grammars |
US7328153B2 (en) | 2001-07-20 | 2008-02-05 | Gracenote, Inc. | Automatic identification of sound recordings |
US20100158488A1 (en) | 2001-07-31 | 2010-06-24 | Gracenote, Inc. | Multiple step identification of recordings |
US20070120689A1 (en) | 2001-08-03 | 2007-05-31 | Zerhusen Robert M | Patient point-of-care computer system |
US20030040901A1 (en) | 2001-08-21 | 2003-02-27 | Wang Yeyi | Method and apparatus for dynamic grammars and focused semantic parsing |
US20030078928A1 (en) | 2001-10-23 | 2003-04-24 | Dorosario Alden | Network wide ad targeting |
US7089541B2 (en) * | 2001-11-30 | 2006-08-08 | Sun Microsystems, Inc. | Modular parser architecture with mini parsers |
US20040019497A1 (en) | 2001-12-04 | 2004-01-29 | Volk Andrew R. | Method and system for providing listener-requested music over a network |
US6995309B2 (en) | 2001-12-06 | 2006-02-07 | Hewlett-Packard Development Company, L.P. | System and method for music identification |
US20030106413A1 (en) | 2001-12-06 | 2003-06-12 | Ramin Samadani | System and method for music identification |
US20030110035A1 (en) | 2001-12-12 | 2003-06-12 | Compaq Information Technologies Group, L.P. | Systems and methods for combining subword detection and word detection for processing a spoken input |
US20030125945A1 (en) | 2001-12-14 | 2003-07-03 | Sean Doyle | Automatically improving a voice recognition system |
WO2003061285A2 (en) | 2001-12-24 | 2003-07-24 | Scientific Generics Limited | Captioning system |
US20030187649A1 (en) | 2002-03-27 | 2003-10-02 | Compaq Information Technologies Group, L.P. | Method to expand inputs for word or document searching |
US20030191645A1 (en) | 2002-04-05 | 2003-10-09 | Guojun Zhou | Statistical pronunciation model for text to speech |
US20030192424A1 (en) | 2002-04-11 | 2003-10-16 | Yuji Koike | Lyric display method, lyric display computer program and lyric display apparatus |
EP1367590A2 (en) | 2002-05-31 | 2003-12-03 | Pioneer Corporation | Music recording/reproducing apparatus, list generation method, and information recording medium |
US6967275B2 (en) | 2002-06-25 | 2005-11-22 | Irobot Corporation | Song-matching system and method |
US20040002858A1 (en) | 2002-06-27 | 2004-01-01 | Hagai Attias | Microphone array signal enhancement using mixture models |
US20090063147A1 (en) | 2002-06-28 | 2009-03-05 | Conceptual Speech Llc | Phonetic, syntactic and conceptual analysis driven speech recognition system and method |
US20110288855A1 (en) | 2002-06-28 | 2011-11-24 | Conceptual Speech Llc | Multi-phoneme streamer and knowledge representation speech recognition system and method |
US20040193420A1 (en) | 2002-07-15 | 2004-09-30 | Kennewick Robert A. | Mobile systems and methods for responding to natural language speech utterance |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
US20060169126A1 (en) | 2002-09-18 | 2006-08-03 | Takehiko Ishiwata | Music classification device, music classification method, and program |
US20050256715A1 (en) | 2002-10-08 | 2005-11-17 | Yoshiyuki Okimoto | Language model generation and accumulation device, speech recognition device, language model creation method, and speech recognition method |
US20080126089A1 (en) | 2002-10-31 | 2008-05-29 | Harry Printz | Efficient Empirical Determination, Computation, and Use of Acoustic Confusability Measures |
US7698136B1 (en) | 2003-01-28 | 2010-04-13 | Voxify, Inc. | Methods and apparatus for flexible speech recognition |
US20040231498A1 (en) | 2003-02-14 | 2004-11-25 | Tao Li | Music feature extraction using wavelet coefficient histograms |
US20060189298A1 (en) | 2003-03-06 | 2006-08-24 | Maurizio Marcelli | Method and software program product for mutual authentication in a communications network |
WO2004091307A2 (en) | 2003-04-08 | 2004-10-28 | Advanced Bionutriton Corporation | Feed additives against diseasse infection in terrestrial and aquatic animals |
US8073684B2 (en) | 2003-04-25 | 2011-12-06 | Texas Instruments Incorporated | Apparatus and method for automatic classification/identification of similar compressed audio files |
US20070288444A1 (en) | 2003-05-06 | 2007-12-13 | International Business Machines Corporation | Web-based customer service interface |
US20140324427A1 (en) | 2003-05-15 | 2014-10-30 | At&T Intellectual Property Ii, L.P. | System and dialog manager developed using modular spoken-dialog components |
US7725319B2 (en) | 2003-07-07 | 2010-05-25 | Dialogic Corporation | Phoneme lattice construction and its application to speech recognition and keyword spotting |
US20050010412A1 (en) | 2003-07-07 | 2005-01-13 | Hagai Aronowitz | Phoneme lattice construction and its application to speech recognition and keyword spotting |
US7323629B2 (en) | 2003-07-16 | 2008-01-29 | Univ Iowa State Res Found Inc | Real time music recognition and display system |
US20050016360A1 (en) | 2003-07-24 | 2005-01-27 | Tong Zhang | System and method for automatic classification of music |
US20050027699A1 (en) | 2003-08-01 | 2005-02-03 | Amr Awadallah | Listings optimization using a plurality of data sources |
US20050033574A1 (en) | 2003-08-06 | 2005-02-10 | Samsung Electronics Co., Ltd. | Method and apparatus handling speech recognition errors in spoken dialogue systems |
US20050060685A1 (en) | 2003-09-11 | 2005-03-17 | Ingo Franz | Program generator |
US20050143970A1 (en) | 2003-09-11 | 2005-06-30 | Voice Signal Technologies, Inc. | Pronunciation discovery for spoken words |
US7266495B1 (en) | 2003-09-12 | 2007-09-04 | Nuance Communications, Inc. | Method and system for learning linguistically valid word pronunciations from acoustic data |
US7379875B2 (en) | 2003-10-24 | 2008-05-27 | Microsoft Corporation | Systems and methods for generating audio thumbnails |
US20050125232A1 (en) | 2003-10-31 | 2005-06-09 | Gadd I. M. | Automated speech-enabled application creation method and apparatus |
US20070168413A1 (en) | 2003-12-05 | 2007-07-19 | Sony Deutschland Gmbh | Visualization and control techniques for multimedia digital content |
US20050144065A1 (en) | 2003-12-19 | 2005-06-30 | Palo Alto Research Center Incorporated | Keyword advertisement management with coordinated bidding among advertisers |
US20050144064A1 (en) | 2003-12-19 | 2005-06-30 | Palo Alto Research Center Incorporated | Keyword advertisement management |
US20050137939A1 (en) | 2003-12-19 | 2005-06-23 | Palo Alto Research Center Incorporated | Server-based keyword advertisement management |
US20110244784A1 (en) | 2004-02-19 | 2011-10-06 | Landmark Digital Services Llc | Method and apparatus for identification of broadcast source |
US20070168409A1 (en) | 2004-02-26 | 2007-07-19 | Kwan Cheung | Method and apparatus for automatic detection and identification of broadcast audio and video signals |
US20050254366A1 (en) | 2004-05-14 | 2005-11-17 | Renaud Amar | Method and apparatus for selecting an audio track based upon audio excerpts |
US20050273326A1 (en) | 2004-06-02 | 2005-12-08 | Stmicroelectronics Asia Pacific Pte. Ltd. | Energy-based audio pattern recognition |
US20060004572A1 (en) | 2004-06-30 | 2006-01-05 | Microsoft Corporation | Homonym processing in the context of voice-activated command systems |
US7858868B2 (en) | 2004-07-09 | 2010-12-28 | Sony Deutschland Gmbh | Method for classifying music using Gish distance values |
US20090031882A1 (en) | 2004-07-09 | 2009-02-05 | Sony Deutschland Gmbh | Method for Classifying Music |
US7580832B2 (en) | 2004-07-26 | 2009-08-25 | M2Any Gmbh | Apparatus and method for robust classification of audio signals, and method for establishing and operating an audio-signal database, as well as computer program |
US8358966B2 (en) | 2004-08-31 | 2013-01-22 | Astro West Llc | Detecting and measuring exposure to media content items |
US20060059225A1 (en) | 2004-09-14 | 2006-03-16 | A9.Com, Inc. | Methods and apparatus for automatic generation of recommended links |
US20060069547A1 (en) | 2004-09-15 | 2006-03-30 | Microsoft Corporation | Creating a speech recognition grammar for alphanumeric concepts |
US20080148224A1 (en) | 2004-10-21 | 2008-06-19 | International Business Machines Corporation | Method and Apparatus for Automatically Converting Numeric Data to a Processor Efficient Format for Performing Arithmetic Operations |
US20060095250A1 (en) | 2004-11-03 | 2006-05-04 | Microsoft Corporation | Parser for natural language processing |
US20060129396A1 (en) | 2004-12-09 | 2006-06-15 | Microsoft Corporation | Method and apparatus for automatic grammar generation from data entries |
US20060200350A1 (en) | 2004-12-22 | 2006-09-07 | David Attwater | Multi dimensional confidence |
US20060155694A1 (en) | 2004-12-29 | 2006-07-13 | Chowdhury Abdur R | Query routing |
US7567899B2 (en) | 2004-12-30 | 2009-07-28 | All Media Guide, Llc | Methods and apparatus for audio recognition |
US7606708B2 (en) | 2005-02-01 | 2009-10-20 | Samsung Electronics Co., Ltd. | Apparatus, method, and medium for generating grammar network for use in speech recognition and dialogue speech recognition |
US20080270129A1 (en) | 2005-02-17 | 2008-10-30 | Loquendo S.P.A. | Method and System for Automatically Providing Linguistic Formulations that are Outside a Recognition Domain of an Automatic Speech Recognition System |
US20060242017A1 (en) | 2005-03-09 | 2006-10-26 | Medio Systems, Inc. | Method and system of bidding for advertisement placement on computing devices |
US20060224384A1 (en) | 2005-03-31 | 2006-10-05 | International Business Machines Corporation | System and method for automatic speech recognition |
US7904297B2 (en) | 2005-05-31 | 2011-03-08 | Robert Bosch Gmbh | Dialogue management using scripts and combined confidence scores |
US20060277052A1 (en) | 2005-06-01 | 2006-12-07 | Microsoft Corporation | Variable speed playback of digital audio |
US8099281B2 (en) | 2005-06-06 | 2012-01-17 | Nunance Communications, Inc. | System and method for word-sense disambiguation by recursive partitioning |
US20060277030A1 (en) | 2005-06-06 | 2006-12-07 | Mark Bedworth | System, Method, and Technique for Identifying a Spoken Utterance as a Member of a List of Known Items Allowing for Variations in the Form of the Utterance |
US20060282266A1 (en) | 2005-06-13 | 2006-12-14 | Microsoft Corporation | Static analysis of grammars |
US20070010195A1 (en) | 2005-07-08 | 2007-01-11 | Cingular Wireless Llc | Mobile multimedia services ecosystem |
US20070016404A1 (en) | 2005-07-15 | 2007-01-18 | Samsung Electronics Co., Ltd. | Method and apparatus to extract important spectral component from audio signal and low bit-rate audio signal coding and/or decoding method and apparatus using the same |
US20070038453A1 (en) | 2005-08-09 | 2007-02-15 | Kabushiki Kaisha Toshiba | Speech recognition system |
US7672916B2 (en) | 2005-08-16 | 2010-03-02 | The Trustees Of Columbia University In The City Of New York | Methods, systems, and media for music classification |
US20080022844A1 (en) | 2005-08-16 | 2008-01-31 | Poliner Graham E | Methods, systems, and media for music classification |
US8924212B1 (en) | 2005-08-26 | 2014-12-30 | At&T Intellectual Property Ii, L.P. | System and method for robust access and entry to large structured data using voice form-filling |
US20100049514A1 (en) | 2005-08-31 | 2010-02-25 | Voicebox Technologies, Inc. | Dynamic speech sharpening |
US7516074B2 (en) | 2005-09-01 | 2009-04-07 | Auditude, Inc. | Extraction and matching of characteristic fingerprints from audio signals |
US20070055500A1 (en) | 2005-09-01 | 2007-03-08 | Sergiy Bilobrov | Extraction and matching of characteristic fingerprints from audio signals |
US20080249982A1 (en) | 2005-11-01 | 2008-10-09 | Ohigo, Inc. | Audio search system |
US7464065B2 (en) | 2005-11-21 | 2008-12-09 | International Business Machines Corporation | Object specific language extension interface for a multi-level data structure |
US20070156392A1 (en) | 2005-12-30 | 2007-07-05 | International Business Machines Corporation | Method and system for automatically building natural language understanding models |
US20070208569A1 (en) | 2006-03-03 | 2007-09-06 | Balan Subramanian | Communicating across voice and text channels with emotion preservation |
US7765097B1 (en) | 2006-03-20 | 2010-07-27 | Intuit Inc. | Automatic code generation via natural language processing |
US7899818B2 (en) | 2006-03-29 | 2011-03-01 | A9.Com, Inc. | Method and system for providing focused search results by excluding categories |
US20070239676A1 (en) | 2006-03-29 | 2007-10-11 | Stonehocker Timothy P | Method and system for providing focused search results |
US20070260456A1 (en) | 2006-05-02 | 2007-11-08 | Xerox Corporation | Voice message converter |
US20070260634A1 (en) | 2006-05-04 | 2007-11-08 | Nokia Corporation | Apparatus, system, method, and computer program product for synchronizing the presentation of media content |
US20110132173A1 (en) | 2006-05-31 | 2011-06-09 | Victor Company Of Japan, Ltd. | Music-piece classifying apparatus and method, and related computed program |
US7908135B2 (en) | 2006-05-31 | 2011-03-15 | Victor Company Of Japan, Ltd. | Music-piece classification based on sustain regions |
US20110132174A1 (en) | 2006-05-31 | 2011-06-09 | Victor Company Of Japan, Ltd. | Music-piece classifying apparatus and method, and related computed program |
WO2008004181A2 (en) | 2006-07-06 | 2008-01-10 | Koninklijke Philips Electronics N.V. | Methods and devices for managing a play-list of digital content |
US20080026355A1 (en) | 2006-07-27 | 2008-01-31 | Sony Ericsson Mobile Communications Ab | Song lyrics download for karaoke applications |
US20080046247A1 (en) | 2006-08-21 | 2008-02-21 | Gakuto Kurata | System And Method For Supporting Text-To-Speech |
US20080059185A1 (en) | 2006-08-25 | 2008-03-06 | Hoon Chung | Speech recognition system for mobile terminal |
US20080071520A1 (en) | 2006-09-14 | 2008-03-20 | David Lee Sanford | Method and system for improving the word-recognition rate of speech recognition software |
US20080082510A1 (en) | 2006-10-03 | 2008-04-03 | Shazam Entertainment Ltd | Method for High-Throughput Identification of Distributed Broadcast Content |
US8086171B2 (en) | 2006-10-03 | 2011-12-27 | Shazam Entertainment Ltd. | Method and system for identification of distributed broadcast content |
US7881657B2 (en) | 2006-10-03 | 2011-02-01 | Shazam Entertainment, Ltd. | Method for high-throughput identification of distributed broadcast content |
US20090064029A1 (en) | 2006-11-27 | 2009-03-05 | Brightqube, Inc. | Methods of Creating and Displaying Images in a Dynamic Mosaic |
US20080134264A1 (en) | 2006-11-30 | 2008-06-05 | Motorola, Inc. | Method and apparatus for interactivity with broadcast media |
US20080154951A1 (en) | 2006-12-22 | 2008-06-26 | Yahoo! Inc. | Link Retrofitting of Digital Media Objects |
US20090013255A1 (en) | 2006-12-30 | 2009-01-08 | Matthew John Yuschik | Method and System for Supporting Graphical User Interfaces |
US20100250497A1 (en) | 2007-01-05 | 2010-09-30 | Redlich Ron M | Electromagnetic pulse (EMP) hardened information infrastructure with extractor, cloud dispersal, secure storage, content analysis and classification and method therefor |
US20080172224A1 (en) | 2007-01-11 | 2008-07-17 | Microsoft Corporation | Position-dependent phonetic models for reliable pronunciation identification |
US8116746B2 (en) | 2007-03-01 | 2012-02-14 | Microsoft Corporation | Technologies for finding ringtones that match a user's hummed rendition |
US20080215319A1 (en) | 2007-03-01 | 2008-09-04 | Microsoft Corporation | Query by humming for ringtone search and download |
US20080228496A1 (en) | 2007-03-15 | 2008-09-18 | Microsoft Corporation | Speech-centric multimodal user interface design in mobile technology |
US20080235017A1 (en) | 2007-03-22 | 2008-09-25 | Honda Motor Co., Ltd. | Voice interaction device, voice interaction method, and voice interaction program |
US20080235872A1 (en) | 2007-03-30 | 2008-10-02 | Newkirk David C | User interface for hospital bed |
US20080255937A1 (en) | 2007-04-10 | 2008-10-16 | Yahoo! Inc. | System for optimizing the performance of online advertisements using a network of users and advertisers |
US20080256115A1 (en) | 2007-04-11 | 2008-10-16 | Oleg Beletski | Systems, apparatuses and methods for identifying transitions of content |
US8296179B1 (en) | 2007-05-02 | 2012-10-23 | Monster Worldwide, Inc. | Targeted advertisement placement based on explicit and implicit criteria matching |
US20090030686A1 (en) | 2007-07-27 | 2009-01-29 | Fuliang Weng | Method and system for computing or determining confidence scores for parse trees at all levels |
US20090037174A1 (en) | 2007-07-31 | 2009-02-05 | Microsoft Corporation | Understanding spoken location information based on intersections |
US20100286979A1 (en) | 2007-08-01 | 2010-11-11 | Ginger Software, Inc. | Automatic context sensitive language correction and enhancement using an internet corpus |
US20090125306A1 (en) | 2007-09-19 | 2009-05-14 | International Business Machines Corporation | Method, system and computer program for enhanced speech recognition of digits input strings |
US20090112593A1 (en) | 2007-10-24 | 2009-04-30 | Harman Becker Automotive Systems Gmbh | System for recognizing speech for searching a database |
US20090119097A1 (en) | 2007-11-02 | 2009-05-07 | Melodis Inc. | Pitch selection modules in a system for automatic transcription of sung or hummed melodies |
US20090125298A1 (en) | 2007-11-02 | 2009-05-14 | Melodis Inc. | Vibrato detection modules in a system for automatic transcription of sung or hummed melodies |
US20090125301A1 (en) | 2007-11-02 | 2009-05-14 | Melodis Inc. | Voicing detection modules in a system for automatic transcription of sung or hummed melodies |
US20090150341A1 (en) | 2007-12-05 | 2009-06-11 | Microsoft Corporation | Generation of alternative phrasings for short descriptions |
US8013230B2 (en) | 2007-12-17 | 2011-09-06 | Sony Corporation | Method for music structure analysis |
US20110131043A1 (en) | 2007-12-25 | 2011-06-02 | Fumihiro Adachi | Voice recognition system, voice recognition method, and program for voice recognition |
US20090216525A1 (en) | 2008-02-22 | 2009-08-27 | Vocera Communications, Inc. | System and method for treating homonyms in a speech recognition system |
US20090228799A1 (en) | 2008-02-29 | 2009-09-10 | Sony Corporation | Method for visualizing audio data |
US20090240488A1 (en) | 2008-03-19 | 2009-09-24 | Yap, Inc. | Corrective feedback loop for automated speech recognition |
US20090240499A1 (en) | 2008-03-19 | 2009-09-24 | Zohar Dvir | Large vocabulary quick learning speech recognition system |
US20090271199A1 (en) | 2008-04-24 | 2009-10-29 | International Business Machines | Records Disambiguation In A Multimodal Application Operating On A Multimodal Device |
US20100014828A1 (en) | 2008-07-17 | 2010-01-21 | Indata Corporation | Video management system and method |
WO2010018586A2 (en) | 2008-08-14 | 2010-02-18 | Tunewiki Inc | A method and a system for real time music playback syncronization, dedicated players, locating audio content, following most listened-to lists and phrase searching for sing-along |
US20100042414A1 (en) | 2008-08-18 | 2010-02-18 | At&T Intellectual Property I, L.P. | System and method for improving name dialer performance |
US20100100384A1 (en) | 2008-10-21 | 2010-04-22 | Microsoft Corporation | Speech Recognition System with Display Information |
US20100124892A1 (en) | 2008-11-19 | 2010-05-20 | Concert Technology Corporation | System and method for internet radio station program discovery |
US8447608B1 (en) | 2008-12-10 | 2013-05-21 | Adobe Systems Incorporated | Custom language models for audio content |
US20120216178A1 (en) | 2008-12-10 | 2012-08-23 | International Business Machines Corporation | Compiler generator |
US20100241418A1 (en) | 2009-03-23 | 2010-09-23 | Sony Corporation | Voice recognition device and voice recognition method, language model generating device and language model generating method, and computer program |
US20120046936A1 (en) | 2009-04-07 | 2012-02-23 | Lemi Technology, Llc | System and method for distributed audience feedback on semantic analysis of media content |
US20100312782A1 (en) | 2009-06-05 | 2010-12-09 | Microsoft Corporation | Presenting search results according to query domains |
US20110035219A1 (en) | 2009-08-04 | 2011-02-10 | Autonomy Corporation Ltd. | Automatic spoken language identification based on phoneme sequence patterns |
US20110046951A1 (en) | 2009-08-21 | 2011-02-24 | David Suendermann | System and method for building optimal state-dependent statistical utterance classifiers in spoken dialog systems |
US20110213475A1 (en) | 2009-08-28 | 2011-09-01 | Tilman Herberger | System and method for interactive visualization of music properties |
US20110071819A1 (en) | 2009-09-22 | 2011-03-24 | Tanya Miller | Apparatus, system, and method for natural language processing |
US20110082688A1 (en) | 2009-10-01 | 2011-04-07 | Samsung Electronics Co., Ltd. | Apparatus and Method for Analyzing Intention |
US20110173208A1 (en) | 2010-01-13 | 2011-07-14 | Rovi Technologies Corporation | Rolling audio recognition |
US20130096911A1 (en) | 2010-04-21 | 2013-04-18 | Universite Catholique De Louvain | Normalisation of noisy typewritten texts |
US20120232683A1 (en) | 2010-05-04 | 2012-09-13 | Aaron Steven Master | Systems and Methods for Sound Recognition |
US20100211693A1 (en) | 2010-05-04 | 2010-08-19 | Aaron Steven Master | Systems and Methods for Sound Recognition |
US8694537B2 (en) | 2010-07-29 | 2014-04-08 | Soundhound, Inc. | Systems and methods for enabling natural language processing |
US20120029670A1 (en) | 2010-07-29 | 2012-02-02 | Soundhound, Inc. | System and methods for continuous audio matching |
US20140019483A1 (en) | 2010-07-29 | 2014-01-16 | Soundhound, Inc. | Systems and Methods for Generating and Using Shared Natural Language Libraries |
US20120035924A1 (en) | 2010-08-06 | 2012-02-09 | Google Inc. | Disambiguating input based on context |
US20120065960A1 (en) | 2010-09-14 | 2012-03-15 | International Business Machines Corporation | Generating parser combination by combining language processing parsers |
US20120089400A1 (en) | 2010-10-06 | 2012-04-12 | Caroline Gilles Henton | Systems and methods for using homophone lexicons in english text-to-speech |
US20120303371A1 (en) | 2011-05-23 | 2012-11-29 | Nuance Communications, Inc. | Methods and apparatus for acoustic disambiguation |
US20120323557A1 (en) | 2011-06-19 | 2012-12-20 | Detlef Koll | Speech Recognition Using Context-Aware Recognition Models |
US20130006631A1 (en) | 2011-06-28 | 2013-01-03 | Utah State University | Turbo Processing of Speech Recognition |
US20140205974A1 (en) | 2011-06-30 | 2014-07-24 | Rosetta Stone, Ltd. | Statistical machine translation framework for modeling phonological errors in computer assisted pronunciation training system |
US20130041647A1 (en) | 2011-08-11 | 2013-02-14 | Apple Inc. | Method for disambiguating multiple readings in language conversion |
US20130055223A1 (en) | 2011-08-25 | 2013-02-28 | Myezapp Inc. | Compiler with Error Handling |
US20130052939A1 (en) | 2011-08-30 | 2013-02-28 | Clear Channel Management Services, Inc. | Broadcast Source Identification Based on Matching Broadcast Signal Fingerprints |
US8762156B2 (en) | 2011-09-28 | 2014-06-24 | Apple Inc. | Speech recognition repair using contextual information |
US20130111440A1 (en) * | 2011-10-28 | 2013-05-02 | Michael Forster | Methods, Apparatuses, and Computer-Readable Media for Computing Checksums for Effective Caching in Continuous Distributed Builds |
US20130151250A1 (en) | 2011-12-08 | 2013-06-13 | Lenovo (Singapore) Pte. Ltd | Hybrid speech recognition |
WO2013177213A2 (en) | 2012-05-24 | 2013-11-28 | Soundhound, Inc. | Systems and methods for enabling natural language processing |
US20140032220A1 (en) | 2012-07-27 | 2014-01-30 | Solomon Z. Lerner | Method and Apparatus for Responding to a Query at a Dialog System |
US20140039895A1 (en) | 2012-08-03 | 2014-02-06 | Veveo, Inc. | Method for using pauses detected in speech input to assist in interpreting the input during conversational interaction for information retrieval |
US20140067394A1 (en) | 2012-08-28 | 2014-03-06 | King Abdulaziz City For Science And Technology | System and method for decoding speech |
US20140074470A1 (en) | 2012-09-11 | 2014-03-13 | Google Inc. | Phonetic pronunciation |
US20140297252A1 (en) | 2012-12-06 | 2014-10-02 | Raytheon Bbn Technologies Corp. | Active error detection and resolution for linguistic translation |
US9697827B1 (en) | 2012-12-11 | 2017-07-04 | Amazon Technologies, Inc. | Error reduction in speech processing |
US10224030B1 (en) | 2013-03-14 | 2019-03-05 | Amazon Technologies, Inc. | Dynamic gazetteers for personalized entity recognition |
US20140324433A1 (en) | 2013-04-26 | 2014-10-30 | Wistron Corporation | Method and device for learning language and computer readable recording medium |
US20140358533A1 (en) | 2013-05-30 | 2014-12-04 | International Business Machines Corporation | Pronunciation accuracy in speech recognition |
US20150039317A1 (en) | 2013-07-31 | 2015-02-05 | Microsoft Corporation | System with multiple simultaneous speech recognizers |
WO2015017746A1 (en) | 2013-08-02 | 2015-02-05 | Herdman Rodrick A | Device for mounting and securing a handgun |
US20160232894A1 (en) | 2013-10-08 | 2016-08-11 | Samsung Electronics Co., Ltd. | Method and apparatus for performing voice recognition on basis of device information |
US20150106082A1 (en) | 2013-10-16 | 2015-04-16 | Interactive Intelligence Group, Inc. | System and Method for Learning Alternate Pronunciations for Speech Recognition |
US20150112679A1 (en) | 2013-10-18 | 2015-04-23 | Via Technologies, Inc. | Method for building language model, speech recognition method and electronic apparatus |
US20150161985A1 (en) | 2013-12-09 | 2015-06-11 | Google Inc. | Pronunciation verification |
US8843369B1 (en) | 2013-12-27 | 2014-09-23 | Google Inc. | Speech endpointing based on voice profile |
US20160148615A1 (en) | 2014-11-26 | 2016-05-26 | Samsung Electronics Co., Ltd. | Method and electronic device for voice recognition |
US9646628B1 (en) | 2015-06-26 | 2017-05-09 | Amazon Technologies, Inc. | Noise cancellation for open microphone mode |
US20170178623A1 (en) | 2015-12-22 | 2017-06-22 | Oren Shamir | Technologies for end-of-sentence detection using syntactic coherence |
US20180061399A1 (en) | 2016-08-30 | 2018-03-01 | Lenovo Enterprise Solutions (Singapore) Pte. Ltd. | Spoken utterance stop event other than pause or cessation in spoken utterances stream |
US20180308489A1 (en) | 2017-04-24 | 2018-10-25 | Samsung Electronics Co., Ltd. | Electronic device for providing speech recognition service and method thereof |
US20180330723A1 (en) | 2017-05-12 | 2018-11-15 | Apple Inc. | Low-latency intelligent automated assistant |
US20220374708A1 (en) * | 2021-05-18 | 2022-11-24 | Keepcon Corporation | System and method for content automated classification |
Non-Patent Citations (181)
Title |
---|
"Aurix Enhances Effectiveness of Leading Search Software," Aurix.com—News. Jun. 1, 2010. http://www.aurix.com/pages/3808/Aurix.sub.-enhances.sub.--effectiveness.- sub.-of .sub.--leading.sub.-search.sub.-software.htm. |
"Avaya Self-Service Optimization. Optimize the Performace of your Avaya Self-Service applications", Avaya, Inc., 2011, 4 pgs. |
"Can't Get That Song Out of Your Head," Copyright 2007, The Jakarta Post, May 20, 2007, Factiva, Inc, 2 Pages. |
"Contact Center Business Planning Guide", Avaya Inc., 2010, 8 pgs. |
"Do you know the true cost of IVR migration?" Datasheet. Aspect Software Incorporated, Dec. 2012, 4 pgs. |
"ECMAScript Language Specification", ECMA-262, ECMA International, 5.1 Edition, Jun. 2011, 258 pgs. |
"Grammatical Framework" Version 3.3.3, Copyright Mar. 2012 [retrieved on Sep. 23, 2012], Retrieved from Internet: <http://www.grammaticalframework.org>. 4 pages. |
"Hearing it Loud & Clear at SpeechTEK 2010," Aurix.com—News. Jul. 21, 2010, http://www.aurix.com/pages/4161/State.sub.-of .sub.--the.sub.--art.sub.--speech.sub.-technology.htm. |
"Nuance Recognizer 9.0: Language Pack Guide", Nuance Communications, Inc., 2007, 34 pgs. |
"VoiceXML Tutorial. Developer documentation", Bevocal, Inc., 2005, 68 pgs. |
Amdal, et al., "Pronunciation variation modeling in automatic speech recogniction", Teletronikk 99.2, Feb. 2003, pp. 70-82. |
App Shopper MusicID: http://appshopper.com/music/musicid. Last changed Jul. 14, 2014. |
App Shopper Shazam: http://appshopper.com/music/shazam. Last changed Jul. 24, 2014. |
ATT, "Conversant VIS Version 4.0 Whole Word Bilignual Speech Recognition", Issue 1, Oct. 1993, 86 pgs. |
Bechet, F., et al., "Large Span statistical models: application to homophone disambiguation for large vocabulary speech recognition in French," Eurospeech, Sep. 1999, p. 1-4. |
Benson, et al. "Sync Kit: A Persistent Client-Side Database Caching Toolkit for Data Intensive Websites," Proceedings of the 19th International Conference on Worid Wide Web, Apr. 2010. pp. 121-130. http://dl.acm.org/citation.cfm?id=1772704. cited byapplicant. |
Bisani, et al., "Joint-sequence models for grapheme-to-phoneme conversion", Speech Communication 50.5, May 2008, pp. 434-451. |
Blackburn, S G. "Search by Humming". University of Southampton, Department of Electronics and Computer Science, Faculty of Engineering, May 8, 1997, 69 Pages. |
Blackburn, S G., "Content Based Retrieval and Navigation of Music Using Melodic Pitch Contours," University of Southampton, Department of Electronics and Computer Science, Faculty of Engineering and Applied Science, Sep. 26, 2000, 136 Pages. |
Blackburn, S G., "Content Based Retrieval and Navigation of Music," University of Southampton, Department of Electronics and Computer Science, Faculty of Engineering and Applied Science, Mar. 10, 1999, 41 Pages. |
Blackburn, S., et al. "A Tool for Content Based Navigation of Music," University of Southampton, Department of Electronics and Computer Science, Multimedia Research Group, Copyright 1998 ACM 1-58113-036-8/98/0008, pp. 361-368. |
Brick, T, et al. "Incremental Natural Language Parsing for HRI", Journal of the Association for Computing Machinery, Mar. 10-12, 2007, 8 pgs. |
Casey, M. A., et al., "Content-Based Music Information Retrieval: Current Directions and Future Challenges". Apr. 2008, vol. 96, No. 4, Copyright 2008 IEEE, Retrieved from IEEE Xplore [retrieved on Dec. 29, 2008 at 18:02], 29 Pages. |
Charniak, E., et al. "Edge-Based Best-First Chart Parsing", 1998, 8 pgs. |
Choi, J., et al., "On applying phonological rules for a., Korean Continuous Speech Recognition System," Proceedings of ICSP '97, Aug. 1997, pp. 491-496. |
Chou, T., et al., "Music Databases: Indexing Techniques and Implementation", Proceedings of International Workshop on Multimedia Database Management Systems, IEEE, dated Aug. 14-16, 1996, pp. 46-53, 8 pages. |
Conway, D., et al., "Synopsis 5: Regexes and Rules", Version 158, Created Jun. 24, 2002 and Last Modified Jul. 31, 2012 [retrieved Sep. 26, 2012], Retrieved from Internet: <http://perlcabal.org/syn/s05.html>, 65 pages. |
Copestake, A, et al., "Minimal Recursion Semantics: An Introduction" Research on Language and Computation, vol. 3, pp. 281-332, 2005. |
Cremelie, et al., "In search of better pronounciation models for speech recognition," Speech Communication 29.2, Nov. 1999, pp. 115-136. |
Crisostomo, A. "Constituents and Phrases", Jul. 2011, 43 pgs. |
De Mareail, Philippe Boula, et al. "A French Phonetic Lexicon with Variants for Speech and Language Processing." LREC. Jun. 2000, pp. 1-4. |
Deobhakta, N., "Natural Language Processing, User Interfaces, and the Semantic Web". Proceedings from the Third Annual HCI Symposium, Dec. 2012, 22 pgs. |
Derouault, A., et al., "Natural language modeling for phoneme-to-text transcription," IEEE Transactions on Pattern Analysis & Machine Intelligence, Nov. 6, 1986, pp. 742-749. |
Feldman, "NLP Meets the Jabberwocky: Natural Language Processing in Information Retrieval," Online, Information Today, Inc., Copyright May 1999, 16 pages. |
Feng, "A General Framework for Building Natural Language Understanding Modules in Voice Search," 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), Mar. 2010. (Accessed Jul. 24, 2014—IEEE)http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5494951. |
First Products with Gracenote Technology to Ship in 2008. Warren's Consumer Electronics Daily. Copyright 2007 Warren Publishing, Inc. Sep. 18, 2007. Factiva, Inc. 2 pages. |
Furui, S., "Digital Speech Processing, Synthesis, and Recognition". Second Edition, Revised and Expanded. Nov. 17, 2000. ISBN 978-0824704520. 17 pages. |
Gaudinat, Amaud, et al. "Syntax-based speech recognition: how a syntactic parser can help a recognition system." EuroSpeech. Sep. 1999, pp. 1-4. |
Ghias, A., et al. "Query by Humming," Musical Information Retrieval in an Audio Database, Cornell University 1995, 6 Pages. |
Ghias, A., et al. "Query by Humming—Musical Information Retrieval in an Audio Database," ACM Multimedia 95—Electronic Proceedings, San Francisco, CA, Nov. 5-9, 1995, 13 Pages. |
Gracenote Mobile MusicID: http://web.archive.org/web/20100123211802/http://www.gracenote.com/busine- ss.sub.-solutions/mobileMusic/. Last accessed Jan. 23, 2010. |
Gracenote Readies New Services, But Video Initiative Stalls. Warren's Consumer Electronics Daily. Copyright 2005 Warren Publishing, Inc. vol. 5; Issue 122. Jun. 24, 2005. Factiva, Inc. 2 pages. |
Gracenote: MusicID, available at http://www.gracenote.com/business.sub.--solutions/music.sub.--id/, last accessed Aug. 4, 2010. |
Graham, P., "Parsing with ATNs. On Lisp: Advanced Techniques for Common Lisp", Engelwood Cliffs, NJ, Prentice Hall, 1993, 16 pgs. |
Grammar Reference, BeVocal, Inc. (NUANCE), Mountain View, CA, Copyright 2005, 102 pages. |
Grammar's Developers Guide, Nuance Speech Recognition System, Version 8.5, Copyright 2003 Nuance Communications Inc., Menlo Park, CA, 262 pages. |
Gravier, G., et al., "Introducing contextual transcription rules in large vocabulary speech recognition," The Integration of Phonetic knowledge in speech technology, Spinger Netherlands, 2005, pp. 87-106. |
Guzzino, Didier, "Active: A Unified Platform for Building Intelligent Applications", Jan. 23, 2008, 263 pgs. |
Han, B., et al. "M-Musics: Mobile Content-Based Music Retrieval System". Copyright 2007, Augsburg, Bavaria, Germany. ACM 978-1-59593-01-8/07/0009. Sep. 23-28, 2007. pp. 469-470. 2 Pages. |
Harrison, et al., "Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules bases on language transfer", INTERSPEECH, Sep. 2008, pp. 2787-2790. |
Holter, T., et al., "Maximum likelihood modelling of pronounciation variation," Speech Communication 29.2, Nov. 1999, pp. 177-191. |
Horn, P., "What was that song? With a wireless phone, find out what you heard on the radio," The Inquirer, Philadelphia, Pennsylvania, USA, dated Feb. 11, 1999, 3 pages. |
Huang, L, et al., "Dynamic Programming for Linear-Time Incremental Parsing". Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 1077-1086, Jul. 2010. |
Huet, et al., "Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition", Computer Speech and Langauge; ScienceDirect; 22 pages; received Jul. 9, 2008; available online Oct. 30, 2009. |
Hum That Tune, Then Find it on the Web. NPR: Weekend Edition—Saturday, WKSA. Copyright 2006 National Public Radio. Dec. 23, 2006. Factiva, Inc. 2 pages. |
InData Corporation, DepoView DVD, Video Review Software Product Brochure, Jun. 2008, 4 Pgs. |
InData Corporation, DepoView Video Review Software Product Description, "InData's Newest Video Deposition Viewer", Dec. 2007, 2 pgs. |
InData Corporation, DepoView Video Review Software Product Description, http://indatacorp.com/depoview.html, accessed Nov. 8, 2011, 2 Pgs. |
Indukuri, et al. "Natural Language Querying Over Databases Using Cascaded CRFs," Lecture Notes in Computer Science, Sep. 2010, http://www.springerlink.com/content/5w1x27650475304m. |
Iqbal, R, et al., "A Negation Query Engine for Complex Query Transformations". Journal of Science and Technology, pp. 193-204, 2013. |
Jamil, "A Natural Language Interface Plug-In for Cooperative Query Answering in Biological Databases," BMC Genomics, Nov. 2011. (Accessed Sep. 27, 2012 via SpringerLink) http://www.biomedcentral.com/1471-2164/13/S3/S4. |
Jang, J.R., et al. "A General Framework of Progressive Filtering and its Application to Query to Singing/Humming". IEEE Transactions on Audio, Speech, and Language Processing, vol. 16. No. 2, Feb. 2008. pp. 350-358. 9 Pages. |
JavaScript Quick Reference, BeVocal Inc. (NUANCE), Mountain View, CA, Copyright 2005, 24 pages. |
Jurafsky et al., Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition, 3rd edition draft dated Sep. 21, 2021, 629 pages. Retrieved on Nov. 4, 2021. Retrieved from the internet [URL: https://web.stanford.edu/˜jurafsky/slp3/ed3book_sep212021.pdf ]. |
Klein, D, et al. "Accurate Unlexicalized Parsing", Proceedings of the 41st Meeting of the Association for Computational Linguistics, vol. 1, pp. 423-430, 2003. |
Kolias, et al. "Design and implementation of a VoiceXML-driven wiki application for assistive environments on the web," Personal and Ubiquitous Computing, Sep. 2010. vol. 14, No. 6, p. 527-539,http://www.icsd.aegean.gr/publication.sub.-files/journal/295233664.pdf. |
Kosugi, N., et al. "A Practical Query-By-Humming System for a Large Music Database". NTT Laboratories, Japan. ACM Multimedia Los Angeles, Ca, USA. Copyright ACM 2000 1-58113-198-4/00/10. pp. 333-342. 10 Pages. |
Langanke, "Direct Voice Control Speech Data Entry and Database Query Models," International Symposium on Logistics and Industrial Informatics, Sep. 2007. (Accessed Jul. 24, 2014—IEEE)http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=4343522. |
Larson, et al. "NYT to Release Thesaurus and Enter Linked Data Cloud," NY Times Blogs, Jun. 2009. http://open.blogs.nytimes.com/2009/06/26/nyt-to-release-thesaurus-and-ent- erlinked-data-cloud/. |
Li, et al. "Robust Audio Identification for MP3 Popular Music," SIGIR '10 Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, Jul. 2010. p. 627-634.http://dLacm.org/citation.cfm?doid=1835449.1835554. |
Liu, et al. "Content-Based Retrieval of MP3 Music Objects," CIKM '01 Proceedings of the tenth international conference on Information and knowledge management, 2001. p. 506-511. http://dx.doi.org/10.1145/502585.502670. |
Matsubara, S, et al., "Chart-based Parsing and Transfer in Incremental Spoken Language Translation", Proceedings of the Fourth Natural Language Processing Pacific Rim Symposium, 1997, 4 pgs. |
McKeown, K., "Semantic Analysis: Syntax-Driven Semantics", 27 pgs. [retrieved Jun. 17, 2014]. Retrieved from the Internet: <URL: <http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=2&ved=0CCEQFjAB&url=http%3A%2F%2Fwww1.cs.columbia.edu%2F˜kathy%2FNLP%2FClassSlides%2FClass13SemanticAnalysis%2Fsemantics.ppt&ei=Xdb4VMPDIcvsoAS2soGABg&usg=AFQjCNGiXuBWLO-oQB_MLor_kN_8ATdpRg&sig2=BnvJvvJJo3OApAC6ny0guQ&bvm=bv.87611401,d.cGU>>. |
McNab, R. J., et al. "The New Zealand Digital Library MELody inDEX". University of Waikato, Department of Computer Science. D-Lib Magazine, May 1997 [retrieved on Jun. 12, 2011 at 11:25:49 AM]. ISSN 1082-9873. Retrieved from the Internet: <http://dlib.org/dlib/May97/meldex/05wiillen.html>, 13 pages. |
McNab, R. J., et al. "Towards the Digital Music Library: Tune Retrieval from Acoustic Input". University of Waikato, Department of Computer Science, School of Education. DL 1996, Bethesda MD USA. Copyright 1996 ACM 0-89791-830-4/96/03. pp. 11-18. 8 Pages. |
McPherson, J.R., et al., "Usage of the MELDEX Digital Music Library", 1999, in Proceedings of the International Symposium on Music Information Retrieval, (Bloomington, IN, USA, 2001), pp. 19-20, 2 pages. |
Melodis Rolls Out midomi mobile. Wireless News. Copyright 2008 M2 Communications, Ltd. Mar. 6, 2008. 1 Page. |
Miller, L.G., et al., "Syntactic Analysis for large vocabulary speech recognition using a context-free covering grammar," Acoustics, Speech, and Signal processing, 1988, ICASSP—88, 1988 International Conference on. IEEE, Apr. 1988, pp. 271-274. |
Mobile Music: Comcast Cellular First in U.S. to Trial Breakthrough Interactive Music Service Called *CD. Copyright PR Newswire, New York. ProQuest LLC. Feb. 11, 1999. Retrieved from the Internet: <http://proquest.umi.com.libproxy.mit.edu/pqdwb?did+38884944&sid=3&Fmt=3&clientId=5482&RQT=309&VName=PQD>. 3 pages. |
Mohri, M, et al., "Weighted Finite-State Transducers in Speech Recognition", Computer Speech and Language, Jan. 2002, 26 pgs. |
Murase, T, et al., "Incremental CFG Parsing with Statistical Lexical Dependencies", Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, Nov. 2001, 9 pgs. |
Nelson, J., V Cast Song ID from Verizon Wireless. May 21, 2007 [retrieved on Jul. 24, 2014], Retrieved from Internet: <http://www.verizonwireless.com/news/article/2007/05/pr2007-05-21a.html>. |
New, B., "Question Answering at TREC", Pre-Internship report, Mar. 2008. 4 pgs. |
Nortel Norstar, "Voice Mail Speech Recognition Automated Attendant", Product manual [online]. 29 pgs. Nortel Norstar [retrieved Sep. 4, 2012]. Retrieved from the Internet: <URL: https://downloads.avaya.com/css/P8/documents/100141923>. |
Norvell, T., "A Short Introduction to Regular Expressions and Context Free Grammars", Project report, Nov. 2002, 5 pgs. |
OMRAS2—AudioDB-Populating and Querying an AudioDB Instance. (No archived version available—accessed Jul. 24, 2014 via Google) http://omras2.org/audioDB/tutorial1. 3 pages. |
OMRAS2—Ontology-Driven Music Retrieval & Annotation Sharing Service. Overview—Apr. 24, 2009 [Accessed Jul. 24, 2014—Archive.org] http://web.archive.org/web/20090424083019/http://www.omras2.org/overview. 2 pages. |
Pardo, B., et al. "The VocalSearch Music Search Engine". EECS, Northwestern University. JCDL 2008, Pittsburgh, Pennsylvania, USA. Jun. 16-20, 2008, ACM 978-1-59593-998-2/08/06. p. 430. 1 Page. |
PCT/US2009/066458—International Preliminary Report on Patentability dated Jun. 7, 2011, 7 pages. |
PCT/US2009/066458—International Search Report, dated Jun. 23, 2010, 16 pages. |
PCT/US2013/042097—International Search Report & Written Opinion dated Dec. 2, 2013. |
Perennou, Guy, et al. "MHATLex: Lexical Resources for Modelling the French Pronunciation." LREC. Jun. 2000, pp. 1-8. |
Phillips, W., "Introduction to Natural Language Processing," CCSI 2006 The Mind Project, retrieved online: <http://www.mind.ilstu.edu/curriculum/protothinker/natural_language_processing.php> accessed on Nov. 25, 2016, 24 pages. |
Quesada, J, et al. "Design of a Natural Language Command Dialogue System". Project deliverable 3.2, SIRIDUS, 2000, 91 pgs. |
Ranbom, Larissa J., et al. "Lexical representation of phonological variation in spoken word recognition." Journal of Memory and Language 57.2, Aug. 2007, pp. 273-298. |
Ranta, A., "Creating Linguistic Resources with the Grammatical Framewor," LREC Tutorial, Malta, May 17, 2010, 75 pages. |
Ranta, A., "Grammatical Framework Tutorial", Copyright Dec. 2010 for GF 3.2 [retrieved on Sep. 13, 2012], Retrieved from Internet: <http://www.grammaticalframework.org/doc/tutorial/gf-tutorial.html>. 68 pages. |
Ranta, A., "Grammatical Framework: Programming with Multilingual Grammars," Slides for the GF book, CSLI Stanford, Copyright 2011, 440 pages. |
Saltzman, M., "The Best Things in life are Free—For Your iPhone," Home Electronics and Technology, for Canwest News Service. Copyright 2008 Edmonton Journal. Nov. 12, 2008. Factiva, Inc. 2 pages. |
Schaden, S., "Rule-based lexical modelling of foreign-accented pronounciation variants," Proceedings of the tenth conference on European chapter of the Association of Computation Linguistics—vol. 2, Association for Computational Linguistics, 2003, pp. 159-162. |
Schiel, F., et al., "Statistical modeling of pronunciation: it's not the model, it's the data," Modeling Pronounciation Variation for Automatic Speech Recognition, May 1998, pp. 1-7. |
Seneff, S, et al., "Galaxy—II: A Reference Architecture for Conversational System Development", Proceedings of the International Conference on Spoken Language Processing, Nov. 1998, 5 pgs. |
Seneff, S., "TINA: A Natural Language System for Spoken Language Applications", Journal of the Association for Computational Linguistics, 18 (1), pp. 61-82, Mar. 1992. |
Shazam: http://web.archive.org/web/20100501190631/http://www.shazam.com/. Last accessed May 1, 2010. |
Song, J., et al. "Query by Humming: Matching Humming Query to Polyphonic Audio," LG Electronics, Seoul, Korea. Copyright 2002 IEEE. 0-7809-7304-9/02. pp. 329-332. 4 Pages. |
Sony Ericsson's W850i Walkman Phone Now Available in the Middle East. Al-Bawaba News, 2006 Al-Bawaba. Dec. 11, 2006. Factiva, Inc. <www.albawaba.com>. 2 pages. |
Stolcke, A., "An Efficient Probabilistic Context-Free Parsing Algorithm that Computes Prefix Probabilities", Journal of the Association for Computational Linguistics, 21 (2), Jun. 1995, pp. 165-201. |
Svendsen, Torbjorn. "Pronunciation modeling for speech technology." Proc. Intl. Conf. on Signal Processing and Communication (SPCOM04), Jan. 2005, pp. 1-6. |
Taylor, C., "Company Lets Listeners Dial for CDs," Billboard, vol. 1, No. 26, General Interest Module, Jun. 26, 1999, pp. 86-87, 2 pages. |
Tesprasit, et al., "A context-sensitive homograph disambiguation in Thai text-to-speech synthesis", Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of hte Proceedings of HLT-NAACL 2003, (2003), pp. 1-3. |
Typke, R., et al., "A Survey of Music Information Retrieval Systems," Universiteit Utrecht, The Netherlands. Copyright 2005 Queen Mary, University of London. 8 Pages. |
U.S. Appl. No. 13/193,514—Office Action dated Aug. 22, 2014, 21 pages. |
U.S. Appl. No. 13/193,514—Office Action dated Jan. 6, 2014, 21 pages. |
U.S. Appl. No. 13/193,514—Office Action dated Jul. 17, 2015, 16 pages. |
U.S. Appl. No. 13/193,514—Response to Aug. 22, 2014 Office Action, filed Dec. 22, 2014, 11 pages. |
U.S. Appl. No. 13/193,514—Response to Jan. 6 Office Action filed May 6, 2014, 11 pages. |
U.S. Appl. No. 13/401,728—Notice of Allowance dated Mar. 4, 2015, 8 pages. |
U.S. Appl. No. 13/401,728—Office Action dated Jul. 17, 2014, 11 pages. |
U.S. Appl. No. 13/480,400—Office Action dated May 7, 2013, 15 pages. |
U.S. Appl. No. 13/480,400—Response to May 7 Office Action filed Jul. 8, 2013, 16 pages. |
U.S. Appl. No. 13/842,735—Final Office Action dated Dec. 13, 2019, 14 pages. |
U.S. Appl. No. 13/842,735—Final Office Action dated May 22, 2017, 10 pages. |
U.S. Appl. No. 13/842,735—Nonfinal Office Action dated Dec. 15, 2017, 14 pages. |
U.S. Appl. No. 13/842,735—Notice of Allowance dated Jan. 7, 2016, 11 pages. |
U.S. Appl. No. 13/842,735—Notice of Allowance dated Jul. 13, 2016, 10 pages. |
U.S. Appl. No. 13/842,735—Notice of Allowance dated Sep. 11, 2015, 10 pages. |
U.S. Appl. No. 13/842,735—Office Action dated Feb. 20, 2015, 12 pages. |
U.S. Appl. No. 13/842,735—Office Action dated Nov. 10, 2016, 8 pages. |
U.S. Appl. No. 13/842,735—Response to Feb. 20 Office Action filed Aug. 19, 2015, 15 pages. |
U.S. Appl. No. 13/842,735—Response to Final Office Action dated May 22, 2017, filed Nov. 21, 2017, 11 pages. |
U.S. Appl. No. 13/842,735—Response to Notice of Allowance dated Jul. 13, 2016, filed Oct. 11, 2016, 14 pages. |
U.S. Appl. No. 13/842,735—Response to Office Action dated 125 Jun. 2019, filed Nov. 25, 2019, 11 pages. |
U.S. Appl. No. 13/842,735—Response to Office Action dated Nov. 10, 2016, filed May 8, 2017, 11 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Aug. 15, 2016, 5 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Aug. 3, 2015, 5 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Feb. 5, 2016, 5 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Jan. 23, 2015, 9 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Jan. 31, 2018, 5 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated May 18, 2015, 5 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Sep. 5, 2017, 7 pages. |
U.S. Appl. No. 13/843,290—Notice of Allowance dated Sep. 5, 2018, 11 pages. |
U.S. Appl. No. 13/843,290—Office Action dated Jan. 19, 2017, 7 pages. |
U.S. Appl. No. 13/843,290—Response after Notice of Allowance dated Jan. 18, 2018, filed Apr. 25, 2018, 10 pages. |
U.S. Appl. No. 13/843,290—Response after Notice of Allowance dated Sep. 5, 2017, filed Nov. 27, 2017, 10 pages. |
U.S. Appl. No. 13/843,290—Response to Notice of Allowance dated Aug. 15, 2016, filed Nov. 15, 2016, 11 pages. |
U.S. Appl. No. 13/844,028—Office Action dated Nov. 7, 2014, 15 pages. |
U.S. Appl. No. 14/634,642—Amendment with RCE filed Jun. 6, 2018, 14 pages. |
U.S. Appl. No. 14/634,642—Final Office Action dated Feb. 12, 2019, 12 pages. |
U.S. Appl. No. 14/634,642—Interview Summary and Proposed Examiner's Amendment filed Jan. 30, 2018, 10 pages. |
U.S. Appl. No. 14/634,642—Non-final Office Action dated Jun. 27, 2018, 14 pages. |
U.S. Appl. No. 14/634,642—Non-final Office Action dated Sep. 22, 2017, 24 pages. |
U.S. Appl. No. 14/634,642—Notice of Allowance dated Mar. 7, 2018, 22 pages. |
U.S. Appl. No. 14/634,642—Office Action dated Jul. 28, 2016, 27 pages. |
U.S. Appl. No. 14/634,642—Office Action dated Mar. 23, 2016, 15 pages. |
U.S. Appl. No. 14/634,642—Response to Jul. 28 Office Action filed Sep. 15, 2016, 8 pages. |
U.S. Appl. No. 14/634,642—Response to Mar. 23 Office Action filed May 17, 2016, 13 pages. |
U.S. Appl. No. 14/634,642—Response to Non-final Office Action dated Jun. 27, 2018 filed Dec. 27, 2018, 9 pages. |
U.S. Appl. No. 14/634,642—Response to Non-final Office Action dated Sep. 22, 2017 filed Jan. 22, 2018, 29 pages. |
U.S. Appl. No. 14/634,642—Supplemental Response to Non-final Office Action dated Sep. 22, 2017 filed Jan. 22, 2018, 15 pages. |
U.S. Appl. No. 60/134,782—Rhoads, G., "Methods and Systems Employing Digital Watermarking," filed May 19, 1999, 47 pages. |
U.S. Appl. No. 60/155,064—Kenyon, S., "Automatic Program Identification System and Method," filed Sep. 21, 1999, 49 pages. |
U.S. Appl. No. 60/158,087—Swierczek, R., "Music Identification System," filed Oct. 7, 1999, 12 pages. |
U.S. Appl. No. 60/166,965—Finley, M., et al., "Broadcast Media Purchasing System," filed Nov. 23, 1999, 21 pages. |
U.S. Appl. No. 60/186,565—Swierczek, R., "Music Identification System," filed Mar. 2, 2000, 14 pages. |
U.S. Appl. No. 60/218,824—Kenyon, S., et al., "Audio Identification System and Method," filed Jul. 18, 2000, 45 pages. |
U.S. Appl. No. 60/222,023—Wang, et al., "Method and Apparatus for Recognizing Sound and Music Signals in High Noise and Distortion" filed Jul. 31, 2000, 26 pages. |
Venkatachalam, V., Cazzanti, L., Chillon, N., Wells, M., "Automatic Identification of Sound Recordings," Signal Processing Magazine, IEEE, Mar. 2004, 92-99, vol. 21, Issue 2. |
VoiceXML Tutorial (NUANCE), BeVocal, Inc., Mountain View, CA, Copyright 2005, 68 pages. |
Wagstaff, J., "Loose Wire: New Service Identifies Songs You Hum," WSJA Weekend Journal. Copyright 2006, Dow Jones & Company, Inc. Dec. 25, 2006. Factiva, Inc. 2 pages. |
Wang, A., "The Shazam Music Recognition Service". Communications of the ACM, vol. 49, No. 8. Aug. 2006. ACM 0001-0782/06/0800. pp. 44-48. 5 pages. |
Wang, A.L., "An Industrial-Strength Audio Search Algorithm," In ISMIR 2003, 4th Symposium Conference on Music Information Retrieval(Oct. 26, 2003), pp. 7-13. |
Wilcox, B., "Beyond Façade: Pattern Matching for Natural Language Applications," The Gamasutra—The Art & Business of Making Games, Copyright 2016 UBM Tech, retrieved online: <http://gamasutra.com/view/feature/134765/beyond_fa%C3%A7ade_pattern_matching_.php> accessed on Nov. 25, 2016, 14 pages. |
Wold, E., et al., "Classification, Search, and Retrieval of Audio", Muslce Fish, Berkeley, CA, USA, CRC Handbook of Multimedia Computing 1999, pp. 1-19, 18 pages. |
Wold, et al., "Content-Based Classification, Search and Retrieval of Audio", IEEE Multimedia 1070-986X/96, vol. 3, No. 3: Fall 1996, pp. 27-36 (17 pages). |
Xu, et al. "Music Identification via Vocabulary Tree with MFCC Peaks," MIRUM '11 Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, 2011. p. 21-26.http://dl.acm.org/citation.cfm?doid=2072529.2072537. |
Yao, "The Effects of Phonological Neighborhoods in Pronunication Variation in Conversational Speech", UC Berkeley Electroinic Theses and Dissertatioms, Jan. 1, 2011, 218 pages. |
Young, S. L., et al. "High level knowledge sources in usable speech recognition systems." Communications of the ACM 32.2, Feb. 1989, pp. 183-194. |
Younger, D. H., "Recognition and parsing of context-free languages in time n3", Information and Control, vol. 10, Issue 2, Feb. 1967, pp. 189-208. |
Yu, et al. "A Query-by-Singing System for Retrieving Karaoke Music," IEEE Transactions on Multimedia, Dec. 2008, vol. 10, No. 8, p. 1626-1637. http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=4694852. |
Zhu, Y., et al. "Warping Indexes with Envelope Transforms for Query by Humming". New York University, New York. SIGMOD Copyright 2003, San Diego, CA. Jun. 9-12, 2003. ACM 1-58113-634-X/03/06. pp. 181-192. 12 Pages. |
Zlatanov, T. "Cultured Perl", Nov. 2004, 13 pgs. [retrieved Oct. 22, 2014]. Retrieved from the Internet: <URL: http://www.ibm.com/developerworks/linux/library/l-cpregex/index.html>. |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220180893A1 (en) * | 2020-12-08 | 2022-06-09 | Line Corporation | Method and system for generating multimedia content |
Also Published As
Publication number | Publication date |
---|---|
US20210224043A1 (en) | 2021-07-22 |
US10996931B1 (en) | 2021-05-04 |
US10957310B1 (en) | 2021-03-23 |
US20240029721A1 (en) | 2024-01-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11776533B2 (en) | Building a natural language understanding application using a received electronic record containing programming code including an interpret-block, an interpret-statement, a pattern expression and an action statement | |
US7072837B2 (en) | Method for processing initially recognized speech in a speech recognition session | |
JP5162697B2 (en) | Generation of unified task-dependent language model by information retrieval method | |
US7630892B2 (en) | Method and apparatus for transducer-based text normalization and inverse text normalization | |
Tur et al. | Spoken language understanding: Systems for extracting semantic information from speech | |
US8117022B2 (en) | Method and system for machine understanding, knowledge, and conversation | |
EP1538535A2 (en) | Determination of meaning for text input in natural language understanding systems | |
US11295730B1 (en) | Using phonetic variants in a local context to improve natural language understanding | |
JPH0320800A (en) | Method and device for recognizing voice | |
US11538466B2 (en) | Development of voice and other interaction applications | |
JPH11149297A (en) | Verbal dialog system for information access | |
US20210056962A1 (en) | Development of Voice and Other Interaction Applications | |
CN113779062A (en) | SQL statement generation method and device, storage medium and electronic equipment | |
US10762890B1 (en) | Development of voice and other interaction applications | |
Arısoy et al. | A unified language model for large vocabulary continuous speech recognition of Turkish | |
CN112905637B (en) | Multi-round text-to-SQL method and system based on conversation rewriting model | |
Sridhar et al. | A Compiler-based approach for natural language to code conversion | |
KR102106797B1 (en) | Apparatus for analyzing complex sentence, and recording medium therefor | |
Nugues | Language processing with Perl and Prolog | |
KR101134455B1 (en) | Speech recognition apparatus and its method | |
Di Fabbrizio et al. | AT&t help desk. | |
Noormamode et al. | A speech engine for mauritian creole | |
Attanayake | Statistical language modelling and novel parsing techniques for enhanced creation and editing of mathematical e-content using spoken input | |
Hockey et al. | Comparison of grammar-based and statistical language models trained on the same data | |
Wigmore | Speech-based creation and editing of mathematical content |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: SOUNDHOUND, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MOHAJER, KEYVAN;EMAMI, SEYED MAJID;WILSON, CHRISTOPHER;AND OTHERS;SIGNING DATES FROM 20150205 TO 20150206;REEL/FRAME:055877/0762 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
AS | Assignment |
Owner name: ACP POST OAK CREDIT II LLC, TEXAS Free format text: SECURITY INTEREST;ASSIGNORS:SOUNDHOUND, INC.;SOUNDHOUND AI IP, LLC;REEL/FRAME:063349/0355 Effective date: 20230414 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
AS | Assignment |
Owner name: SOUNDHOUND AI IP HOLDING, LLC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SOUNDHOUND, INC.;REEL/FRAME:064083/0484 Effective date: 20230510 |
|
AS | Assignment |
Owner name: SOUNDHOUND AI IP, LLC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SOUNDHOUND AI IP HOLDING, LLC;REEL/FRAME:064205/0676 Effective date: 20230510 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: SOUNDHOUND AI IP, LLC, CALIFORNIA Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:ACP POST OAK CREDIT II LLC, AS COLLATERAL AGENT;REEL/FRAME:067698/0845 Effective date: 20240610 Owner name: SOUNDHOUND, INC., CALIFORNIA Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:ACP POST OAK CREDIT II LLC, AS COLLATERAL AGENT;REEL/FRAME:067698/0845 Effective date: 20240610 |
|
AS | Assignment |
Owner name: MONROE CAPITAL MANAGEMENT ADVISORS, LLC, AS COLLATERAL AGENT, ILLINOIS Free format text: SECURITY INTEREST;ASSIGNOR:SOUNDHOUND, INC.;REEL/FRAME:068526/0413 Effective date: 20240806 |
|
AS | Assignment |
Owner name: SOUNDHOUND, INC., CALIFORNIA Free format text: TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENTS;ASSIGNOR:MONROE CAPITAL MANAGEMENT ADVISORS, LLC, AS COLLATERAL AGENT;REEL/FRAME:069480/0312 Effective date: 20241201 |