US20100050150A1 - Method and System for Developing Speech Applications - Google Patents
- Publication number
- US20100050150A1 (application US 12/490,194)
- Authority
- US
- United States
- Prior art keywords
- application
- voice
- data
- vxml
- voice application
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4938—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals comprising a voice browser which renders and interprets, e.g. VoiceXML
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Definitions
- the present invention is in the area of voice application software systems and pertains particularly to an improved method for developing voice application systems enabling in progress documenting and subsequent demonstration capability of voice application features, capabilities, and configurable options to end users of an enterprise.
- a speech application is one of the most challenging applications to develop, deploy and maintain in a communications (typically telephony) environment.
- Expertise required for developing and deploying a viable application includes expertise in computer telephony integration (CTI) hardware and software, voice recognition software, text-to-speech hardware and software, and speech application logic.
- CTI computer telephony integration
- VXML voice extensible markup language
- the expertise required to develop a speech solution has been reduced somewhat.
- VXML is a language that enables a software developer to focus on the application logic of the voice application without being required to configure underlying telephony components.
- the developed voice application is run on a VXML interpreter that resides on and executes on the associated telephony system to deliver the solution.
- a typical architecture of a VXML-compliant telephony system comprises a voice application server and a VXML-compliant telephony server. Typical steps for development and deployment of a VXML-enabled IVR solution are briefly described below.
- VXML pages, which are the results of functioning application logic, are rendered by a VXML rendering engine based on a specified generation sequence.
- an object facade to the telephony server 130 is created comprising the corresponding VXML pages and is sent to the telephony server over a network, which can be the Internet, an Intranet, or an Ethernet network.
- the VXML pages are integrated into the rendering engine such that they can be displayed according to set workflow at the voice application server.
- the VXML-telephony server is configured to enable proper retrieval of specific VXML pages from the rendering engine within the voice application server.
- a triggering mechanism is provided to the voice application server so that when a triggering event occurs, an appropriate outbound call is placed from the server.
- a VXML interpreter, a voice recognition text-to-speech engine, and the telephony hardware/software are provided within the VXML-telephony server and comprise server function.
- the telephony hardware/software along with the VXML interpreter was and still is, in some cases, packaged as an off-the-shelf IVR-enabling technology.
- the voice application server is typically written in a programming language such as Java and packaged as an enterprise Java Bean archive.
- the presentation logic required is handled by the rendering engine and is written in JSP or PERL.
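- As a rough illustration of what a rendering engine hands to the VXML interpreter, the sketch below builds a one-prompt VoiceXML 2.0 page with Python's standard library. It is not taken from the application; the function name `render_vxml_page` and the default form id are hypothetical, and a production engine (written in JSP or Perl, as noted above) would also emit grammars and dialog transitions.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C VoiceXML 2.0 recommendation.
VXML_NS = "http://www.w3.org/2001/vxml"


def render_vxml_page(prompt_text: str, form_id: str = "main") -> str:
    """Render a minimal VoiceXML 2.0 page that speaks a single prompt.

    `form_id` is a hypothetical default chosen for this sketch.
    """
    root = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    form = ET.SubElement(root, "form", {"id": form_id})
    block = ET.SubElement(form, "block")
    prompt = ET.SubElement(block, "prompt")
    prompt.text = prompt_text  # ElementTree escapes &, <, > on output
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(render_vxml_page("Your balance is forty-two dollars."))
```

- The telephony server's interpreter would fetch such a page over the network and hand the prompt text to its text-to-speech engine.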
- one system described in a U.S. patent reference entitled “Method and Apparatus for Development and Deployment of a Voice Software Application for Distribution to one or more Application Consumers”, which is referenced above as a priority document, includes a voice application server that is connected to a data network for storing and serving voice applications.
- the voice application server has a data connection to a network communications server connected to a communications network such as the well-known PSTN network.
- the communication server routes the created voice applications to their intended recipients.
- the above system includes a computer station connected to the data network, the computer capable of accessing the voice application server over the network.
- the system also includes a special client software application hosted on the computer station, the application for enabling users to create voice applications and manage their states.
- the user creates voice applications using object modeling and linking.
- the applications, once created, are then stored in the voice application server for deployment.
- the user can control and manage deployment and state of deployed applications including scheduled deployment and repeat deployments in terms of intended recipients.
- the voice application server in this system has the capability of accessing a network server and Web site hosted therein and pulling data from the site for use in the voice application.
- An operator of the computer station provides templates that the application server may use in data-to-voice (TTS) rendering. Therefore, the Web-based data targeted may be synthesized and spoken to an end user interacting with the application.
- TTS data-to-voice
- Enhanced data organization and security features are also provided with this system.
- a vocabulary management system is included as part of the system and is adapted to constrain voice recognition processing associated with text-to-speech and speech-to-text rendering associated with any application in a state of interaction with a user.
- the management portion of the system includes a vocabulary management server connected to the voice application server and to the telephony server, and an instance of vocabulary management software running on the management server.
- an administrator accessing the vocabulary management server uses the vocabulary management software to create unique vocabulary sets or lists that are specific to selected portions of vocabulary associated with target data sources, the vocabulary sets differing in content according to administrator direction.
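- One way to picture how a vocabulary set constrains recognition is a filter over candidate transcriptions. The sketch below is an assumption for illustration only: the referenced system does not publish this interface, and `constrain_hypotheses`, `VOCABULARY_SETS`, and the sample words are hypothetical.

```python
from typing import Iterable, List, Set


def constrain_hypotheses(hypotheses: Iterable[str], vocabulary: Set[str]) -> List[str]:
    """Keep only hypotheses whose words all belong to the active vocabulary set."""
    allowed = {word.lower() for word in vocabulary}
    kept = []
    for text in hypotheses:
        words = text.lower().split()
        if words and all(word in allowed for word in words):
            kept.append(text)
    return kept


# Hypothetical vocabulary sets, keyed by the target data source they were built for.
VOCABULARY_SETS = {
    "banking": {"check", "balance", "transfer", "funds"},
    "weather": {"forecast", "today", "tomorrow"},
}

if __name__ == "__main__":
    candidates = ["check balance", "czech ballots", "transfer funds"]
    print(constrain_hypotheses(candidates, VOCABULARY_SETS["banking"]))
```

- Keeping one set per data source mirrors the idea that the administrator decides which words are in play for each portion of an application.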
- CPE customer premise equipment
- a voice application, as packaged software, should be usable immediately out of the box.
- Such a solution should be platform independent, configurable without requiring professional services, and be easily upgradeable.
- software voice applications available on the market may exhibit all or some of the following weaknesses.
- a method for creating a voice application comprising steps of (a) gathering voice application requirements; (b) generating a product requirements document; (c) creating a voice user interface from information generated at step (b); (d) implementing the interface created at step (c); (e) creating one or more data interfaces from information generated at step (b); (f) implementing the one or more interfaces; (g) generating one or more templates for system usage reporting; and (h) conducting one or more system usage studies.
- the voice application requirements include data about defined customers, internal and external data sources and paths, platform and speech engine used, business rules and practices, and data transformation protocols.
- the product requirements document is in the form of an electronic wizard containing fields for data entry.
- the product requirements document is an executable part of the voice application software and functions as a tutorial and specification of feature options including test result data, the document available on a pre-sale basis.
- the voice user interface is tested for system usage.
- the voice user interface provides documentation of system usage for the voice application.
- the voice user interface is implemented on architecture that simulates or physically represents customer premise equipment.
- the one or more data interfaces are adapted to transform internal data to a format usable for Voice extensible Markup Language (VXML) rendering.
- the one or more data interfaces include one or more interfaces to external Web-based data sources.
- the data interfaces include support for middleware applications used for data migration.
- in step (f), the data interfaces are implemented on architecture that simulates or physically represents customer premise equipment.
- the system usage data includes caller behavior patterns, dialog recognition statistics, system satisfaction statistics, and voice application path selection statistics.
- a computer system comprising a processor; and a machine-readable instruction that, when executed, enables the machine to facilitate a method for creating a voice application including steps of (a) gathering voice application requirements; (b) generating a product requirements document; (c) creating a voice user interface from information generated at step (b); (d) implementing the interface created at step (c); (e) creating one or more data interfaces from information generated at step (b); (f) implementing the one or more interfaces; (g) generating one or more templates for system usage reporting; and (h) conducting one or more system usage studies.
- the voice application requirements include data about defined customers, internal and external data sources and paths, platform and speech engine used, business rules and practices, and data transformation protocols.
- the product requirements document is in the form of an electronic wizard containing fields for data entry.
- the product requirements document is an executable part of the voice application software and functions as a tutorial and specification of feature options including test result data, the document available on a pre-sale basis.
- the voice user interface is tested for system usage.
- the voice user interface provides documentation of system usage for the voice application.
- the voice user interface is implemented on architecture that simulates or physically represents customer premise equipment.
- the one or more data interfaces are adapted to transform internal data to a format usable for Voice extensible Markup Language (VXML) rendering.
- the one or more data interfaces include one or more interfaces to external Web-based data sources.
- the data interfaces include support for middleware applications used for data migration.
- in step (f), the data interfaces are implemented on architecture that simulates or physically represents customer premise equipment.
- the system usage data includes caller behavior patterns, dialog recognition statistics, system satisfaction statistics, and voice application path selection statistics.
- FIG. 1 is an architectural overview of a VXML network enabled by provision of voice application development software according to an embodiment of the present invention.
- FIG. 2 is a process flow diagram illustrating a system method for developing a voice application according to an embodiment of the present invention.
- FIG. 1 is an architectural overview 1000 of a VXML network enabled by provision of voice application development software according to an embodiment of the present invention.
- Voice extensible Markup Language (VXML) network 1000 includes a data network 1001 .
- Data network 1001 may be an Internet network, an Intranet network or any other type of data network whether a corporate wide-area-network, local-area-network or private network.
- the well-known Internet network is a preferred example of data network 1001 because of the high public access characteristic and geographic reach and accessibility to other networks.
- Data network 1001 may be referred to hereinafter as Internet network 1001 although it should not be construed as a limitation of practice.
- VXML network 1000 also includes a telephone network, which in this case is a public-switched-telephone-network (PSTN) 1002 .
- PSTN 1002 may be bridged to Internet 1001 for communication through any one of several known types of gateways (not illustrated).
- a voice application server 1003 is illustrated in this example and has ported access to Internet 1001 by way of an Internet data line 1119 .
- Application server 1003 is typical of that known to the inventor and described with reference to a U.S. patent application entitled “Method and Apparatus for Development and Deployment of a Voice Software Application for Distribution to one or more Application Consumers”, Ser. No. 10/173,333. Server 1003 may be hosted by an enterprise and may be an integral portion of the customer premise equipment (CPE) of the enterprise that would benefit from a developed voice application.
- a VXML-enabled telephony server 1004 is illustrated in this example and may be hosted by a same enterprise that hosts server 1003 .
- Telephony server 1004 may also be hosted by a separate enterprise providing services to the first enterprise. In actual practice, VXML telephony server 1004 would reside somewhere within PSTN network 1002 .
- customers may access VXML functionality from PSTN network 1002 as is illustrated by a voice interaction system 1007 , which is connected to a receiving device like a telephone 1005 over a telephony link 1116 .
- PSTN customers may access a Web-based interactive voice portal 1006 located within Internet network 1001 via a network bridge connection 1117 .
- interactive voice system 1007 or portal 1006 may be accessed from the Internet side of VXML communication network 1000 .
- Telephony server 1004 contains a VXML interpreter 1120 adapted to interpret VXML pages rendered thereto by a VXML rendering engine 1124 , which is contained within voice application server 1003 .
- a text to speech engine 1121 transforms text to synthesized speech, which is processed using telephony hardware/software 1122 to deliver voice over telephony trunk 1118 to a telephony switch 1007 , to which a customer operating receiving device 1005 is connected over telephone line 1116 . Therefore the domain of interaction is between receiving device 1005 and telephony hardware/software 1122 .
- switch 1007 may be enhanced locally depending on the architecture.
- Application server 1003 contains voice application logic 1125 , database and resource adapter 1126 , and voice application validation and configuration software 1127 . All of the just-mentioned components are part of a voice application that, according to an embodiment of the present invention, is rigorously tested and tuned before it is released to a client for use.
- a developer station 1009 is illustrated in this example and is adapted for assessing requirements of a voice application and developing, testing and documenting the process in order to provide a platform independent application that an end user may implement immediately on an enterprise VXML system architecture without being required to develop components and with reduced debugging and configuration tasking.
- Developer station 1009 has an instance of developer software (DSW) 1010 installed thereon and adapted for full software development from assessing requirements to live system usage testing in a representative CPE architecture.
- DSW developer software
- a voice application solution may be developed using DSW 1010 that meets all of the general requirements of one or more similar enterprises.
- options may be included in one package that may be useable in more than one business scenario using a variety of architectures and equipment types.
- a client station 1008 is illustrated in this example and represents an administrator's station that may receive and implement a voice application onto VXML architecture 1000 .
- a client software application (CL) 1011 is provided to station 1008 , in this example, as a resident piece of a voice application developed with DSW 1010 .
- CL 1011 for example enables the administrator to implement the voice solution onto voice application server 1003 for use.
- station 1008 may be used to configure the voice application according to any needs including use of external data sources and/or internal data sources for VXML rendering.
- a voice application developed using DSW may be a static application or a dynamic application that returns data in real time by accessing external or internal data sources while customers interact with the system.
- One such external data source may be a Web-based data source such as Web server 1128 accessible to voice application server 1003 during runtime via a network link 1115 .
- Server 1128 is also accessible to client station 1008 via a network link 1113 .
- DSW 1010 contains process development tools that provide solutions to the challenges at the vendor's side before shipment and implementation at client CPE. In this way many tasks related to software development, coding, voice user interface design, database and data resource adaptation, and data transformation from existing systems into VXML interpretable states using applicable middleware solutions are eliminated altogether from the point of view of the client.
- FIG. 2 is a process flow diagram 2000 illustrating a systematic method used in development of a voice application according to an embodiment of the present invention.
- voice application requirements are gathered.
- Voice application requirements are gathered from multiple sources including requirements of definitional customers (customers whose needs can be defined), existing voice application systems, and data sources including Web sources.
- a product requirements document (PRD) is generated.
- a PRD document contains all of the requirements for completing a useable voice application. All application features and components are described within the PRD.
- the PRD is manually generated and maintained. However, it is possible in one embodiment to automatically generate a PRD using an electronic wizard. Such a generated PRD may, in some embodiments, be replayed using automated presentation means such as a wizard or a tutorial.
- An enterprise may review a PRD specific to a voice application and may gain an understanding related to whether the application functionality meets their needs or not before they make a purchase. Because the PRD describes features that might be implemented in a future release of a product, enterprises may use the future release information as a roadmap for the application. The PRD may also describe useful application capabilities that enterprises had not previously contemplated or thought possible to automate.
- a PRD describes in detail requirements that support the application itself including reporting features and requirements and configuration options and requirements. In this way an enterprise may understand upfront the full scope of what is required in order to implement the product. The defined configuration options help determine the amount of effort required to modify the application to meet specific enterprise requirements.
- Options for PRD presentation include, but shall not be limited to, anything from simple documentation in the form of a read-only electronic document that may be printed, to a more robust presentation such as one made viewable electronically by an electronic wizard or tutorial.
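- A wizard-style PRD with fields for data entry can be pictured as a small record type plus a check for what is still missing. The sketch below is only an assumption about how such a wizard might be modeled; the class `ProductRequirements`, its field names (chosen to follow the requirement categories listed above), and `missing_fields` are hypothetical.

```python
from dataclasses import dataclass, field, fields
from typing import List


@dataclass
class ProductRequirements:
    """Hypothetical PRD record; one attribute per requirement category."""
    customers: List[str] = field(default_factory=list)
    data_sources: List[str] = field(default_factory=list)
    platform: str = ""
    speech_engine: str = ""
    business_rules: List[str] = field(default_factory=list)
    data_transformations: List[str] = field(default_factory=list)


def missing_fields(prd: ProductRequirements) -> List[str]:
    """Return the names of fields the wizard still needs filled in."""
    return [f.name for f in fields(prd) if not getattr(prd, f.name)]


if __name__ == "__main__":
    draft = ProductRequirements(platform="example VXML gateway", customers=["retail bank"])
    print("Still required:", missing_fields(draft))
```

- A wizard front end could loop over `missing_fields` and prompt for each entry until the list is empty.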
- a voice user interface (VUI) design is created.
- VUI voice user interface
- a VUI is designed with consideration given to multiple configuration options listed as available in the PRD of step 2002 .
- a design is selected, tested, and prototyped using a focus group before any hard coding begins. Best VUI design practices are incorporated into the design, increasing the quality of the application and reducing or eliminating a requirement for an enterprise to acquire services from a specialist with VUI design skills.
- the style of interface is selected that is appropriate to the application functionality and type of business the application is created for.
- VUI design that includes a range of different user behaviors may result in higher call completion rates during actual use.
- the VUI specification undergoes several design reviews to ensure that it accurately reflects the requirements in the PRD and contains best VUI design practices.
- the VUI design is validated. An enterprise may review a VUI design and determine whether it is appropriate for their callers before making a commitment.
- VUI documentation is created wherein the VUI design specification serves partly as documentation for the voice application and is available for reference at a later date when application modifications are made.
- at step 2006 , the designed VUI is implemented.
- a voice application is implemented based on the VUI design and the business and processing rules specified in the PRD. Prompts, grammars, dialog flows, rules and configuration options are put in place in this step. Coding standards and consistent naming conventions are applied during the implementation. Coding standards are reviewed as well to ensure the quality of the implementation. Because applications are consistently coded to predefined standards, they are easier to maintain and upgrade.
- at step 2007 , application functionality and behavior may be tested. In this step an enterprise can easily evaluate and test an application's functionality and behavior without utilizing their own CPE before purchase. Application testing may be carried out for any or all of the working components of the application.
- at step 2008 , if the application passes all initial testing, then at step 2010 , appropriate data interfaces are implemented. If some components have failed testing at step 2008 , then those bugs are fixed in step 2009 and the components are re-tested until all components pass.
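- The test, fix, and re-test cycle of steps 2007 through 2009 can be sketched as a loop. This is an illustrative assumption, not the application's tooling: `retest_until_pass`, the callables passed to it, and the `max_rounds` limit are hypothetical, and in practice the fix step is performed by developers rather than by a function call.

```python
from typing import Callable, List


def retest_until_pass(
    components: List[str],
    run_test: Callable[[str], bool],
    fix: Callable[[str], None],
    max_rounds: int = 5,
) -> int:
    """Test components, fix failures, and re-test until everything passes.

    Returns the number of test rounds used. Raises RuntimeError when fixes
    are still unverified after `max_rounds` rounds.
    """
    pending = list(components)
    for round_number in range(1, max_rounds + 1):
        failed = [name for name in pending if not run_test(name)]
        if not failed:
            return round_number
        for name in failed:
            fix(name)
        pending = failed  # only components that failed need re-testing
    raise RuntimeError(f"components not verified after {max_rounds} rounds: {pending}")


if __name__ == "__main__":
    open_bugs = {"grammar"}  # stand-in for a defect tracker
    rounds = retest_until_pass(
        ["prompts", "grammar", "dialog flow"],
        run_test=lambda name: name not in open_bugs,
        fix=open_bugs.discard,
    )
    print("All components passed after", rounds, "round(s)")
```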
- data interfaces are implemented. Interfaces are created to map internal enterprise data into the appropriate places in the call flow of the voice application.
- the mapping of data from enterprise systems into appropriate formats for VXML rendering in voice applications is performed in step 2010 and therefore does not have to be custom developed on the enterprise side after purchase.
- Data mapping typically includes transformation of back-end data into the formats necessary for high-quality voice read back, including the association of recorded prompts with variable data values.
- Back-end integration is simplified by provision of standardized and well-defined data interfaces and a data-to-VUI mapping layer. This pre-integration with common enterprise back-end systems significantly reduces the time and cost of integrating the applications into production environments on the enterprise side.
- the interfaces may conform to any data integration standard, such as open financial exchange (OFX) in financial services for example.
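- The association of recorded prompts with variable data values, mentioned above as part of data mapping, might look like the following sketch. The prompt file layout (`digits/0.wav` and so on) and the function `account_number_prompts` are assumptions made for illustration; they do not come from the application.

```python
from typing import List

# Hypothetical layout: one recorded prompt file per spoken digit.
DIGIT_PROMPTS = {str(d): f"digits/{d}.wav" for d in range(10)}


def account_number_prompts(account_number: str) -> List[str]:
    """Map a back-end account number to the recorded prompts that read it back.

    Separators such as dashes or spaces are skipped, so the value can be
    passed in whatever format the back-end system stores it.
    """
    prompts = [DIGIT_PROMPTS[ch] for ch in account_number if ch in DIGIT_PROMPTS]
    if not prompts:
        raise ValueError("account number contains no digits")
    return prompts


if __name__ == "__main__":
    print(account_number_prompts("40-21"))
```

- The resulting file list could be emitted as a sequence of audio elements in the rendered page, so the caller hears the number digit by digit.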
- OFX open financial exchange
- a voice application implemented on a client CPE performs much better and is initially more reliable when it has been tested thoroughly before it is implemented on enterprise CPE.
- standardized reporting templates are created that are adapted to contain test result data, caller behavioral data, statistical data, and other important data for use in fine tuning application functionality and system usage.
- Detailed report information is available in real-time, once an application is deployed, to business users about the functioning of the application and the behavior of callers accessing it.
- the reports describe what dialog paths callers traversed, what was recognized in each dialog, what errors they encountered, how many times and when they requested help, and whether they successfully completed their transactions, so that the response rate to promotions or marketing offers placed in the dialog flow can readily be assessed and enterprises can easily identify problem areas or “hot spots” in the application that might require tuning.
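- A standardized report of this kind can be approximated by aggregating per-call records. The sketch below assumes a hypothetical `CallRecord` shape and a `summarize` helper; the actual reporting templates and their fields are not specified in the text.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class CallRecord:
    """Hypothetical per-call log entry."""
    path: List[str]      # dialogs visited, in order
    errors: List[str]    # dialogs in which a recognition error occurred
    help_requests: int   # number of times the caller asked for help
    completed: bool      # whether the transaction was completed


def summarize(calls: List[CallRecord]) -> Dict[str, object]:
    """Aggregate call records into completion, help, path, and hot-spot figures."""
    total = len(calls)
    error_counts = Counter(dialog for call in calls for dialog in call.errors)
    path_counts = Counter(" > ".join(call.path) for call in calls)
    return {
        "calls": total,
        "completion_rate": sum(c.completed for c in calls) / total if total else 0.0,
        "help_requests": sum(c.help_requests for c in calls),
        "hot_spots": [dialog for dialog, _ in error_counts.most_common(3)],
        "top_path": path_counts.most_common(1)[0][0] if total else None,
    }


if __name__ == "__main__":
    log = [
        CallRecord(["menu", "balance"], [], 0, True),
        CallRecord(["menu", "transfer"], ["transfer", "transfer"], 1, False),
        CallRecord(["menu", "balance"], ["menu"], 0, True),
    ]
    print(summarize(log))
```

- Dialogs that appear in `hot_spots` are the ones most likely to need grammar or prompt tuning.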
- at step 2012 , application functionality and behavioral results are documented.
- Online documentation enables an enterprise to understand how to modify application behavior when they need to change business rules or insert new application functionality.
- the functionality and behavior of the application is documented in detail.
- the documentation also describes how to configure the application and contains the complete call flow.
- the documentation also serves as a reference point for determining correct application behavior in the event of application errors.
- at step 2013 , application functionality is validated.
- functionality and behavior of an application is validated through a quality assurance process based on a previously defined test plan. All paths through the call flow may be tested including those that might be altered by configuration settings. An application does not proceed through to a production release until it meets pre-determined quality criteria.
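- Testing all paths through a call flow presupposes that the paths can be listed. The sketch below enumerates them with a depth-first walk over a hypothetical call-flow graph; the dialog names and the `all_paths` helper are illustrative assumptions, and the sketch rejects loops (such as re-prompts), which a real test plan would need to bound separately.

```python
from typing import Dict, List

# Each dialog maps to the dialogs that can follow it; terminal dialogs map to [].
CallFlow = Dict[str, List[str]]


def all_paths(flow: CallFlow, start: str) -> List[List[str]]:
    """Enumerate every path from `start` to a terminal dialog (acyclic flows only)."""
    paths: List[List[str]] = []

    def walk(node: str, trail: List[str]) -> None:
        if node in trail:
            raise ValueError(f"cycle detected at {node!r}")
        trail = trail + [node]
        successors = flow.get(node, [])
        if not successors:
            paths.append(trail)
            return
        for nxt in successors:
            walk(nxt, trail)

    walk(start, [])
    return paths


if __name__ == "__main__":
    flow = {
        "greeting": ["menu"],
        "menu": ["balance", "transfer"],
        "balance": ["goodbye"],
        "transfer": ["confirm"],
        "confirm": ["goodbye"],
        "goodbye": [],
    }
    for path in all_paths(flow, "greeting"):
        print(" > ".join(path))
```

- Configuration settings that alter the call flow could be handled by generating one graph per configuration and enumerating each.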
- at step 2014 , some or all of the application functionality may pass the pre-determined criteria, at which time system usage testing ensues at step 2015 . If some or all of the application functionality does not meet the criteria for release, then at step 2016 , bugs in the application are identified and fixed. The application may then proceed to another testing phase at step 2013 until every component passes at step 2009 . In this way an enterprise may receive a high quality product that has been rigorously tested by dedicated quality assurance professionals, and any known issues that were not addressed in the shipped version of the application are noted in end-user documentation.
- at step 2015 , application system usage ability is analyzed.
- System usage reports describe the anticipated results of the application when it is deployed, including expected automation and satisfaction rates.
- the functionality and behavior of the application is also validated through a series of focus groups and system usage tests. These take the form of interactive discussion groups with the appropriate types of target callers, as well as system usage tests with live callers who are asked to complete a series of specific interaction tasks using equipment that simulates or physically represents customer premise equipment.
- Quantitative and qualitative results from the system usage reports are analyzed and appropriate modifications are made to the application.
- Through the inclusion of system usage tests, enterprises receive a high quality VUI that has already been experienced and commented on by consumers with similar profiles to their customers. Enterprises do not have to undertake the extensive costs associated with doing their own system usage testing.
- Process steps 2001 through 2015 may be repeated over several validation iterations beginning back at step 2001 and progressing through the entire cycle again.
- One with skill in the art of voice application design will appreciate that the method of the present invention can be extended to client side environments for further fine-tuning of voice application functionality after deployment.
- the method of the invention can be implemented as a software suite, a software wizard, or a text instruction without departing from the spirit and scope of the present invention.
- pre-sale documentation such as PRD, VUI, and system usage reporting may also be extended to client software using any or a combination of the above-described vehicles without departing from the spirit and scope of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
Abstract
A method is provided for creating a voice application comprising steps of (a) gathering voice application requirements; (b) generating a product requirements document; (c) creating a voice user interface from information generated at step (b); (d) implementing the interface created at step (c); (e) creating one or more data interfaces from information generated at step (b); (f) implementing the one or more interfaces; (g) generating one or more templates for system usage reporting; and (h) conducting one or more system usage studies. The method may be enabled by computer system execution of a machine-readable instruction.
Description
- The present application is a Continuation of co-pending U.S. patent application Ser. No. 11/036,293, filed on Jan. 13, 2005, the disclosure of which is incorporated by reference herein. That application claims priority to provisional patent application 60/591,817 filed on Jul. 27, 2004. That application is also a Continuation In Part of U.S. patent application Ser. No. 10/190,077, entitled “Method for Automated Harvesting of Data from A Web site using a Voice Portal System”, filed on Jul. 2, 2002, which is a Continuation In Part of U.S. patent application Ser. No. 10/173,333, entitled “Method and Apparatus for Development and Deployment of a Voice Software Application for Distribution to one or more Application Consumers”, filed on Jun. 14, 2002. The disclosures of all of the above referenced applications are incorporated by reference herein in their entirety.
- The present invention is in the area of voice application software systems and pertains particularly to an improved method for developing voice application systems that enables in-progress documentation and subsequent demonstration of voice application features, capabilities, and configurable options to end users of an enterprise.
- A speech application is one of the most challenging applications to develop, deploy, and maintain in a communications (typically telephony) environment. Expertise required for developing and deploying a viable application includes expertise in computer telephony integration (CTI) and software, voice recognition software, text-to-speech hardware and software, and speech application logic.
- With the relatively recent advent of Voice Extensible Markup Language (VXML), the expertise required to develop a speech solution has been reduced somewhat. VXML is a language that enables a software developer to focus on the application logic of the voice application without being required to configure underlying telephony components. Typically, the developed voice application is run on a VXML interpreter that resides on and executes on the associated telephony system to deliver the solution.
- A typical architecture of a VXML-compliant telephony system comprises a voice application server and a VXML-compliant telephony server. Typical steps for development and deployment of a VXML-enabled IVR solution are briefly described below.
- Firstly, a new application database is created or an existing one is modified to support VXML. Application logic is designed in terms of workflow and adapted to handle the routing operations of the IVR system. VXML pages, which are results of functioning application logic, are rendered by a VXML rendering engine based on a specified generation sequence.
- Secondly, an object facade to the telephony server 130 is created comprising the corresponding VXML pages and is sent to the telephony server over a network, which can be the Internet, an Intranet, or an Ethernet network. The VXML pages are integrated into the rendering engine such that they can be displayed according to set workflow at the voice application server.
- Thirdly, the VXML-telephony server is configured to enable proper retrieval of specific VXML pages from the rendering engine within the voice application server. A triggering mechanism is provided to the voice application server so that when a triggering event occurs, an appropriate outbound call is placed from the server.
- A VXML interpreter, a voice recognition text-to-speech engine, and the telephony hardware/software are provided within the VXML-telephony server and comprise server function. In early art, the telephony hardware/software along with the VXML interpreter was and still is, in some cases, packaged as an off-the-shelf IVR-enabling technology. Arguably the most important feature, however, of the entire system is the voice application server. The application logic is typically written in a programming language such as Java and packaged as an enterprise Java Bean archive. The presentation logic required is handled by the rendering engine and is written in JSP or PERL.
- Later in the art according to at least one system known to the inventor, improvements were made over prior art. For example, one system described in a U.S. patent reference entitled “Method and Apparatus for Development and Deployment of a Voice Software Application for Distribution to one or more Application Consumers”, which is referenced above as a priority document, includes a voice application server that is connected to a data network for storing and serving voice applications. The voice application server has a data connection to a network communications server connected to a communications network such as the well-known PSTN network. The communication server routes the created voice applications to their intended recipients.
- The above system includes a computer station connected to the data network, the computer capable of accessing the voice application server over the network. The system also includes a special client software application hosted on the computer station, the application for enabling users to create voice applications and manage their states. In this system, the user creates voice applications using object modeling and linking. The applications, once created, are then stored in the voice application server for deployment. The user can control and manage deployment and state of deployed applications, including scheduled deployment and repeat deployments in terms of intended recipients.
- There are several enhanced features available with this system, including a capability of developing and deploying a voice application using Web-based data as source data. The voice application server in this system has the capability of accessing a network server and Web site hosted therein and pulling data from the site for use in the voice application. An operator of the computer station provides templates that the application server may use in data-to-voice (TTS) rendering. Therefore, the Web-based data targeted may be synthesized and spoken to an end user interacting with the application. Enhanced data organization features and security features are also provided with this system.
- In such a system where templates are used to enable voice application dialog transactions, voice application rules and voice recognition data are consulted for the appropriate content interpretation and response protocol so that the synthesized voice presented as response dialog through the voice portal to the user is both appropriate in content and hopefully error free in expression. The database is therefore optimized with vocabulary words that enable a very wide range of speech covering many different vocabulary words akin to many differing business scenarios. Later enhancements made to this system include a capability of tailoring vocabulary recognition engines for active voice applications according to client parameters.
- To achieve the above, a vocabulary management system is included as part of the system and is adapted to constrain voice recognition processing associated with text-to-speech and speech-to-text rendering associated with any application in a state of interaction with a user. The management portion of the system includes a vocabulary management server connected to the voice application server and to the telephony server, and an instance of vocabulary management software running on the management server. In practice, an administrator accessing the vocabulary management server uses the vocabulary management software to create unique vocabulary sets or lists that are specific to selected portions of vocabulary associated with target data sources, the vocabulary sets differing in content according to administrator direction.
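As a rough illustration of how administrator-defined vocabulary sets might constrain recognition, the following Python sketch filters candidate recognition hypotheses against the set associated with a target data source. The set names, the words, and the functions `constrain_hypotheses` and `best_match` are hypothetical and are not taken from the referenced system.

```python
from typing import Dict, List, Optional, Set

# Hypothetical vocabulary sets keyed by target data source. An administrator
# would create these; the names and words here are illustrative assumptions.
VOCABULARY_SETS: Dict[str, Set[str]] = {
    "banking": {"balance", "transfer", "checking", "savings"},
    "travel": {"flight", "departure", "arrival", "gate"},
}


def constrain_hypotheses(hypotheses: List[str], source: str) -> List[str]:
    """Keep only the hypotheses found in the vocabulary set for `source`."""
    active = VOCABULARY_SETS.get(source, set())
    return [word for word in hypotheses if word.lower() in active]


def best_match(hypotheses: List[str], source: str) -> Optional[str]:
    """Return the first in-vocabulary hypothesis.

    Assumes the recognizer supplies hypotheses ranked best-first.
    """
    allowed = constrain_hypotheses(hypotheses, source)
    return allowed[0] if allowed else None
```

In this sketch an out-of-vocabulary word such as "ballots" would be discarded for the banking set even if the recognizer ranked it first, which is one plausible reading of constraining recognition by vocabulary set.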
- Many other enhancements, all of which are known to the inventors, have been provided through system development and refinement over time. Some of these more static improvements include capabilities for static and dynamic resource caching; constraint-based dialect recognition; behavioral adaptation with dynamic response selection; script language bridging between disparate speech engines; and needs inference with dynamic response selection.
- With the advent of end-user-configurable voice applications, which have only recently materialized, many vendors are rushing to provide pre-packaged IVR and self-service voice-application solutions that end users may purchase and employ within their own customer premise equipment (CPE) for servicing their caller base.
- It is desirable that a voice application as a packaged software be useable immediately out of the box. Such a solution should be platform independent, configurable without requiring professional services, and be easily upgradeable. However, software voice applications available on the market may exhibit all or some of the following weaknesses.
- They may only be available through professional services or connected to a professional service fee.
- They may run only on a single platform or work with a single brand of speech engine.
- They may contain no pre-built data interfaces.
- There may be no available cohesive documentation explaining the development, testing, and application feature upgrading process.
- The vendor may be unable to articulate features clearly, produce a useable roadmap for the product, or even be able to demonstrate the product in a way that benefits the customer and enables the customer to determine if the application will meet the needs of the enterprise.
- What is clearly needed is an improved design method and system that allows a developer of a Voice XML system to produce a usable voice application that solves the problems above.
- A method is provided for creating a voice application comprising steps of (a) gathering voice application requirements; (b) generating a product requirements document; (c) creating a voice user interface from information generated at step (b); (d) implementing the interface created at step (c); (e) creating one or more data interfaces from information generated at step (b); (f) implementing the one or more interfaces; (g) generating one or more templates for system usage reporting; and (h) conducting one or more system usage studies.
- In a preferred embodiment, in step (a), the voice application requirements include data about defined customers, internal and external data sources and paths, platform and speech engine used, business rules and practices, and data transformation protocols. In one embodiment, in step (b), the product requirements document is in the form of an electronic wizard containing fields for data entry.
- In one embodiment, in step (b), the product requirements document is an executable part of the voice application software and functions as a tutorial and specification of feature options including test result data, the document available on a pre-sale basis. In a preferred embodiment, in step (c) the voice user interface is tested for system usage. In a preferred embodiment, in step (c), the voice user interface provides documentation of system usage for the voice application.
- In a preferred embodiment, in step (d), the voice user interface is implemented on architecture that simulates or physically represents customer premise equipment. Also in a preferred embodiment, in step (e), the one or more data interfaces are adapted to transform internal data to a format usable for Voice extensible Markup Language (VXML) rendering. In this embodiment, in step (e), the one or more data interfaces include one or more interfaces to external Web-based data sources. Still in this embodiment, in step (e), the data interfaces include support for middleware applications used for data migration.
- In one embodiment, in step (f), the data interfaces are implemented on architecture that simulates or physically represents customer premise equipment. In a preferred application, in step (g), the system usage data includes caller behavior patterns, dialog recognition statistics, system satisfaction statistics, and voice application path selection statistics.
- According to another aspect of the present invention, a computer system is provided comprising a processor and a machine-readable instruction that, when executed, enables the machine to facilitate a method for creating a voice application including steps of (a) gathering voice application requirements; (b) generating a product requirements document; (c) creating a voice user interface from information generated at step (b); (d) implementing the interface created at step (c); (e) creating one or more data interfaces from information generated at step (b); (f) implementing the one or more interfaces; (g) generating one or more templates for system usage reporting; and (h) conducting one or more system usage studies. In a preferred embodiment, in step (a), the voice application requirements include data about defined customers, internal and external data sources and paths, platform and speech engine used, business rules and practices, and data transformation protocols. In one embodiment, in step (b), the product requirements document is in the form of an electronic wizard containing fields for data entry.
- In one embodiment, in step (b), the product requirements document is an executable part of the voice application software and functions as a tutorial and specification of feature options including test result data, the document available on a pre-sale basis. In a preferred embodiment, in step (c) the voice user interface is tested for system usage. In a preferred embodiment, in step (c), the voice user interface provides documentation of system usage for the voice application.
- In a preferred embodiment, in step (d), the voice user interface is implemented on architecture that simulates or physically represents customer premise equipment. Also in a preferred embodiment, in step (e), the one or more data interfaces are adapted to transform internal data to a format usable for Voice extensible Markup Language (VXML) rendering. In this embodiment, in step (e), the one or more data interfaces include one or more interfaces to external Web-based data sources. Still in this embodiment, in step (e), the data interfaces include support for middleware applications used for data migration.
- In one embodiment, in step (f), the data interfaces are implemented on architecture that simulates or physically represents customer premise equipment. In a preferred application, in step (g), the system usage data includes caller behavior patterns, dialog recognition statistics, system satisfaction statistics, and voice application path selection statistics.
- FIG. 1 is an architectural overview of a VXML network enabled by provision of voice application development software according to an embodiment of the present invention.
- FIG. 2 is a process flow diagram illustrating a system method for developing a voice application according to an embodiment of the present invention.
- FIG. 1 is an architectural overview 1000 of a VXML network enabled by provision of voice application development software according to an embodiment of the present invention. Voice extensible Markup Language (VXML) network 1000 includes a data network 1001. Data network 1001 may be an Internet network, an Intranet network, or any other type of data network, whether a corporate wide-area network, a local-area network, or a private network. The well-known Internet network is a preferred example of data network 1001 because of its high public access characteristic, geographic reach, and accessibility to other networks. Data network 1001 may be referred to hereinafter as Internet network 1001, although this should not be construed as a limitation of practice.
- VXML network 1000 also includes a telephone network, which in this case is a public-switched-telephone-network (PSTN) 1002. PSTN 1002 may be bridged to Internet 1001 for communication through any one of several known types of gateways (not illustrated).
- A voice application server 1003 is illustrated in this example and has ported access to Internet 1001 by way of an Internet data line 1119.
- Application server 1003 is typical of that known to the inventor and described with reference to a U.S. patent application entitled “Method and Apparatus for Development and Deployment of a Voice Software Application for Distribution to one or more Application Consumers”, Ser. No. 10/173,333. Server 1003 may be hosted by an enterprise and may be an integral portion of the customer premise equipment (CPE) of the enterprise that would benefit from a developed voice application.
- A VXML-enabled telephony server 1004 is illustrated in this example and may be hosted by the same enterprise that hosts server 1003.
- Telephony server 1004 may also be hosted by a separate enterprise providing services to the first enterprise. In actual practice, VXML telephony server 1004 would reside somewhere within PSTN network 1002.
- In a state-of-art VXML network, customers may access VXML functionality from PSTN network 1002, as is illustrated by a voice interaction system 1007, which is connected to a receiving device such as a telephone 1005 over a telephony link 1116. Likewise, PSTN customers may access a Web-based interactive voice portal 1006 located within Internet network 1001 via a network bridge connection 1117. Likewise, interactive voice system 1007 or portal 1006 may be accessed from the Internet side of VXML communication network 1000. There are many variant connection possibilities.
- Telephony server 1004 contains a VXML interpreter 1120 adapted to interpret VXML pages rendered thereto by a VXML rendering engine 1124, which is contained within voice application server 1003. In this embodiment, a text-to-speech engine 1121 transforms text to synthesized speech, which is processed using telephony hardware/software 1122 to deliver voice over telephony trunk 1118 to a telephony switch 1007, to which a customer operating receiving device 1005 is connected over telephone line 1116. Therefore, the domain of interaction is between receiving device 1005 and telephony hardware/software 1122. In one embodiment, switch 1007 may be enhanced locally depending on the architecture.
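To make the hand-off between a rendering engine and a VXML interpreter more concrete, the following Python sketch renders a minimal VoiceXML 2.0 page containing a single spoken prompt. It is only an illustration under stated assumptions: the function name `render_prompt_page` and the form identifier are hypothetical, and an actual rendering engine such as engine 1124 would generate far richer pages from application logic.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C VoiceXML 2.0 Recommendation.
VXML_NS = "http://www.w3.org/2001/vxml"


def render_prompt_page(form_id: str, prompt_text: str) -> str:
    """Render a one-prompt VoiceXML page as a string.

    Hypothetical helper: a page like this is what an interpreter on the
    telephony server would fetch and play through text-to-speech.
    """
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    form = ET.SubElement(vxml, "form", {"id": form_id})
    block = ET.SubElement(form, "block")
    prompt = ET.SubElement(block, "prompt")
    prompt.text = prompt_text  # ElementTree escapes special characters
    return ET.tostring(vxml, encoding="unicode")


if __name__ == "__main__":
    print(render_prompt_page("welcome", "Welcome to the voice portal."))
```

Building the page with an XML library rather than string concatenation keeps the output well-formed even when the prompt text contains characters such as `&` or `<`.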
- Application server 1003 contains voice application logic 1125, database and resource adapter 1126, and voice application validation and configuration software 1127. All of the just-mentioned components are part of a voice application that, according to an embodiment of the present invention, is rigorously tested and tuned before it is released to a client for use.
- A developer station 1009 is illustrated in this example and is adapted for assessing requirements of a voice application and for developing, testing, and documenting the process in order to provide a platform-independent application that an end user may implement immediately on an enterprise VXML system architecture without being required to develop components and with reduced debugging and configuration tasking.
- Developer station 1009 has an instance of developer software (DSW) 1010 installed thereon and adapted for full software development, from assessing requirements to live system usage testing in a representative CPE architecture. A voice application solution may be developed using DSW 1010 that meets all of the general requirements of one or more similar enterprises. Thus, options may be included in one package that may be useable in more than one business scenario using a variety of architectures and equipment types.
- A client station 1008 is illustrated in this example and represents an administrator's station that may receive and implement a voice application onto VXML architecture 1000. A client software application (CL) 1011 is provided to station 1008, in this example, as a resident piece of a voice application developed with DSW 1010. CL 1011, for example, enables the administrator to implement the voice solution onto voice application server 1003 for use. As well, station 1008 may be used to configure the voice application according to any needs, including use of external data sources and/or internal data sources for VXML rendering. It is noted herein that a voice application developed using DSW may be a static application or a dynamic application that returns data in real time by accessing external or internal data sources while customers interact with the system. One such external data source may be a Web-based data source such as Web server 1128, accessible to voice application server 1003 during runtime via a network link 1115. Server 1128 is also accessible to client station 1008 via a network link 1113.
- As was described in the background section of this specification, major challenges in third-party voice application development include testing issues, system usage issues, VUI development issues, proprietary issues, and service dependency issues. DSW 1010 contains process development tools that provide solutions to the challenges at the vendor's side before shipment and implementation at client CPE. In this way, many tasks related to software development, coding, voice user interface design, database and data resource adaptation, and data transformation from existing systems into VXML-interpretable states using applicable middleware solutions are eliminated altogether from the point of view of the client.
- FIG. 2 is a process flow diagram 2000 illustrating a systematic method used in development of a voice application according to an embodiment of the present invention. At step 2001, voice application requirements are gathered. Voice application requirements are gathered from multiple sources, including requirements of definitional customers (customers whose needs can be defined), existing voice application systems, and data sources including Web sources. After product requirements are quantified, a product requirements document (PRD) is generated at step 2002. A PRD contains all of the requirements for completing a useable voice application. All application features and components are described within the PRD. In one embodiment the PRD is manually generated and maintained. However, it is possible in one embodiment to automatically generate a PRD using an electronic wizard. Such a generated PRD may, in some embodiments, be replayed using automated presentation means such as a wizard or a tutorial.
- An enterprise may review a PRD specific to a voice application and may gain an understanding of whether the application functionality meets its needs before making a purchase. Because the PRD describes features that might be implemented in a future release of a product, enterprises may use the future release information as a roadmap for the application. The PRD may also describe useful application capabilities that enterprises had not previously contemplated or thought possible to automate.
- A PRD describes in detail the requirements that support the application itself, including reporting features and requirements and configuration options and requirements. In this way an enterprise may understand upfront the full scope of what is required in order to implement the product. The defined configuration options help determine the amount of effort required to modify the application to meet specific enterprise requirements. Options for PRD presentation include, but shall not be limited to, forms ranging from simple documentation, such as a read-only electronic document that may be printed, to a more robust presentation, such as one made viewable electronically by an electronic wizard or tutorial.
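One way to picture a wizard-style PRD with fields for data entry is as a structured record whose unfilled fields can be listed for the person completing it. The Python sketch below assumes a field layout drawn from the requirement categories named for step (a); the class `ProductRequirements` and the helper `missing_fields` are hypothetical and are not part of the described method.

```python
from dataclasses import dataclass, field, fields
from typing import List


@dataclass
class ProductRequirements:
    """Hypothetical PRD record; field names mirror the step (a) categories."""
    customers: List[str] = field(default_factory=list)
    data_sources: List[str] = field(default_factory=list)
    platform: str = ""
    speech_engine: str = ""
    business_rules: List[str] = field(default_factory=list)
    data_transformations: List[str] = field(default_factory=list)


def missing_fields(prd: ProductRequirements) -> List[str]:
    """List the fields a wizard would still need to prompt for."""
    return [f.name for f in fields(prd) if not getattr(prd, f.name)]


if __name__ == "__main__":
    draft = ProductRequirements(customers=["retail bank"], platform="VXML gateway")
    print(missing_fields(draft))
```

A record of this kind could be rendered either as a printable read-only document or stepped through interactively, which corresponds to the two presentation options described above.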
- At step 2003 a voice user interface (VUI) design is created. A VUI is designed with consideration given to multiple configuration options listed as available in the PRD of step 2002. In this step, a design is selected, tested, and prototyped using a focus group before any hard coding begins. Best VUI design practices are incorporated into the design, increasing the quality of the application and reducing or eliminating a requirement for an enterprise to acquire services from a specialist with VUI design skills. A style of interface is selected that is appropriate to the application functionality and the type of business the application is created for.
- A comprehensive VUI design that includes a range of different user behaviors may result in higher call completion rates during actual use. The VUI specification undergoes several design reviews to ensure that it accurately reflects the requirements in the PRD and contains best VUI design practices. At step 2004, the VUI design is validated. An enterprise may review a VUI design and determine whether it is appropriate for their callers before making a commitment.
- At step 2005, VUI documentation is created wherein the VUI design specification serves partly as documentation for the voice application and is available for reference at a later date when application modifications are made.
- At step 2006 the designed VUI is implemented. A voice application is implemented based on the VUI design and the business and processing rules specified in the PRD. Prompts, grammars, dialog flows, rules, and configuration options are put in place in this step. Coding standards and consistent naming conventions are applied during the implementation. Coding standards are reviewed as well to ensure the quality of the implementation. Because applications are consistently coded to predefined standards, they are easier to maintain and upgrade. At step 2007 application functionality and behavior may be tested. In this step an enterprise can easily evaluate and test an application's functionality and behavior without utilizing its own CPE before purchase. Application testing may be carried out for any or all of the working components of the application.
- At step 2008, if the application passes all initial testing, then at step 2010, appropriate data interfaces are implemented. If some components have failed testing at step 2008, then those bugs are fixed in step 2009 and the components are re-tested until all components pass.
- At step 2010, data interfaces are implemented. Interfaces are created to map internal enterprise data into the appropriate places in the call flow of the voice application. The mapping of data from enterprise systems into appropriate formats for VXML rendering in voice applications is performed in step 2010 and therefore does not have to be custom developed on the enterprise side after purchase. Data mapping typically includes transformation of back-end data into the formats necessary for high-quality voice read back, including the association of recorded prompts with variable data values. Back-end integration is simplified by provision of standardized and well-defined data interfaces and a data-to-VUI mapping layer. This pre-integration with common enterprise back-end systems significantly reduces the time and cost of integrating the applications into production environments on the enterprise side. The interfaces may conform to any data integration standard, such as Open Financial Exchange (OFX) in financial services, for example.
- It is noted herein that a voice application implemented on a client CPE performs much better and is initially more reliable when it has been tested thoroughly before it is implemented on enterprise CPE.
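The data mapping of step 2010, in which recorded prompts are associated with variable data values for voice read back, might be sketched in Python as follows. The prompt file names, the `audio:` and `tts:` prefixes, and the function `balance_to_prompts` are all illustrative assumptions; an actual data-to-VUI mapping layer would follow the conventions of the deployed platform.

```python
from decimal import Decimal
from typing import List


def balance_to_prompts(balance: str) -> List[str]:
    """Map a back-end balance value to an ordered prompt sequence.

    Hypothetical convention: "audio:" entries name pre-recorded prompts and
    "tts:" entries are variable values to be synthesized. Only non-negative
    balances are handled in this sketch.
    """
    amount = Decimal(balance).quantize(Decimal("0.01"))
    if amount < 0:
        raise ValueError("negative balances are outside this sketch")
    dollars, cents = divmod(int(amount * 100), 100)
    return [
        "audio:your_balance_is.wav",
        f"tts:{dollars} dollars",
        "audio:and.wav",
        f"tts:{cents} cents",
    ]


if __name__ == "__main__":
    for item in balance_to_prompts("1234.5"):
        print(item)
```

Separating fixed recorded phrases from variable values in this way is one common approach to read back; it lets the recorded voice carry most of the sentence while synthesis covers only the data.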
- At step 2011, standardized reporting templates are created that are adapted to contain test result data, caller behavioral data, statistical data, and other important data for use in fine-tuning application functionality and system usage. Detailed report information is available in real time, once an application is deployed, to business users about the functioning of the application and the behavior of callers accessing it. The reports describe what dialog paths callers traversed, what was recognized in each dialog, what errors they encountered, how many times and when they requested help, and whether they successfully completed their transactions, so that the response rate to promotions or marketing offers placed in the dialog flow can readily be assessed and enterprises can easily identify problem areas or “hot spots” in the application that might require tuning.
- At step 2012 application functionality and behavioral results are documented. Online documentation enables an enterprise to understand how to modify application behavior when they need to change business rules or insert new application functionality. The functionality and behavior of the application is documented in detail. The documentation also describes how to configure the application and contains the complete call flow. The documentation also serves as a reference point for determining correct application behavior in the event of application errors.
- At step 2013, application functionality is validated. In this step, functionality and behavior of an application are validated through a quality assurance process based on a previously defined test plan. All paths through the call flow may be tested, including those that might be altered by configuration settings. An application does not proceed through to a production release until it meets pre-determined quality criteria.
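Testing all paths through a call flow, as contemplated at step 2013, presupposes some way of enumerating those paths. The Python sketch below does so by depth-first traversal over a call flow represented as a mapping from each dialog state to its possible next states. The example flow, its state names, and the function `enumerate_paths` are hypothetical; the representation is an assumption made for illustration.

```python
from typing import Dict, List

# Hypothetical call flow: each dialog state maps to its possible next states.
CALL_FLOW: Dict[str, List[str]] = {
    "welcome": ["main_menu"],
    "main_menu": ["balance", "transfer"],
    "balance": ["goodbye"],
    "transfer": ["confirm"],
    "confirm": ["goodbye"],
    "goodbye": [],
}


def enumerate_paths(flow: Dict[str, List[str]], start: str) -> List[List[str]]:
    """Return every path from `start` to a terminal state (depth-first).

    States already on the current path are skipped, so a branch that can
    only loop back on itself is not reported as a path.
    """
    paths: List[List[str]] = []

    def walk(node: str, path: List[str]) -> None:
        path = path + [node]
        successors = flow.get(node, [])
        if not successors:
            paths.append(path)  # terminal state reached
            return
        for nxt in successors:
            if nxt not in path:
                walk(nxt, path)

    walk(start, [])
    return paths


if __name__ == "__main__":
    for p in enumerate_paths(CALL_FLOW, "welcome"):
        print(" -> ".join(p))
```

Paths altered by configuration settings could be covered by running the same enumeration once per configuration, each with its own version of the flow mapping.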
- At step 2014 some or all of application functionality may pass the pre-determined criteria, at which time system usage testing ensues at step 2015. If some or all of application functionality does not meet the criteria for release, then at step 2016, bugs in the application are identified and fixed. The application may then proceed to another testing phase at step 2013 until every component passes at step 2009. In this way an enterprise may receive a high quality product that has been rigorously tested by dedicated quality assurance professionals, and any known issues that were not addressed in the shipped version of the application are noted in end-user documentation.
- At step 2015, application system usage ability is analyzed. System usage reports describe the anticipated results of the application when it is deployed, including expected automation and satisfaction rates. The functionality and behavior of the application is also validated through a series of focus groups and system usage tests. These take the form of interactive discussion groups with the appropriate types of target callers, as well as system usage tests with live callers who are asked to complete a series of specific interaction tasks using equipment that simulates or physically represents customer premise equipment. Quantitative and qualitative results from the system usage reports are analyzed and appropriate modifications are made to the application. Through the inclusion of system usage tests, enterprises receive a high quality VUI that has already been experienced and commented on by consumers with similar profiles to their customers. Enterprises do not have to undertake the extensive costs associated with doing their own system usage testing.
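The quantitative side of a system usage report could be assembled along the lines of the Python sketch below, which summarizes a set of test calls into a completion-based automation rate, an average count of help requests, and a list of the dialog states where incomplete calls ended. The record layout, the definition of automation rate as the fraction of completed calls, and the names `CallRecord` and `summarize` are assumptions made for illustration only.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class CallRecord:
    """Hypothetical per-call log entry from a system usage test."""
    path: List[str]          # dialog states traversed, in order
    help_requests: int       # times the caller asked for help
    recognition_errors: int  # recognition failures during the call
    completed: bool          # whether the caller finished the task


def summarize(calls: List[CallRecord]) -> Dict[str, object]:
    """Aggregate call records into a small usage report."""
    total = len(calls)
    if total == 0:
        return {"calls": 0, "automation_rate": 0.0,
                "avg_help_requests": 0.0, "hot_spots": []}
    completed = sum(1 for c in calls if c.completed)
    # A "hot spot" here is the last state reached by an incomplete call.
    stalls = Counter(c.path[-1] for c in calls if not c.completed and c.path)
    return {
        "calls": total,
        "automation_rate": completed / total,
        "avg_help_requests": sum(c.help_requests for c in calls) / total,
        "hot_spots": stalls.most_common(),
    }
```

A report built this way gives a first indication of which dialog states might require tuning before the qualitative focus group results are considered.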
- Process steps 2001 through 2015 may be repeated over several validation iterations, beginning back at step 2001 and progressing through the entire cycle again. One with skill in the art of voice application design will appreciate that the method of the present invention can be extended to client-side environments for further fine-tuning of voice application functionality after deployment. On the developer's side, the method of the invention can be implemented as a software suite, a software wizard, or a text instruction without departing from the spirit and scope of the present invention. Likewise, pre-sale documentation such as PRD, VUI, and system usage reporting may also be extended to client software using any or a combination of the above-described vehicles without departing from the spirit and scope of the present invention.
- The spirit and scope of the present invention should only be limited by the following claims.
Claims (1)
1. A method for creating a voice application comprising steps of:
(a) gathering voice application requirements;
(b) generating a product requirements document;
(c) creating a voice user interface from information generated at step (b);
(d) implementing the interface created at step (c);
(e) creating one or more data interfaces from information generated at step (b);
(f) implementing the one or more interfaces;
(g) generating one or more templates for system usage reporting; and
(h) conducting one or more system usage studies.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/490,194 US20100050150A1 (en) | 2002-06-14 | 2009-06-23 | Method and System for Developing Speech Applications |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/173,333 US20030007609A1 (en) | 2001-07-03 | 2002-06-14 | Method and apparatus for development, deployment, and maintenance of a voice software application for distribution to one or more consumers |
US10/190,077 US20030055884A1 (en) | 2001-07-03 | 2002-07-02 | Method for automated harvesting of data from a Web site using a voice portal system |
US59181704P | 2004-07-27 | 2004-07-27 | |
US11/036,293 US20050149331A1 (en) | 2002-06-14 | 2005-01-13 | Method and system for developing speech applications |
US12/490,194 US20100050150A1 (en) | 2002-06-14 | 2009-06-23 | Method and System for Developing Speech Applications |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/036,293 Continuation US20050149331A1 (en) | 2002-06-14 | 2005-01-13 | Method and system for developing speech applications |
Publications (1)
Publication Number | Publication Date |
---|---|
US20100050150A1 (en) | 2010-02-25 |
Family
ID=46303700
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/036,293 Abandoned US20050149331A1 (en) | 2002-06-14 | 2005-01-13 | Method and system for developing speech applications |
US12/490,194 Abandoned US20100050150A1 (en) | 2002-06-14 | 2009-06-23 | Method and System for Developing Speech Applications |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/036,293 Abandoned US20050149331A1 (en) | 2002-06-14 | 2005-01-13 | Method and system for developing speech applications |
Country Status (1)
Country | Link |
---|---|
US (2) | US20050149331A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106383720A (en) * | 2016-09-30 | 2017-02-08 | 山东浪潮云服务信息科技有限公司 | Cloud sharing and collaborating method of software product prototype |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050163136A1 (en) * | 2003-11-17 | 2005-07-28 | Leo Chiu | Multi-tenant self-service VXML portal |
US7697673B2 (en) | 2003-11-17 | 2010-04-13 | Apptera Inc. | System for advertisement selection, placement and delivery within a multiple-tenant voice interaction service system |
US8768711B2 (en) * | 2004-06-17 | 2014-07-01 | Nuance Communications, Inc. | Method and apparatus for voice-enabling an application |
US7660719B1 (en) * | 2004-08-19 | 2010-02-09 | Bevocal Llc | Configurable information collection system, method and computer program product utilizing speech recognition |
US20070055520A1 (en) * | 2005-08-31 | 2007-03-08 | Microsoft Corporation | Incorporation of speech engine training into interactive user tutorial |
US8315874B2 (en) * | 2005-12-30 | 2012-11-20 | Microsoft Corporation | Voice user interface authoring tool |
US7937687B2 (en) * | 2006-09-01 | 2011-05-03 | Verizon Patent And Licensing Inc. | Generating voice extensible markup language (VXML) documents |
US8838455B1 (en) * | 2008-06-13 | 2014-09-16 | West Corporation | VoiceXML browser and supporting components for mobile devices |
MX2011002420A (en) * | 2008-12-19 | 2011-04-05 | Ibm | Method and system for generating vocal user interface code from a data meta-model. |
US9081550B2 (en) * | 2011-02-18 | 2015-07-14 | Nuance Communications, Inc. | Adding speech capabilities to existing computer applications with complex graphical user interfaces |
EP2608506A1 (en) * | 2011-12-01 | 2013-06-26 | Krea Icerik Hizmetleri Ve Produksiyon Anonim Sirketi | A System And Method For Developing and monitoring IVR dialogs |
US9607102B2 (en) * | 2014-09-05 | 2017-03-28 | Nuance Communications, Inc. | Task switching in dialogue processing |
CN105094807A (en) * | 2015-06-25 | 2015-11-25 | 三星电子(中国)研发中心 | Method and device for implementing voice control |
KR102759365B1 (en) | 2016-05-24 | 2025-01-24 | 삼성전자주식회사 | Electronic device having speech recognition function and operating method of Electronic device |
Citations (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010012335A1 (en) * | 1998-02-06 | 2001-08-09 | Lance Kaufman | Preference based telecommunication information service |
US20010013001A1 (en) * | 1998-10-06 | 2001-08-09 | Michael Kenneth Brown | Web-based platform for interactive voice response (ivr) |
US20010037405A1 (en) * | 2000-04-07 | 2001-11-01 | Sideek Sinnathambi Mohamed | Wireless web generation from conventional web sites by pattern identification and dynamic content extraction |
US20010043234A1 (en) * | 2000-01-03 | 2001-11-22 | Mallik Kotamarti | Incorporating non-native user interface mechanisms into a user interface |
US20020010626A1 (en) * | 2000-05-22 | 2002-01-24 | Eyal Agmoni | Internert advertising and information delivery system |
US20020015480A1 (en) * | 1998-12-08 | 2002-02-07 | Neil Daswani | Flexible multi-network voice/data aggregation system architecture |
US20020019881A1 (en) * | 2000-06-16 | 2002-02-14 | Bokhari Wasiq M. | System, method and computer program product for habitat-based universal application of functions to network data |
US20020032234A1 (en) * | 1999-06-01 | 2002-03-14 | Drugtech Corporation | Nutritional supplements |
US20020032564A1 (en) * | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US20020045437A1 (en) * | 2000-08-18 | 2002-04-18 | Alexander Kesler | Tracing a location of a mobile device |
US20020047298A1 (en) * | 2000-10-19 | 2002-04-25 | Vitra Patente Ag | Seat made of molded plywood glued together in layers |
US20020052747A1 (en) * | 2000-08-21 | 2002-05-02 | Sarukkai Ramesh R. | Method and system of interpreting and presenting web content using a voice browser |
US20020062393A1 (en) * | 2000-08-10 | 2002-05-23 | Dana Borger | Systems, methods and computer program products for integrating advertising within web content |
US20020077823A1 (en) * | 2000-10-13 | 2002-06-20 | Andrew Fox | Software development systems and methods |
US20020091524A1 (en) * | 2000-10-25 | 2002-07-11 | David Guedalia | Method and system for voice browsing web sites |
US20020111855A1 (en) * | 2001-01-23 | 2002-08-15 | Hammerstad Diane R. | Delivery of time significant advertising |
US20020156815A1 (en) * | 2001-04-19 | 2002-10-24 | International Business Machines Corporation | Method and apparatus for the separation of web layout, logic, and data when used in server-side scripting languages |
US20020173964A1 (en) * | 2001-03-30 | 2002-11-21 | International Business Machines Corporation | Speech driven data selection in a voice-enabled program |
US20020184610A1 (en) * | 2001-01-22 | 2002-12-05 | Kelvin Chong | System and method for building multi-modal and multi-channel applications |
US20020198719A1 (en) * | 2000-12-04 | 2002-12-26 | International Business Machines Corporation | Reusable voiceXML dialog components, subdialogs and beans |
US20030007609A1 (en) * | 2001-07-03 | 2003-01-09 | Yuen Michael S. | Method and apparatus for development, deployment, and maintenance of a voice software application for distribution to one or more consumers |
US20030032409A1 (en) * | 2001-03-16 | 2003-02-13 | Hutcheson Stewart Douglas | Method and system for distributing content over a wireless communications system |
US20030046316A1 (en) * | 2001-04-18 | 2003-03-06 | Jaroslav Gergic | Systems and methods for providing conversational computing via javaserver pages and javabeans |
US20030065743A1 (en) * | 2001-09-28 | 2003-04-03 | Jenny Patrick Duncan | Method and system for distributing requests for content |
US20030088422A1 (en) * | 2001-11-01 | 2003-05-08 | Denenberg Lawrence A | Method and system for providing a voice application bookmark |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6199082B1 (en) * | 1995-07-17 | 2001-03-06 | Microsoft Corporation | Method for delivering separate design and content in a multimedia publishing system |
US6064666A (en) * | 1996-11-15 | 2000-05-16 | International Business Machines Corporation | Cross service common user image association |
US6014437A (en) * | 1997-02-03 | 2000-01-11 | International Business Machines Corporation | Multi service platform architecture for telephone networks |
US5974413A (en) * | 1997-07-03 | 1999-10-26 | Activeword Systems, Inc. | Semantic user interface |
US6269336B1 (en) * | 1998-07-24 | 2001-07-31 | Motorola, Inc. | Voice browser for interactive services and methods thereof |
US6385583B1 (en) * | 1998-10-02 | 2002-05-07 | Motorola, Inc. | Markup language for interactive services and methods thereof |
US6185535B1 (en) * | 1998-10-16 | 2001-02-06 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice control of a user interface to service applications |
US7082397B2 (en) * | 1998-12-01 | 2006-07-25 | Nuance Communications, Inc. | System for and method of creating and browsing a voice web |
US6757362B1 (en) * | 2000-03-06 | 2004-06-29 | Avaya Technology Corp. | Personal virtual assistant |
US6466654B1 (en) * | 2000-03-06 | 2002-10-15 | Avaya Technology Corp. | Personal virtual assistant with semantic tagging |
US6501956B1 (en) * | 2000-10-17 | 2002-12-31 | Intervoice Limited Partnership | Providing blended interface for wireless information services |
US20020193997A1 (en) * | 2001-03-09 | 2002-12-19 | Fitzpatrick John E. | System, method and computer program product for dynamic billing using tags in a speech recognition framework |
US7174297B2 (en) * | 2001-03-09 | 2007-02-06 | Bevocal, Inc. | System, method and computer program product for a dynamically configurable voice portal |
US6870913B2 (en) * | 2001-03-15 | 2005-03-22 | Voxiva, Inc. | System and method for collecting, disseminating and managing information using a voice and data base system |
US6801604B2 (en) * | 2001-06-25 | 2004-10-05 | International Business Machines Corporation | Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources |
US6813342B1 (en) * | 2001-10-17 | 2004-11-02 | Bevocal, Inc. | Implicit area code determination during voice activated dialing |
US6978129B1 (en) * | 2002-03-28 | 2005-12-20 | Bellsouth Intellectual Property Corporation | Telephone with dynamically programmable push buttons for access to advanced applications |
- 2005-01-13: US application US11/036,293, published as US20050149331A1 (en); status: not active, Abandoned
- 2009-06-23: US application US12/490,194, published as US20100050150A1 (en); status: not active, Abandoned
Patent Citations (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010012335A1 (en) * | 1998-02-06 | 2001-08-09 | Lance Kaufman | Preference based telecommunication information service |
US20010013001A1 (en) * | 1998-10-06 | 2001-08-09 | Michael Kenneth Brown | Web-based platform for interactive voice response (ivr) |
US20020015480A1 (en) * | 1998-12-08 | 2002-02-07 | Neil Daswani | Flexible multi-network voice/data aggregation system architecture |
US20020032234A1 (en) * | 1999-06-01 | 2002-03-14 | Drugtech Corporation | Nutritional supplements |
US20010043234A1 (en) * | 2000-01-03 | 2001-11-22 | Mallik Kotamarti | Incorporating non-native user interface mechanisms into a user interface |
US20010037405A1 (en) * | 2000-04-07 | 2001-11-01 | Sideek Sinnathambi Mohamed | Wireless web generation from conventional web sites by pattern identification and dynamic content extraction |
US20020032564A1 (en) * | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US20020010626A1 (en) * | 2000-05-22 | 2002-01-24 | Eyal Agmoni | Internert advertising and information delivery system |
US20020019881A1 (en) * | 2000-06-16 | 2002-02-14 | Bokhari Wasiq M. | System, method and computer program product for habitat-based universal application of functions to network data |
US20020062393A1 (en) * | 2000-08-10 | 2002-05-23 | Dana Borger | Systems, methods and computer program products for integrating advertising within web content |
US20020045437A1 (en) * | 2000-08-18 | 2002-04-18 | Alexander Kesler | Tracing a location of a mobile device |
US20020052747A1 (en) * | 2000-08-21 | 2002-05-02 | Sarukkai Ramesh R. | Method and system of interpreting and presenting web content using a voice browser |
US20020077823A1 (en) * | 2000-10-13 | 2002-06-20 | Andrew Fox | Software development systems and methods |
US20020047298A1 (en) * | 2000-10-19 | 2002-04-25 | Vitra Patente Ag | Seat made of molded plywood glued together in layers |
US20020091524A1 (en) * | 2000-10-25 | 2002-07-11 | David Guedalia | Method and system for voice browsing web sites |
US20020198719A1 (en) * | 2000-12-04 | 2002-12-26 | International Business Machines Corporation | Reusable voiceXML dialog components, subdialogs and beans |
US20020184610A1 (en) * | 2001-01-22 | 2002-12-05 | Kelvin Chong | System and method for building multi-modal and multi-channel applications |
US20020111855A1 (en) * | 2001-01-23 | 2002-08-15 | Hammerstad Diane R. | Delivery of time significant advertising |
US20030032409A1 (en) * | 2001-03-16 | 2003-02-13 | Hutcheson Stewart Douglas | Method and system for distributing content over a wireless communications system |
US20020173964A1 (en) * | 2001-03-30 | 2002-11-21 | International Business Machines Corporation | Speech driven data selection in a voice-enabled program |
US20030046316A1 (en) * | 2001-04-18 | 2003-03-06 | Jaroslav Gergic | Systems and methods for providing conversational computing via javaserver pages and javabeans |
US20020156815A1 (en) * | 2001-04-19 | 2002-10-24 | International Business Machines Corporation | Method and apparatus for the separation of web layout, logic, and data when used in server-side scripting languages |
US20030007609A1 (en) * | 2001-07-03 | 2003-01-09 | Yuen Michael S. | Method and apparatus for development, deployment, and maintenance of a voice software application for distribution to one or more consumers |
US20030009339A1 (en) * | 2001-07-03 | 2003-01-09 | Yuen Michael S. | Method and apparatus for improving voice recognition performance in a voice application distribution system |
US20030055884A1 (en) * | 2001-07-03 | 2003-03-20 | Yuen Michael S. | Method for automated harvesting of data from a Web site using a voice portal system |
US20030065743A1 (en) * | 2001-09-28 | 2003-04-03 | Jenny Patrick Duncan | Method and system for distributing requests for content |
US20030088422A1 (en) * | 2001-11-01 | 2003-05-08 | Denenberg Lawrence A | Method and system for providing a voice application bookmark |
Also Published As
Publication number | Publication date |
---|---|
US20050149331A1 (en) | 2005-07-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20100050150A1 (en) | Method and System for Developing Speech Applications | |
US7526681B2 (en) | Software testing framework | |
US20110299672A1 (en) | System and methods for dynamic integration of a voice application with one or more Web services | |
US7206391B2 (en) | Method for creating and deploying system changes in a voice application system | |
CA2493533C (en) | System and process for developing a voice application | |
US7913229B2 (en) | Computer-implemented system for generating automated tests from a web application | |
EP0948786B1 (en) | System and method for developing a voice dialogue for a voice response unit | |
US7992127B2 (en) | Method and system of encapsulating web site transactions for computer-aided generation of web services | |
US8155959B2 (en) | Dialog system for human agent to correct abnormal output | |
US20050152516A1 (en) | System for managing voice files of a voice prompt server | |
US20110044437A1 (en) | Method and System for Presenting Dynamic Commercial Content to Clients Interacting with a Voice Extensible Markup Language system | |
US20130298110A1 (en) | Software Visualization Using Code Coverage Information | |
US20090119586A1 (en) | Automatic Generation of Interactive Systems From a Formalized Description Language | |
US20080126925A1 (en) | Using a backend simulator to test and develop xforms templates before linking the xforms templates to backend data processing systems | |
US20080127082A1 (en) | System and method for requirements-based application configuration | |
US20050283764A1 (en) | Method and apparatus for validating a voice application | |
US20220206929A1 (en) | Managing a global network of virtual testers | |
US20240143493A1 (en) | System and methods for testing microservices | |
US12013777B2 (en) | Controlling heterogeneous component-based testing in a portable automation framework with test scripts in both API mode and UI mode | |
US11269712B1 (en) | Customized categorial error handling framework for heterogeneous component-based testing in a portable automation framework | |
Schulz et al. | Towards model-based testing with architecture models | |
US20070234279A1 (en) | System and method for creating designs for over the phone voice enabled services | |
Buezas et al. | Umbra designer: Graphical modelling for telephony services | |
KR100348762B1 (en) | Method for creating service scenario for network messaging system | |
Schulz | Integrating performance tests in a generative software development platform |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AS | Assignment | Owner name: APPTERA, INC., CALIFORNIA. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: EHRLICH, STEVEN CLIVE; REEL/FRAME: 023497/0538. Effective date: 20050126 |
 | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |