CN115050361A - Live voice interaction method, device, storage medium and electronic device - Google Patents
Live voice interaction method, device, storage medium and electronic device Download PDFInfo
- Publication number
- CN115050361A CN115050361A CN202210592020.7A CN202210592020A CN115050361A CN 115050361 A CN115050361 A CN 115050361A CN 202210592020 A CN202210592020 A CN 202210592020A CN 115050361 A CN115050361 A CN 115050361A
- Authority
- CN
- China
- Prior art keywords
- information
- voice
- interaction
- state
- liquidity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/21—Server components or server architectures
- H04N21/218—Source of audio or video content, e.g. local disk arrays
- H04N21/2187—Live feed
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4788—Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
本公开关于一种直播语音交互方法、装置、存储介质及电子设备。上述方法包括在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将上述直播应用的工作状态配置为语音交互状态,上述语音交互状态为通过解析语音输入设备接收到的信息确定对象及进行针对上述对象的流通性信息交互的状态;在上述工作状态为上述语音交互状态的情况下,响应于接收到流通性信息的情况,基于上述流通性信息生成针对目标对象的交互管理信息;在上述工作状态为上述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于上述交互触发信息和上述交互管理信息生成上述目标对象的流通性信息交互内容。本公开通过全程语音的方式即可在直播间完成购物。
The present disclosure relates to a live voice interaction method, device, storage medium and electronic device. The above method includes, when the live broadcast application is opened, in response to receiving the voice interaction mode wake-up information, configuring the working state of the above-mentioned live broadcast application as a voice interaction state, and the above voice interaction state is received by parsing the voice input device. The information determines the object and the state of the liquidity information interaction for the above-mentioned object; when the above-mentioned working state is the above-mentioned voice interaction state, in response to the situation of receiving the liquidity information, based on the above-mentioned liquidity information, the interaction for the target object is generated Management information; when the working state is the voice interaction state, in response to receiving the interaction trigger information, the liquidity information interaction content of the target object is generated based on the interaction trigger information and the interaction management information. In the present disclosure, shopping can be completed in the live broadcast room through the whole process of voice.
Description
技术领域technical field
本公开涉及互联网技术领域,尤其涉及直播语音交互方法、装置、存储介质及电子设备。The present disclosure relates to the field of Internet technologies, and in particular, to a method, apparatus, storage medium and electronic device for live voice interaction.
背景技术Background technique
随着互联网技术的发展,直播应用在人们的生活中发挥了越来越多的作用,在直播应用中发展电子商务是大势所趋,但是,相关技术中的直播电商大多依赖于键盘鼠标等字符输入型设备与用户进行交互,这对于一些视力不佳的人群而言是不够友好的,也就是说,相关技术无法为部分特定人群提供交互服务,交互服务的应用场景受到明显限制。With the development of Internet technology, live broadcast applications have played more and more roles in people's lives, and the development of e-commerce in live broadcast applications is the general trend. It is not friendly enough for some people with poor eyesight, that is to say, related technologies cannot provide interactive services for some specific groups of people, and the application scenarios of interactive services are obviously limited.
发明内容SUMMARY OF THE INVENTION
为了解决上述至少一个技术问题,本公开提供直播语音交互方法、装置、存储介质及电子设备,本公开的技术方案如下:In order to solve at least one of the above technical problems, the present disclosure provides a live voice interaction method, device, storage medium and electronic equipment. The technical solutions of the present disclosure are as follows:
根据本公开实施例的第一方面,提供一种直播语音交互方法,包括:According to a first aspect of the embodiments of the present disclosure, there is provided a live voice interaction method, including:
在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将所述直播应用的工作状态配置为语音交互状态,所述语音交互状态为通过解析语音输入设备接收到的信息确定对象及进行针对所述对象的流通性信息交互的状态;When the live broadcast application is opened, in response to receiving the voice interaction mode wake-up information, configure the working state of the live broadcast application as a voice interaction state, where the voice interaction state is information received by parsing the voice input device determine the status of an object and the exchange of liquidity information for said object;
在所述工作状态为所述语音交互状态的情况下,响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息;When the working state is the voice interaction state, in response to receiving the liquidity information, generating interaction management information for the target object based on the liquidity information;
在所述工作状态为所述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于所述交互触发信息和所述交互管理信息生成所述目标对象的流通性信息交互内容。When the working state is the voice interaction state, in response to receiving the interaction trigger information, the interaction content of the liquidity information of the target object is generated based on the interaction trigger information and the interaction management information.
在一示例性实施方式中,所述方法还包括:In an exemplary embodiment, the method further includes:
在所述工作状态为所述语音交互状态的情况下,维持所述语音输入设备的唤醒状态。When the working state is the voice interaction state, the wake-up state of the voice input device is maintained.
在一示例性实施方式中,所述方法还包括:In an exemplary embodiment, the method further includes:
在所述工作状态为所述语音交互状态的情况下,响应于接收到语音交互模式关闭信息的情况,将所述工作状态配置为字符交互状态,并将所述语音输入设备配置为非唤醒状态,所述字符交互状态为通过字符输入设备接收到的信息确定对象及进行针对所述对象的流通性信息交互的状态。When the working state is the voice interaction state, in response to receiving the voice interaction mode closing information, configure the working state as a character interaction state, and configure the voice input device as a non-awakening state , the character interaction state is a state in which an object is determined through information received by a character input device and a liquidity information interaction for the object is performed.
在一示例性实施方式中,所述在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将所述直播应用的工作状态配置为语音交互状态之前,所述方法还包括:In an exemplary embodiment, when the live broadcast application is opened, in response to receiving the voice interaction mode wake-up information, before configuring the working state of the live broadcast application to the voice interaction state, the method further: include:
响应于所述直播应用被打开的情况,触发语音输出设备播报第一信息并且唤醒所述语音输入设备,所述第一信息用于询问用户是否将所述直播应用配置为所述语音交互状态;In response to the fact that the live broadcast application is opened, trigger the voice output device to broadcast first information and wake up the voice input device, where the first information is used to ask the user whether to configure the live broadcast application to the voice interaction state;
解析所述语音输入设备接收到的针对所述第一信息的反馈信息,得到第一解析结果;Parsing the feedback information for the first information received by the voice input device to obtain a first parsing result;
在所述第一解析结果表征所述用户存在语音交互意愿的情况下,判定接收到所述语音交互模式唤醒信息。In the case that the first analysis result indicates that the user has voice interaction willingness, it is determined that the voice interaction mode wake-up information is received.
在一示例性实施方式中,所述在所述工作状态为所述语音交互状态的情况下,响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息之前,所述方法还包括:In an exemplary embodiment, when the working state is the voice interaction state, in response to receiving the liquidity information, before the interaction management information for the target object is generated based on the liquidity information , the method also includes:
响应于直播过程进行到所述目标对象对应的流通性信息交互时段的情况,触发所述语音输出设备播报第二信息,所述第二信息用于询问用户是否关联所述目标对象;In response to a situation in which the live broadcast process reaches the liquidity information interaction period corresponding to the target object, triggering the voice output device to broadcast second information, where the second information is used to ask the user whether to associate with the target object;
解析所述语音输入设备接收到的针对所述第二信息的反馈信息,得到第二解析结果;Parsing the feedback information for the second information received by the voice input device to obtain a second parsing result;
在所述第二解析结果表征所述用户存在关联意愿的情况下,接收所述流通性信息。The liquidity information is received when the second analysis result indicates that the user has a willingness to associate.
在一示例性实施方式中,所述在所述工作状态为所述语音交互状态的情况下,响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息之前,所述方法还包括:In an exemplary embodiment, when the working state is the voice interaction state, in response to receiving the liquidity information, before the interaction management information for the target object is generated based on the liquidity information , the method also includes:
响应于在直播过程中接收到第一目标语音的情况,对所述第一目标语音进行解析;In response to receiving the first target voice during the live broadcast, parsing the first target voice;
响应于解析结果中存在所述目标对象对应的信息的情况,接收所述流通性信息。The liquidity information is received in response to the fact that the information corresponding to the target object exists in the parsing result.
在一示例性实施方式中,所述流通性信息包括至少一个流通性信息项,所述接收所述流通性信息,包括:In an exemplary embodiment, the liquidity information includes at least one item of liquidity information, and the receiving the liquidity information includes:
获取所述目标对象对应的至少一个属性项;obtaining at least one attribute item corresponding to the target object;
针对每一属性项,触发所述语音输出设备播报所述属性项对应的第三信息,所述第三信息用于请求用户对所述属性项对应的至少一个属性值进行选择;For each attribute item, trigger the voice output device to broadcast third information corresponding to the attribute item, where the third information is used to request the user to select at least one attribute value corresponding to the attribute item;
解析所述语音输入设备接收到的针对所述第三信息的反馈信息,得到所述属性项对应的流通性信息项。Parsing the feedback information for the third information received by the voice input device to obtain the liquidity information item corresponding to the attribute item.
在一示例性实施方式中,所述流通性信息包括至少一个流通性信息项,所述接收所述流通性信息,包括:In an exemplary embodiment, the liquidity information includes at least one item of liquidity information, and the receiving the liquidity information includes:
接收第二目标语音,对所述第二目标语音进行解析;receiving a second target voice, and analyzing the second target voice;
获取所述目标对象对应的至少一个属性项;obtaining at least one attribute item corresponding to the target object;
根据解析结果确定每一所述属性项所对应的内容,根据所述内容得到所述属性项对应的流通性信息项。The content corresponding to each attribute item is determined according to the analysis result, and the liquidity information item corresponding to the attribute item is obtained according to the content.
在一示例性实施方式中,所述在所述工作状态为所述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于所述交互触发信息和所述交互管理信息生成所述目标对象的流通性信息交互内容之前,所述方法还包括:In an exemplary embodiment, when the working state is the voice interaction state, in response to receiving interaction trigger information, generating the interaction trigger information based on the interaction trigger information and the interaction management information. Before interacting with the content of the target object's liquidity information, the method further includes:
在生成所述交互管理信息之后,触发所述语音输出设备播报第四信息,所述第四信息用于请求用户的交互触发参数,所述交互触发参数包括下述至少之一:交互触发方式、交互触发账号、交互触发密码、交互触发口令、交互触发生物信息;After the interaction management information is generated, the voice output device is triggered to broadcast fourth information, where the fourth information is used to request interaction trigger parameters of the user, and the interaction trigger parameters include at least one of the following: interaction trigger mode, Interactive trigger account, interactive trigger password, interactive trigger password, interactive trigger biological information;
根据接收到的所述交互触发参数得到所述交互触发信息。The interaction trigger information is obtained according to the received interaction trigger parameter.
在一示例性实施方式中,所述基于所述交互触发信息和所述交互管理信息生成所述目标对象的流通性信息交互内容,包括:In an exemplary embodiment, the generating, based on the interaction trigger information and the interaction management information, the circulation information interaction content of the target object includes:
触发所述语音输出设备播报第五信息,所述第五信息用于请求用户的地址信息;triggering the voice output device to broadcast fifth information, where the fifth information is used to request the address information of the user;
对接收到的第三目标语音进行解析,得到所述地址信息;Parsing the received third target voice to obtain the address information;
根据所述交互管理信息、所述交互触发信息和所述地址信息,生成所述流通性信息交互内容。The liquidity information interaction content is generated according to the interaction management information, the interaction trigger information and the address information.
在一示例性实施方式中,所述对接收到的第三目标语音进行解析,得到所述地址信息,包括:In an exemplary embodiment, the analyzing the received third target voice to obtain the address information includes:
获取地址模板信息;Get address template information;
将针对所述第三目标语音的解析结果与所述地址模板信息进行匹配,得到所述地址信息。Matching the parsing result for the third target speech with the address template information to obtain the address information.
在一示例性实施方式中,所述响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息之前,所述方法还包括:触发所述语音输出设备播报所述流通性信息中的每一流通性信息项;In an exemplary embodiment, before generating the interaction management information for the target object based on the liquidity information in response to receiving the liquidity information, the method further includes: triggering the voice output device to broadcast the information. each item of liquidity information in the liquidity information;
所述响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息,包括:在接收到针对所述流通性信息中的各所述流通性信息项的确认指令的情况下,基于所述流通性信息生成针对目标对象的交互管理信息。The generating the interaction management information for the target object based on the circulation information in response to the situation of receiving the circulation information includes: receiving a confirmation instruction for each of the circulation information items in the circulation information In the case of , the interaction management information for the target object is generated based on the liquidity information.
在一示例性实施方式中,所述响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息之前,所述方法还包括:In an exemplary embodiment, before generating the interaction management information for the target object based on the liquidity information in response to receiving the liquidity information, the method further includes:
响应于接收到第四目标语音的情况,对所述第四目标语音进行解析;In response to receiving the fourth target voice, parsing the fourth target voice;
根据针对所述第四目标语音的解析结果确定与所述目标对象相关的目标内容,触发所述语音输出设备播报所述目标内容对应的关联信息,所述关联信息用于对所述目标内容进行解释或者给出建议。The target content related to the target object is determined according to the analysis result of the fourth target voice, and the voice output device is triggered to broadcast the associated information corresponding to the target content, where the associated information is used to perform an analysis on the target content. explain or give advice.
根据本公开实施例的第二方面,提供一种直播语音交互装置,包括:According to a second aspect of the embodiments of the present disclosure, a live voice interaction device is provided, including:
切换模块,被配置为执行在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将所述直播应用的工作状态配置为语音交互状态,所述语音交互状态为通过解析语音输入设备接收到的信息确定对象及进行针对所述对象的流通性信息交互的状态;The switching module is configured to perform, in the case that the live broadcast application is opened, in response to receiving the voice interaction mode wake-up information, configure the working state of the live broadcast application as a voice interaction state, and the voice interaction state is through parsing The information received by the voice input device determines the object and the state of the exchange of liquidity information for the object;
语音处理模块,被配置为执行在所述工作状态为所述语音交互状态的情况下,响应于接收到流通性信息的情况,基于所述流通性信息生成针对目标对象的交互管理信息;以及,在所述工作状态为所述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于所述交互触发信息和所述交互管理信息生成所述目标对象的流通性信息交互内容。a voice processing module configured to perform, in response to receiving the liquidity information, generating interaction management information for the target object based on the liquidity information when the working state is the voice interaction state; and, When the working state is the voice interaction state, in response to receiving the interaction trigger information, the interaction content of the liquidity information of the target object is generated based on the interaction trigger information and the interaction management information.
在一示例性实施方式中,所述语音处理模块,被配置为执行在所述工作状态为所述语音交互状态的情况下,维持所述语音输入设备的唤醒状态。In an exemplary embodiment, the voice processing module is configured to maintain the wake-up state of the voice input device when the working state is the voice interaction state.
在一示例性实施方式中,所述切换模块,被配置为执行在所述工作状态为所述语音交互状态的情况下,响应于接收到语音交互模式关闭信息的情况,将所述工作状态配置为字符交互状态,并将所述语音输入设备配置为非唤醒状态,所述字符交互状态为通过字符输入设备接收到的信息确定对象及进行针对所述对象的流通性信息交互的状态。In an exemplary embodiment, the switching module is configured to perform, in the case that the working state is the voice interaction state, in response to receiving the voice interaction mode closing information, configure the working state. is a character interaction state, and the voice input device is configured to be in a non-awakening state, where the character interaction state is a state in which an object is determined through information received by the character input device and a liquidity information interaction for the object is performed.
在一示例性实施方式中,所述切换模块,被配置为执行:In an exemplary embodiment, the switching module is configured to perform:
响应于所述直播应用被打开的情况,触发语音输出设备播报第一信息并且唤醒所述语音输入设备,所述第一信息用于询问用户是否将所述直播应用配置为所述语音交互状态;In response to the fact that the live broadcast application is opened, trigger the voice output device to broadcast first information and wake up the voice input device, where the first information is used to ask the user whether to configure the live broadcast application to the voice interaction state;
解析所述语音输入设备接收到的针对所述第一信息的反馈信息,得到第一解析结果;Parsing the feedback information for the first information received by the voice input device to obtain a first parsing result;
在所述第一解析结果表征所述用户存在语音交互意愿的情况下,判定接收到所述语音交互模式唤醒信息。In the case that the first analysis result indicates that the user has voice interaction willingness, it is determined that the voice interaction mode wake-up information is received.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
响应于直播过程进行到所述目标对象对应的流通性信息交互时段的情况,触发所述语音输出设备播报第二信息,所述第二信息用于询问用户是否关联所述目标对象;In response to a situation in which the live broadcast process reaches the liquidity information interaction period corresponding to the target object, triggering the voice output device to broadcast second information, where the second information is used to ask the user whether to associate with the target object;
解析所述语音输入设备接收到的针对所述第二信息的反馈信息,得到第二解析结果;Parsing the feedback information for the second information received by the voice input device to obtain a second parsing result;
在所述第二解析结果表征所述用户存在关联意愿的情况下,接收所述流通性信息。The liquidity information is received when the second analysis result indicates that the user has a willingness to associate.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
响应于在直播过程中接收到第一目标语音的情况,对所述第一目标语音进行解析;In response to receiving the first target voice during the live broadcast, parsing the first target voice;
响应于解析结果中存在所述目标对象对应的信息的情况,接收所述流通性信息。The liquidity information is received in response to the fact that the information corresponding to the target object exists in the parsing result.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
获取所述目标对象对应的至少一个属性项;obtaining at least one attribute item corresponding to the target object;
针对每一属性项,触发所述语音输出设备播报所述属性项对应的第三信息,所述第三信息用于请求用户对所述属性项对应的至少一个属性值进行选择;For each attribute item, trigger the voice output device to broadcast third information corresponding to the attribute item, where the third information is used to request the user to select at least one attribute value corresponding to the attribute item;
解析所述语音输入设备接收到的针对所述第三信息的反馈信息,得到所述属性项对应的流通性信息项。Parsing the feedback information for the third information received by the voice input device to obtain the liquidity information item corresponding to the attribute item.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
接收第二目标语音,对所述第二目标语音进行解析;receiving a second target voice, and analyzing the second target voice;
获取所述目标对象对应的至少一个属性项;obtaining at least one attribute item corresponding to the target object;
根据解析结果确定每一所述属性项所对应的内容,根据所述内容得到所述属性项对应的流通性信息项。The content corresponding to each attribute item is determined according to the analysis result, and the liquidity information item corresponding to the attribute item is obtained according to the content.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
在生成所述交互管理信息之后,触发所述语音输出设备播报第四信息,所述第四信息用于请求用户的交互触发参数,所述交互触发参数包括下述至少之一:交互触发方式、交互触发账号、交互触发密码、交互触发口令、交互触发生物信息;After the interaction management information is generated, the voice output device is triggered to broadcast fourth information, where the fourth information is used to request interaction trigger parameters of the user, and the interaction trigger parameters include at least one of the following: interaction trigger mode, Interactive trigger account, interactive trigger password, interactive trigger password, interactive trigger biological information;
根据接收到的所述交互触发参数得到所述交互触发信息。The interaction trigger information is obtained according to the received interaction trigger parameter.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
触发所述语音输出设备播报第五信息,所述第五信息用于请求用户的地址信息;triggering the voice output device to broadcast fifth information, where the fifth information is used to request the address information of the user;
对接收到的第三目标语音进行解析,得到所述地址信息;Parsing the received third target voice to obtain the address information;
根据所述交互管理信息、所述交互触发信息和所述地址信息,生成所述流通性信息交互内容。The liquidity information interaction content is generated according to the interaction management information, the interaction trigger information and the address information.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
获取地址模板信息;Get address template information;
将针对所述第三目标语音的解析结果与所述地址模板信息进行匹配,得到所述地址信息。Matching the parsing result for the third target speech with the address template information to obtain the address information.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
触发所述语音输出设备播报所述流通性信息中的每一流通性信息项;triggering the voice output device to broadcast each item of liquidity information in the liquidity information;
在接收到针对所述流通性信息中的各所述流通性信息项的确认指令的情况下,基于所述流通性信息生成针对目标对象的交互管理信息。In the case of receiving a confirmation instruction for each of the liquidity information items in the liquidity information, the interaction management information for the target object is generated based on the liquidity information.
在一示例性实施方式中,所述语音处理模块,被配置为执行:In an exemplary embodiment, the speech processing module is configured to perform:
响应于接收到第四目标语音的情况,对所述第四目标语音进行解析;In response to receiving the fourth target voice, parsing the fourth target voice;
根据针对所述第四目标语音的解析结果确定与所述目标对象相关的目标内容,触发所述语音输出设备播报所述目标内容对应的关联信息,所述关联信息用于对所述目标内容进行解释或者给出建议。The target content related to the target object is determined according to the analysis result of the fourth target voice, and the voice output device is triggered to broadcast the associated information corresponding to the target content, where the associated information is used to perform an analysis on the target content. explain or give advice.
根据本公开实施例的第三方面,提供一种电子设备,包括:处理器;用于存储处理器可执行指令的存储器;其中,处理器被配置为执行指令,以实现如上述第一方面中任一项的方法。According to a third aspect of the embodiments of the present disclosure, there is provided an electronic device, comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions to implement the first aspect as described above any of the methods.
根据本公开实施例的第四方面,提供一种计算机可读存储介质,当计算机可读存储介质中的指令由电子设备的处理器执行时,使得电子设备能够执行本公开实施例的第一方面中任一项的方法。According to a fourth aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, when instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device can execute the first aspect of the embodiments of the present disclosure any of the methods.
根据本公开实施例的第五方面,提供一种计算机程序产品,计算机程序产品包括计算机程序,计算机程序存储在可读存储介质中,计算机设备的至少一个处理器从可读存储介质读取并执行计算机程序,使得计算机设备执行本公开实施例的第一方面中任一项的方法。According to a fifth aspect of the embodiments of the present disclosure, there is provided a computer program product, the computer program product includes a computer program, the computer program is stored in a readable storage medium, and at least one processor of the computer device reads and executes the computer program from the readable storage medium A computer program that causes a computer device to perform the method of any one of the first aspects of the embodiments of the present disclosure.
本公开的实施例提供的技术方案至少带来以下有益效果:The technical solutions provided by the embodiments of the present disclosure bring at least the following beneficial effects:
本公开实施例通过全程语音的方式完成了流通性信息获取、交互管理信息生成、交互触发以及流通性信息交互内容生成等内容,无需用眼,显著扩展了应用场景。以电商场景为例,在不需要用户手动输入任何信息的情况下,单凭语音交互的方式即可在直播间完成购物。这一技术方案并不需要用户用眼,而是单凭听觉和发出语音指令的方式即可在直播间中完成购物,十分适合于需要休息眼睛或者视力不良的用户。目前失明或者视力障碍的人群规模呈现日益增长的趋势,还有一些视力不佳的老人,因为生活不便,这些用户线上购物需求或许更强烈,本公开实施例支持在直播购物的时候通过语音购物可以很好的满足这类用户的具体需求。The embodiment of the present disclosure completes the acquisition of liquidity information, the generation of interactive management information, the interactive triggering, and the generation of the interactive content of liquidity information through the whole process of voice, which does not require eyesight, and significantly expands application scenarios. Taking the e-commerce scenario as an example, without the need for users to manually input any information, shopping can be completed in the live broadcast room only by means of voice interaction. This technical solution does not require users to use their eyes, but can complete shopping in the live broadcast room only by hearing and issuing voice commands, which is very suitable for users who need to rest their eyes or have poor eyesight. Currently, the number of people who are blind or visually impaired is increasing, and there are some elderly people with poor eyesight. Because of their inconvenience, these users may have a stronger demand for online shopping. The embodiment of the present disclosure supports shopping by voice during live shopping. It can well meet the specific needs of such users.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本公开。It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the present disclosure.
附图说明Description of drawings
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本公开的实施例,并与说明书一起用于解释本公开的原理,并不构成对本公开的不当限定。The accompanying drawings, which are incorporated into and constitute a part of this specification, illustrate embodiments consistent with the present disclosure, and together with the description, serve to explain the principles of the present disclosure and do not unduly limit the present disclosure.
图1是根据一示例性实施例示出的一种直播语音交互方法的实施环境示意图。FIG. 1 is a schematic diagram of an implementation environment of a live voice interaction method according to an exemplary embodiment.
图2是根据一示例性实施例示出的一种直播语音交互方法的流程图;FIG. 2 is a flowchart of a method for live voice interaction according to an exemplary embodiment;
图3是根据一示例性实施例示出的直播语音交互方法的一个场景示意图;3 is a schematic diagram of a scenario of a live voice interaction method according to an exemplary embodiment;
图4是根据一示例性实施例示出的根据图2示例所得到的交互管理信息的流通性信息请求确认场景的示意图;FIG. 4 is a schematic diagram showing a scenario of request confirmation of the liquidity information of the interaction management information obtained according to the example of FIG. 2 according to an exemplary embodiment;
图5根据一示例性实施例示出的直播语音交互方法的另一个场景示意图;5 is a schematic diagram of another scenario of a live voice interaction method according to an exemplary embodiment;
图6根据一示例性实施例示出的地址信息适配方式示意图;6 is a schematic diagram of an address information adaptation manner according to an exemplary embodiment;
图7根据一示例性实施例示出的一个具体的语音交互场景流程图;FIG. 7 shows a flow chart of a specific voice interaction scenario according to an exemplary embodiment;
图8根据一示例性实施例示出的一种直播语音交互装置框图;Fig. 8 shows a block diagram of a live voice interaction apparatus according to an exemplary embodiment;
图9是根据一示例性实施例示出的一种电子设备的框图。Fig. 9 is a block diagram of an electronic device according to an exemplary embodiment.
具体实施方式Detailed ways
为了使本领域普通人员更好地理解本公开的技术方案,下面将结合附图,对本公开实施例中的技术方案进行清楚、完整地描述。In order to make those skilled in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的第一对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本公开的实施例能够以除了在这里图示或描述的那些以外的顺序实施。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and the above-mentioned drawings are used to distinguish similar first objects, and are not necessarily used to describe a specific order or sequence. order. It is to be understood that the data so used may be interchanged under appropriate circumstances such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure as recited in the appended claims.
本公开所涉及的各种信息均为经用户授权或者经过各方充分授权的信息。All kinds of information involved in this disclosure are information authorized by the user or fully authorized by all parties.
图1是根据一示例性实施例示出的一种直播语音交互方法的实施环境示意图。以电子设备被提供为终端为例,参见图1,该实施环境具体包括:终端101和服务器102。FIG. 1 is a schematic diagram of an implementation environment of a live voice interaction method according to an exemplary embodiment. Taking the electronic device provided as a terminal as an example, referring to FIG. 1 , the implementation environment specifically includes: a terminal 101 and a
终端101可以为智能手机、智能手表、台式电脑、手提电脑和膝上型便携计算机等设备中的至少一种。终端101上可以安装并运行有提供直播服务的应用程序,用户可以通过终端101登录该应用程序来获取该应用程序提供的直播服务,以及在使用该直播服务的过程中参与电商购物。终端101可以泛指多个终端中的一个,本实施例仅以终端101来举例说明。本领域技术人员可以知晓,上述终端的数量可以更多或更少。比如上述终端可以仅为几个,或者上述终端为几十个或几百个,或者更多数量,本公开实施例对终端的数量和设备类型均不加以限定。The terminal 101 may be at least one of devices such as a smart phone, a smart watch, a desktop computer, a laptop computer, and a laptop portable computer. The terminal 101 can install and run an application program that provides a live broadcast service, and the user can log in to the application program through the terminal 101 to obtain the live broadcast service provided by the application program, and participate in e-commerce shopping in the process of using the live broadcast service. The terminal 101 may generally refer to one of multiple terminals, and this embodiment only takes the terminal 101 as an example for illustration. Those skilled in the art may know that the number of the above-mentioned terminals may be more or less. For example, the above-mentioned terminals may be only a few, or the above-mentioned terminals may be dozens or hundreds, or more, and the embodiments of the present disclosure do not limit the number of terminals and device types.
服务器102可以为一台服务器、多台服务器、云计算平台和虚拟化中心中的至少一种。服务器102可以通过无线网络或有线网络与终端101和其他终端相连,服务器102可以接收终端101发送的语音信息,并根据接收到的语音信息为终端101提供直播以及电商购物服务。当然,服务器102还可以包括其他功能服务器,以便提供更全面且多样化的服务。图2是根据一示例性实施例示出的一种直播语音交互方法的流程图,如图2所示,上述方法至少包括以下步骤S10-S30。The
在步骤S10中,在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将上述直播应用的工作状态配置为语音交互状态,上述语音交互状态为通过解析语音输入设备接收到的信息确定对象及进行针对上述对象的流通性信息交互的状态。In step S10, when the live broadcast application is opened, in response to receiving the voice interaction mode wake-up information, the working state of the live broadcast application is configured as a voice interaction state, and the voice interaction state is received by parsing the voice input device. The received information determines the object and the status of the exchange of liquidity information for the above object.
本公开实施例并不对对象、流通性信息交互进行限定,其根据场景不同可以有对应的含义,比如,在电商场景下,对象可以指代电商交易的物品,流通性信息交互可以指针对该物品的交易。The embodiments of the present disclosure do not limit the interaction of objects and liquidity information, which may have corresponding meanings according to different scenarios. For example, in an e-commerce scenario, an object may refer to an item traded by e-commerce, and an exchange of liquidity information may refer to an object transaction for this item.
本公开并不对直播应用进行限定,可以在直播应用被打开的情况下接收到语音交互模式唤醒信息,这样用户访问任意直播间的时候都可以基于语音进行电商购物。也可以在直播应用被打开并且用户进入某个具体的直播间的情况下接收到语音交互模式唤醒信息,在接收到该语音交互模式唤醒信息后可以进入语音交互状态,这一状态下语音输入设备作为唯一的用于接收用户输入的信息的设备,直播应用通过解析语音输入设备接收到的信息确定对象及进行针对上述对象的流通性信息交互。本公开实施例中通过设置语音交互状态可以在该状态下为用户提供全程的基于语音进行交互的直播电商服务。本公开实施例并不限定语音输入设备的具体类型,其可以为各种麦克风或者其他能够录入语音的设备。The present disclosure does not limit the live broadcast application, and the voice interaction mode wake-up information can be received when the live broadcast application is opened, so that the user can conduct e-commerce shopping based on voice when accessing any live broadcast room. It is also possible to receive the voice interaction mode wake-up information when the live application is opened and the user enters a specific live room. After receiving the voice interaction mode wake-up information, the voice interaction state can be entered. In this state, the voice input device As the only device for receiving the information input by the user, the live application determines the object by parsing the information received by the voice input device and performs the exchange of liquidity information for the above object. In the embodiment of the present disclosure, by setting the voice interaction state, the user can be provided with a whole-process voice-based interactive live broadcast e-commerce service in this state. The embodiments of the present disclosure do not limit the specific type of the voice input device, which may be various microphones or other devices capable of inputting voice.
本公开实施例中,在上述工作状态为上述语音交互状态的情况下,维持上述语音输入设备的唤醒状态。也就是说,当直播应用的工作状态被配置为语音交互状态的情况下,全程语音输入设备处于唤醒状态,用户可以通过语音的方式参与到直播间的电商活动中来,比如,可以通过语音的方式下单、选择对象,触发流通性信息交互等。In the embodiment of the present disclosure, when the working state is the voice interaction state, the wake-up state of the voice input device is maintained. That is to say, when the working state of the live broadcast application is configured as the voice interaction state, the voice input device is in the wake-up state, and the user can participate in the e-commerce activities in the live broadcast room through voice. order, select objects, trigger liquidity information interaction, etc.
在步骤S20中,在上述工作状态为上述语音交互状态的情况下,响应于接收到流通性信息的情况,基于上述流通性信息生成针对目标对象的交互管理信息。In step S20, when the working state is the voice interaction state, in response to receiving the liquidity information, the interaction management information for the target object is generated based on the liquidity information.
本公开实施例并不限定流通性信息,在电子商务场景下,其可以指针对物品的订单信息。本公开实施例并不限定其具体内容,其可以包括即将进行流通性信息交互的目标对象的各种信息,比如,该目标对象的类型、尺寸、颜色、价格、数量、运输方式等。该目标对象可以被理解为直播间中上架的任一对象。通过语音输入设备可以获取到用户的语音,根据接收到的语音即可得到该流通性信息。The embodiments of the present disclosure do not limit the liquidity information, and in an e-commerce scenario, it may refer to order information for items. The embodiment of the present disclosure does not limit its specific content, which may include various information of the target object to be exchanged with liquidity information, such as the type, size, color, price, quantity, transportation method, etc. of the target object. The target object can be understood as any object listed in the live broadcast room. The user's voice can be acquired through the voice input device, and the liquidity information can be obtained according to the received voice.
本公开实施例中并不限定交互管理信息,其可以指代电子商务场景下的物品订单,具体来说,交互管理信息指的是获取到的与目标对象有关的信息所形成的信息集合,基于该交互管理信息,再结合用户的地址信息和流通性信息即可进行流通性信息交互进而得到流通性信息交互内容,该流通性信息交互内容为完成目标对象的流通性信息交互的凭证,基于该流通性信息交互内容还可以实施收货操作、退款操作、维权操作、物流查询等其他电商环节所需的操作。The embodiment of the present disclosure does not limit the interaction management information, which may refer to an item order in an e-commerce scenario. Specifically, the interaction management information refers to an information collection formed by acquired information related to the target The interactive management information, combined with the user's address information and liquidity information, can be used for the exchange of liquidity information to obtain the content of the exchange of liquidity information. The interactive content of liquidity information can also implement operations required by other e-commerce links such as receipt operations, refund operations, rights protection operations, and logistics inquiries.
在步骤S30中,在上述工作状态为上述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于上述交互触发信息和上述交互管理信息生成上述目标对象的流通性信息交互内容。In step S30, when the working state is the voice interaction state, in response to receiving the interaction trigger information, the liquidity information interaction content of the target object is generated based on the interaction trigger information and the interaction management information.
本公开实施例并不限定交互触发信息的具体含义,以电商场景为例,其可以指代为了购买电商物品所产生的支付信息,具体来说,交互触发信息可以被理解为用户为该交互管理信息中的目标对象进行交互触发时所需要的信息。当然,本公开实施例也不限定流通性信息交互内容的具体含义,在电商场景下,其可以表示针对目标对象生成的交易订单。The embodiment of the present disclosure does not limit the specific meaning of the interaction trigger information. Taking an e-commerce scenario as an example, it can refer to payment information generated for purchasing an e-commerce item. Specifically, the interaction trigger information can be understood as the user is the The information required by the target object in the interaction management information to trigger interaction. Of course, the embodiment of the present disclosure does not limit the specific meaning of the interactive content of liquidity information. In an e-commerce scenario, it may represent a transaction order generated for a target object.
在获取到交互触发信息后可以对上述交互管理信息进行交互触发操作,并在交互触发成功后生成流通性信息交互内容,该流通性信息交互内容就是流通性信息交互得到的凭证。至此,通过全程语音的方式完成了流通性信息获取、交互管理信息生成、交互触发以及流通性信息交互内容生成等内容,以电商场景为例,在不需要用户手动输入任何信息的情况下,单凭语音交互的方式即可在直播间完成购物。这一技术方案并不需要用户用眼,而是单凭听觉和发出语音指令的方式即可在直播间中完成购物,十分适合于需要休息眼睛或者视力不良的用户。目前失明或者视力障碍的人群规模呈现日益增长的趋势,还有一些视力不佳的老人,因为生活不便,这些用户线上购物需求或许更强烈,本公开实施例支持在直播购物的时候通过语音购物,可以很好的满足这类用户的具体需求。After the interaction triggering information is obtained, the interaction triggering operation can be performed on the above interaction management information, and after the interaction triggering is successful, the circulation information interaction content is generated, and the circulation information interaction content is the certificate obtained by the circulation information interaction. So far, the acquisition of liquidity information, the generation of interactive management information, the interaction trigger, and the generation of liquidity information interactive content have been completed through the whole process of voice. Taking the e-commerce scenario as an example, without the need for users to manually input any information, Shopping in the live broadcast room can be completed by voice interaction alone. This technical solution does not require users to use their eyes, but can complete shopping in the live broadcast room only by hearing and issuing voice commands, which is very suitable for users who need to rest their eyes or have poor eyesight. Currently, the number of people who are blind or visually impaired is increasing, and there are some elderly people with poor eyesight. Because of their inconvenience, these users may have a stronger demand for online shopping. The embodiment of the present disclosure supports shopping by voice during live shopping. , which can well meet the specific needs of such users.
在一个示例性的实施方式中,在上述工作状态为上述语音交互状态的情况下,响应于接收到语音交互模式关闭信息的情况,将上述工作状态配置为字符交互状态,并将上述语音输入设备配置为非唤醒状态,上述字符交互状态为通过字符输入设备接收到的信息确定对象及进行针对上述对象的流通性信息交互的状态。In an exemplary embodiment, when the working state is the voice interaction state, in response to receiving the voice interaction mode closing information, the working state is configured as a character interaction state, and the voice input device is configured as a character interaction state. It is configured in a non-awakening state, and the above character interaction state is a state in which an object is determined through information received by a character input device and a liquidity information interaction for the above object is performed.
也就是说,在上述工作状态为上述语音交互状态的情况下,用户可以通过发出语音交互模式关闭信息的方式将工作状态切换为字符交互状态,字符交互状态可以被理解为普通用户经常使用的购物状态,这一状态下可以通过键盘、鼠标、触摸屏等字符输入设备输入信息,从而完成对象的选择以及对象的流通性信息交互。也就是说,本公开实施例提供语音模式的电商购物方案,同时也提供普通的电商购物方案,通过兼容两种购物方案并允许用户随意切换,充分满足不同类型的用户的不同购物需求。That is to say, when the above working state is the above voice interaction state, the user can switch the working state to the character interaction state by sending out the voice interaction mode closing message, and the character interaction state can be understood as the shopping frequently used by ordinary users. In this state, information can be input through character input devices such as keyboard, mouse, touch screen, etc., so as to complete the selection of objects and the exchange of information on the circulation of objects. That is to say, the embodiment of the present disclosure provides an e-commerce shopping solution in voice mode, and also provides a common e-commerce shopping solution, which fully meets different shopping needs of different types of users by being compatible with the two shopping solutions and allowing users to switch at will.
在一个示例性的实施方式中,上述在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将上述直播应用的工作状态配置为语音交互状态之前,上述方法还包括:响应于上述直播应用被打开的情况,触发语音输出设备播报第一信息并且唤醒上述语音输入设备,上述第一信息用于询问用户是否将上述直播应用配置为上述语音交互状态;解析上述语音输入设备接收到的针对上述第一信息的反馈信息,得到第一解析结果;在上述第一解析结果表征上述用户存在语音交互意愿的情况下,判定接收到上述语音交互模式唤醒信息。In an exemplary embodiment, when the live broadcast application is opened, in response to receiving the voice interaction mode wake-up information, before configuring the working state of the live broadcast application to the voice interaction state, the method further includes: In response to the above-mentioned live application being opened, trigger the voice output device to broadcast the first information and wake up the above-mentioned voice input device, and the above-mentioned first information is used to ask the user whether to configure the above-mentioned live application to the above-mentioned voice interaction state; Parse the above-mentioned voice input device A first analysis result is obtained from the received feedback information for the above-mentioned first information; if the above-mentioned first analysis result indicates that the user has a voice interaction willingness, it is determined that the above-mentioned voice interaction mode wake-up information is received.
本公开并不对语音输出设备进行限定,比如,其可以为扬声器,音箱等外设。在直播应用被打开的情况下,可以触发语音输出设备播放一段语音(第一信息),以使得用户能够选择是否将直播应用设置为语音交互状态。比如,该第一信息可以是“是否进入语音直播间”或者“请选择交互模式,如需进入语音直播状态请发出‘语音直播’口令,否则,请发出‘普通直播’口令”。用户听到该第一信息之后可以以语音形式反馈,比如,反馈的语音可以为“是”或者“语音直播”。The present disclosure does not limit the voice output device, for example, it may be a peripheral device such as a speaker and a sound box. When the live broadcast application is opened, the voice output device may be triggered to play a piece of voice (first information), so that the user can choose whether to set the live broadcast application to a voice interaction state. For example, the first information can be "whether to enter the voice live broadcast room" or "please select the interactive mode, if you want to enter the voice live broadcast state, please issue the 'voice live broadcast' password, otherwise, please issue the 'normal live broadcast' password". After hearing the first information, the user may give feedback in the form of voice, for example, the feedback voice may be "yes" or "voice live broadcast".
在第一信息为“是否进入语音直播间”并且用户反馈的语音为“是”的情况下,或者在第一信息为“请选择交互模式,如需进入语音直播状态请发出‘语音直播’口令,否则,请发出‘普通直播’口令”并且用户反馈的语音为“语音直播”的情况下,可以判定用户存在语音交互意愿,因此,可以认为接收到了上述语音交互模式唤醒信息,进而可以自动进入语音交互状态,支持全程基于语音的方式为用户提供电商购物体验。这一实施方式下,可以及时为用户提供触发直播应用进入语音交互状态的入口,快速进入语音交互状态,为视力不佳的用户提供更好的体验。When the first information is "whether to enter the voice live broadcast room" and the voice feedback from the user is "Yes", or when the first information is "Please select the interactive mode, if you want to enter the voice live broadcast state, please issue the 'voice live broadcast' password , otherwise, please issue the 'normal live broadcast' password" and the voice feedback from the user is "voice live broadcast", it can be determined that the user has the voice interaction willingness, therefore, it can be considered that the above voice interaction mode wake-up information is received, and then the user can automatically enter the Voice interaction status, support the whole voice-based way to provide users with an e-commerce shopping experience. In this implementation manner, the user can be provided with an entrance to trigger the live application to enter the voice interaction state in time, and the voice interaction state can be quickly entered, thereby providing a better experience for users with poor eyesight.
在一示例性的实施方式中,上述在上述工作状态为上述语音交互状态的情况下,响应于接收到流通性信息的情况,基于上述流通性信息生成针对目标对象的交互管理信息之前,上述方法还包括:响应于直播过程进行到上述目标对象对应的流通性信息交互时段的情况,触发上述语音输出设备播报第二信息,上述第二信息用于询问用户是否关联上述目标对象;解析上述语音输入设备接收到的针对上述第二信息的反馈信息,得到第二解析结果;在上述第二解析结果表征上述用户存在关联意愿的情况下,接收上述流通性信息。具体来说,在电商场景下,用户与目标对象关联,意味着用户购买该目标对象。In an exemplary embodiment, when the working state is the voice interaction state, in response to receiving the liquidity information, before the interaction management information for the target object is generated based on the liquidity information, the above method is performed. It also includes: triggering the voice output device to broadcast second information in response to the live broadcast process reaching the liquidity information interaction period corresponding to the target object, and the second information is used to inquire whether the user is associated with the target object; parsing the voice input The device receives the feedback information for the second information, and obtains a second analysis result; and in the case that the second analysis result represents the user's willingness to associate, the device receives the liquidity information. Specifically, in an e-commerce scenario, a user is associated with a target object, which means that the user purchases the target object.
随着直播的进行,主播可能依次上架很多对象,每一对象上架的时候可以为用户留出一段时间为该对象下单关联,直播应用可以在这个时间段触发上述语音输出设备播报一段内容(第二信息),以询问用户是否下单关联该对象。比如,主播上架了一款电动牙刷并提示观众可以下单,在这个情况下,可以播报第二信息“当前主播上架电动牙刷,请问是否关联”,如果用户反馈的语音信息为“愿意”、“我买”或者“我要”等内容,可以判定用户存在关联意愿,这情况可以继续与该用户进行语音交互,以获取后续的流通性信息。这一方案可以伴随直播过程,主动针对每一上架对象询问用户是否有关联需求,为用户及时提供下单的契机,确保用户不会因为看不见、看不清或者不便使用鼠标键盘等原因错失下单机会,提升用户的购物体验。As the live broadcast progresses, the host may list many objects in turn. When each object is listed, a period of time can be set aside for the user to place an order for the object. The live broadcast application can trigger the above-mentioned voice output device to broadcast a piece of content during this time period. two information) to ask the user whether to place an order to associate the object. For example, the anchor has put an electric toothbrush on the shelf and prompted the audience to place an order. In this case, the second message "The current anchor has an electric toothbrush on the shelf, may I ask if it is related?", if the user feedback voice information is "Yes", " Contents such as "I buy" or "I want" can determine the user's willingness to associate, and in this case, you can continue to conduct voice interaction with the user to obtain subsequent liquidity information. This solution can accompany the live broadcast process, and actively ask the user whether there is a related demand for each listed object, providing users with an opportunity to place an order in a timely manner, to ensure that users will not miss because of invisible, unclear or inconvenient to use the mouse and keyboard. A single opportunity to enhance the user's shopping experience.
在另一示例性的实施方式中,上述在上述工作状态为上述语音交互状态的情况下,响应于接收到流通性信息的情况,基于上述流通性信息生成针对目标对象的交互管理信息之前,上述方法还包括:响应于在直播过程中接收到第一目标语音的情况,对上述第一目标语音进行解析;响应于解析结果中存在上述目标对象对应的信息的情况,接收上述流通性信息。In another exemplary embodiment, when the working state is the voice interaction state, in response to receiving the liquidity information, before the interaction management information for the target object is generated based on the liquidity information, the above The method further includes: in response to receiving the first target voice during the live broadcast, parsing the first target voice; in response to the analysis result having information corresponding to the target object, receiving the liquidity information.
当然,在直播过程的任何时间点,用户都可以就本次直播过程中上架的对象触发下单流程,也就是说,用户在任何时刻都可以输出用于下单的语音(第一目标语音),比如,当前直播间一共会上架“风衣”“口红”“风扇”“饮料”四种对象,在直播过程的任意时刻,用户都可以针对上述四种对象进行下单,比如,可以输出“我想买风衣”、“我要风扇”等即可触发直播应用与用户进行语音交互,进而得到具体的流通性信息。Of course, at any point in the live broadcast process, the user can trigger the ordering process for the objects listed in the live broadcast process, that is to say, the user can output the voice for placing the order (the first target voice) at any time. , for example, the current live broadcast room has a total of four types of objects: "windbreaker", "lipstick", "fan" and "drink". At any time during the live broadcast, users can place an order for the above four objects. For example, you can output "I Want to buy a windbreaker”, “I want a fan”, etc. can trigger the live broadcast application to interact with the user by voice, and then obtain specific liquidity information.
本公开实施例并不限定第一目标语音的具体内容,比如,如果判定接收到的语音中存在直播间中上架的对象以及表达用户关联意愿的词时,即可判定接收到第一目标语音。或者如果接收到的语音中包含预设指令词的时候,比如“下单”,可以触发语音输出设备播放对应的回应“请下单”,在该回应被播放之后,再次经由语音输入设备接收到的语音,即可被认为是第一目标语音。这一方案支持用户在直播过程中的任一时刻下单购物,进一步提升购物体验。The embodiment of the present disclosure does not limit the specific content of the first target voice. For example, if it is determined that there are objects listed in the live broadcast room and words expressing the user's associated wishes in the received voice, it can be determined that the first target voice is received. Or if the received voice contains preset command words, such as "place an order", you can trigger the voice output device to play the corresponding response "please place an order", and after the response is played, it will be received again through the voice input device. The voice can be regarded as the first target voice. This solution allows users to place an order for shopping at any time during the live broadcast, further enhancing the shopping experience.
在一示例性的实施方式中,上述流通性信息包括至少一个流通性信息项,上述接收上述流通性信息,包括:获取上述目标对象对应的至少一个属性项;针对每一属性项,触发上述语音输出设备播报上述属性项对应的第三信息,上述第三信息用于请求用户对上述属性项对应的至少一个属性值进行选择;解析上述语音输入设备接收到的针对上述第三信息的反馈信息,得到上述属性项对应的流通性信息项。In an exemplary embodiment, the liquidity information includes at least one liquidity information item, and the receiving the liquidity information includes: acquiring at least one attribute item corresponding to the target object; for each attribute item, triggering the voice The output device broadcasts the third information corresponding to the above-mentioned attribute item, and the above-mentioned third information is used to request the user to select at least one attribute value corresponding to the above-mentioned attribute item; parses the feedback information for the above-mentioned third information received by the above-mentioned voice input device, Obtain the liquidity information item corresponding to the above attribute item.
举个例子,如果用户关联风衣,风衣这一对象至少包括四个属性项,分别为尺码、颜色、数量和版型,相应的,需要填写的流通性信息项至少包括风衣尺码、风衣颜色、风衣数量、风衣版型。可以通过逐项交互的方式引导用户对上述四个属性的具体属性值进行选择,并根据选择结果得到对应的流通性信息项,进而得到总的流通性信息。For example, if the user associates a windbreaker, the windbreaker object includes at least four attribute items, namely size, color, quantity and version. Correspondingly, the liquidity information items that need to be filled in include at least windbreaker size, windbreaker color, windbreaker Quantity, windbreaker version. The user can be guided to select the specific attribute values of the above-mentioned four attributes in a way of item-by-item interaction, and the corresponding liquidity information items can be obtained according to the selection results, and then the total liquidity information can be obtained.
如图3所示,语音输出设备可以播报“请选择风衣的尺码,S、M、L”并等待用户输入,在语音输入设备接收到的语音“M”后,将“尺码”的具体属性值确定为“M”。然后,语音输出设备可以播报“请选择风衣的颜色,黑、灰、白”并等待用户输入,在语音输入设备接收到的语音“黑”后,将“颜色”的具体属性值确定为“黑”。然后,语音输出设备可以播报“请确定风衣的数量”并等待用户输入,在语音输入设备接收到的语音“2”后,将“数量”的具体属性值确定为“2”。然后,语音输出设备可以播报“请选择风衣的版型,宽松、修身”并等待用户输入,在语音输入设备接收到的语音“修身”后,将“版型”的具体属性值确定为“修身”。As shown in Figure 3, the voice output device can broadcast "Please select the size of the windbreaker, S, M, L" and wait for the user's input. After the voice input device receives the voice "M", the specific attribute value of "size" Determined to be "M". Then, the voice output device can broadcast "Please select the color of the windbreaker, black, gray, white" and wait for the user's input. After the voice input device receives the voice "black", the specific attribute value of "color" is determined as "black" ". Then, the voice output device can broadcast "Please determine the number of windbreakers" and wait for the user's input. After the voice input device receives the voice "2", the specific attribute value of "quantity" is determined as "2". Then, the voice output device can broadcast "Please select the style of the windbreaker, loose, slim fit" and wait for the user's input. After the voice "Slim fit" received by the voice input device, the specific attribute value of the "fit" is determined as "Slim fit" ".
请参考图4,其示出根据图3示例所得到的交互管理信息的流通性信息请求确认场景的示意图。在图3的示例中,可以得到目标对象对应的流通性信息:Please refer to FIG. 4 , which shows a schematic diagram of a scenario of request confirmation of the liquidity information of the interaction management information obtained according to the example of FIG. 3 . In the example of Figure 3, the liquidity information corresponding to the target object can be obtained:
目标对象:风衣;Target object: windbreaker;
风衣尺码:M;Windbreaker size: M;
风衣颜色:黑;Windbreaker color: black;
风衣数量:2;Number of windbreakers: 2;
风衣版型:修身。Windbreaker Fit: Slim fit.
可以通过播报这些流通性信息请求确认,在用户确认后根据这些流通性信息可以生成针对目标对象—风衣的交互管理信息。这一实施方式中通过逐项引导的方式确定用户对每一属性项的属性值的确定结果,不遗漏重要的属性项的内容,从而得到充分满足用户意愿的交互管理信息,确保下单过程不出现错误,提升用户的购物体验。Confirmation can be requested by broadcasting the circulation information, and after confirmation by the user, interactive management information for the target object—the windbreaker can be generated according to the circulation information. In this embodiment, the user's determination result of the attribute value of each attribute item is determined in a way of item-by-item guidance, and the content of important attribute items is not omitted, so as to obtain interactive management information that fully meets the user's wishes and ensure that the ordering process does not fail. Errors occur to improve the user's shopping experience.
在另一实施方式中,上述流通性信息包括至少一个流通性信息项,上述接收上述流通性信息,包括:接收第二目标语音,对上述第二目标语音进行解析;获取上述目标对象对应的至少一个属性项;根据解析结果确定每一上述属性项所对应的内容,根据上述内容得到上述属性项对应的流通性信息项。In another embodiment, the liquidity information includes at least one item of liquidity information, and the receiving the liquidity information includes: receiving a second target voice, and analyzing the second target voice; acquiring at least one item corresponding to the target object An attribute item; the content corresponding to each of the above attribute items is determined according to the analysis result, and the liquidity information item corresponding to the above attribute item is obtained according to the above content.
在这一实施方式中,用户可以在进入下单状态后,发出一段语音(第二目标语音),下单状态可以通过前文中发出的第一目标语音的方式进入。该第二目标语音中可以包括与目标对象的一个或多个属性项有关的内容,通过对该第二目标语音进行解析,可以得到相关属性项所对应的内容。请参考图5,在进入下单状态后,用户输出了一段语音(第二目标语音),该第二目标语音可以为“我想买黑色的修身风衣”,则可以确定流通性信息中的下述三个内容:In this embodiment, after entering the ordering state, the user can send out a piece of voice (the second target voice), and the ordering state can be entered by means of the first target voice sent out above. The second target voice may include content related to one or more attribute items of the target object, and by analyzing the second target voice, content corresponding to the relevant attribute items may be obtained. Please refer to Figure 5, after entering the order state, the user outputs a voice (second target voice), the second target voice can be "I want to buy a black slim fit windbreaker", then it can be determined that the next target voice in the liquidity information three things:
目标对象:风衣;Target object: windbreaker;
风衣颜色:黑;Windbreaker color: black;
风衣版型:修身。Windbreaker Fit: Slim fit.
结合前文可知,流通性信息内容还不足,这种情况下可以对用户进行语音提示,引导用户再次发出新的第二目标语音,比如触发语音输出设备播放“请继续说出风衣的数量和尺码”,用户继续发出新的第二目标语音“我想买两个M号的风衣”,这情况下可以确定订单中的下述两个内容:Combining the above, it can be seen that the content of the circulation information is not enough. In this case, a voice prompt can be given to the user to guide the user to make a new second target voice again, such as triggering the voice output device to play "Please continue to say the number and size of the windbreaker". , the user continues to issue a new second target voice "I want to buy two windbreakers of size M", in this case, the following two contents in the order can be determined:
风衣尺码:M;Windbreaker size: M;
风衣数量:2。Number of trench coats: 2.
至此,流通性信息就齐全了。这一实施方式可以通过一次或有限几次的语音交互快速确定流通性信息的全部内容,交互效率高,下单速度快,并且不会发生信息遗落或者错误。At this point, the liquidity information is complete. In this embodiment, the entire content of the liquidity information can be quickly determined through one or a limited number of voice interactions, the interaction efficiency is high, the ordering speed is fast, and information loss or error will not occur.
前文中给出了至少两种获取流通性信息的方式,获取流通性信息中流通性信息项的过程中,语音输出设备还可以就确定出的每个流通性信息项请求用户确认,比如,在获取到完备的流通性信息后可以触发语音输出设备播报流通性信息的具体内容“请确认下述内容:目标对象:风衣;风衣尺码:M;风衣颜色:黑;风衣数量:2;风衣版型:修身。”如果用户反馈“均正确”则可以生成交互管理信息。如果用户反馈存在错误的流通性信息项,则可以继续与用户进行语音交互直至流通性信息全部正确为止。也就是说,本公开实施例中在接收到针对上述流通性信息中的各上述流通性信息项的确认指令的情况下,基于上述流通性信息生成针对目标对象的交互管理信息。通过与用户确认流通性信息的各项内容,确保生成的交互管理信息符合用户预期,降低误下单的概率,提升用户体验。At least two ways to obtain liquidity information are given in the preceding paragraph. In the process of obtaining the liquidity information items in the liquidity information, the voice output device can also request the user to confirm each determined liquidity information item. After obtaining complete liquidity information, the voice output device can be triggered to broadcast the specific content of the liquidity information "Please confirm the following: target object: windbreaker; windbreaker size: M; windbreaker color: black; windbreaker quantity: 2; windbreaker version : Self-cultivation." If the user feedback "all correct", interactive management information can be generated. If the user feedbacks that there is an incorrect item of liquidity information, the user can continue to perform voice interaction until the liquidity information is all correct. That is, in the embodiment of the present disclosure, when a confirmation instruction for each of the above-mentioned liquidity information items in the above-mentioned liquidity information is received, the interaction management information for the target object is generated based on the above-mentioned liquidity information. By confirming the contents of the liquidity information with the user, it ensures that the generated interactive management information meets the user's expectations, reduces the probability of mistakenly placing an order, and improves the user experience.
在一个实施例中,上述在上述工作状态为上述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于上述交互触发信息和上述交互管理信息生成上述目标对象的流通性信息交互内容之前,上述方法还包括:在生成上述交互管理信息之后,触发上述语音输出设备播报第四信息,上述第四信息用于请求用户的交互触发参数,上述交互触发参数包括下述至少之一:交互触发方式、交互触发账号、交互触发密码、交互触发口令、交互触发生物信息;根据接收到的上述交互触发参数得到上述交互触发信息。In one embodiment, when the working state is the voice interaction state, in response to receiving the interaction trigger information, the interaction content of the liquidity information of the target object is generated based on the interaction trigger information and the interaction management information. Before, the method further includes: after generating the interaction management information, triggering the voice output device to broadcast fourth information, where the fourth information is used to request interaction trigger parameters of the user, and the interaction trigger parameters include at least one of the following: Trigger method, interaction trigger account, interaction trigger password, interaction trigger password, interaction trigger biological information; the interaction trigger information is obtained according to the received interaction trigger parameters.
本公开实施例对交互触发方式不做限定,比如,可以支持指纹、密码、口令、虹膜、人脸等多种验证方式的交互触发,并且也不限定交互触发渠道,比如可以由直播应用提供交互触发渠道,也可以接入任一第三方的交互触发渠道,交互触发有关的信息被封装为交互触发参数,通过与用户进行语音交互的方式触发用户给出交互触发参数,从而生成交互触发信息。The embodiment of the present disclosure does not limit the interactive triggering method. For example, it can support the interactive triggering of multiple authentication methods such as fingerprint, password, password, iris, face, etc., and also does not limit the interactive triggering channel. For example, the live broadcast application can provide interactive triggering. The trigger channel can also be connected to the interaction trigger channel of any third party. The information related to the interaction trigger is encapsulated as the interaction trigger parameter, and the user is triggered to give the interaction trigger parameter by means of voice interaction with the user, thereby generating the interaction trigger information.
在一个实施方式中,在获取到交互管理信息和交互触发信息之后,还需要获取目标对象的寄送地址,这种情况下,上述基于上述交互触发信息和上述交互管理信息生成上述目标对象的流通性信息交互内容,包括:触发上述语音输出设备播报第五信息,上述第五信息用于请求用户的地址信息;对接收到的第三目标语音进行解析,得到上述地址信息;根据上述交互管理信息、上述交互触发信息和上述地址信息,生成上述流通性信息交互内容。本公开实施例并不对地址信息进行限定,其可以包括用户的姓名,电话,住址,邮编等内容,还可以包括寄送的选配信息,比如,选择工作日配送或者休息日配送,上午配送或下午配送,是否到付配送,是否关联配送保险等内容。本公开实施例支持目标对象的信息的全自动语音获取、交互触发信息的全自动语音获取以及地址信息的全自动语音获取,从而实现了直播购物全程语音化,提升用户体验。In one embodiment, after acquiring the interaction management information and interaction trigger information, it is also necessary to acquire the delivery address of the target object. In this case, the above-mentioned circulation of the target object is generated based on the interaction trigger information and the interaction management information. Sexual information interaction content, including: triggering the voice output device to broadcast fifth information, where the fifth information is used to request the address information of the user; parsing the received third target voice to obtain the address information; according to the interaction management information , the above interaction trigger information and the above address information to generate the above liquidity information interaction content. The embodiment of the present disclosure does not limit the address information, which may include the user's name, phone number, address, zip code, etc., and may also include optional delivery information, for example, select weekday delivery or rest day delivery, morning delivery or Afternoon delivery, whether to pay on delivery, whether to associate delivery insurance, etc. The embodiments of the present disclosure support fully automatic voice acquisition of target object information, fully automatic voice acquisition of interactive trigger information, and fully automatic voice acquisition of address information, thereby realizing voice-based live shopping throughout the entire process and improving user experience.
本公开实施例并不限定地址信息解析方法,如图6所示,可以获取地址模板信息;将针对上述第三目标语音的解析结果与上述地址模板信息进行匹配,得到上述地址信息。也就是说,本公开实施例可以通过智能匹配的方式根据地址模板生成地址信息,比如,根据解析结果中的具体地址,可以自动为地址模板中的省、市、区、门牌号、楼层、楼牌号等项目适配对应的内容,从而提升地址信息规范度,降低目标对象配送错误的概率,当然地址信息也可以通过语音播报的方式给用户确认。The embodiment of the present disclosure does not limit the address information parsing method. As shown in FIG. 6 , address template information can be obtained; the parsing result for the third target voice is matched with the address template information to obtain the address information. That is to say, the embodiments of the present disclosure can generate address information according to an address template in an intelligent matching manner. For example, according to the specific address in the parsing result, the province, city, district, house number, floor, building in the address template can be automatically generated. Items such as brand names are adapted to the corresponding content, thereby improving the standardization of address information and reducing the probability of wrong delivery of target objects. Of course, address information can also be confirmed to users by voice broadcast.
在一个实施例中,上述响应于接收到流通性信息的情况,基于上述流通性信息生成针对目标对象的交互管理信息之前,上述方法还包括:响应于接收到第四目标语音的情况,对上述第四目标语音进行解析;根据针对上述第四目标语音的解析结果确定与上述目标对象相关的目标内容,触发上述语音输出设备播报上述目标内容对应的关联信息,上述关联信息用于对上述目标内容进行解释或者给出建议。In one embodiment, before generating the interaction management information for the target object based on the liquidity information in response to receiving the liquidity information, the method further includes: in response to receiving the fourth target voice, performing The fourth target voice is analyzed; the target content related to the above-mentioned target object is determined according to the analysis result of the above-mentioned fourth target voice, and the above-mentioned voice output device is triggered to broadcast the associated information corresponding to the above-mentioned target content, and the above-mentioned associated information is used for the above-mentioned target content. Explain or give advice.
本公开实施例并不限定第四目标语音的具体内容和发出契机,比如,用户可以在直播过程的任何时间点发出第四目标语音,该第四目标语音可以请求对目标对象的有关内容进行解释或者给出建议,比如,直播平台一共上架四个对象“毛巾”、“香皂”、“电动牙刷”、“毛呢大衣”,用户可以发出下面的语音“我想知道电动牙刷充一次电可以用多久”,则可以向用户自动播报与电动压缩耗电情况有关的信息。再比如,用户还可以发出“毛呢大衣怎么洗”,则可以向用户自动播报毛呢大衣清洗的注意事项。本公开实施例通过语音交互的方式可以自动解答用户观看直播过程中产生的疑问和给出相关建议,进一步提升直播体验。The embodiment of the present disclosure does not limit the specific content and the opportunity of issuing the fourth target voice. For example, the user can send out the fourth target voice at any time point in the live broadcast process, and the fourth target voice can request to explain the relevant content of the target object. Or give suggestions. For example, there are four objects "towel", "soap", "electric toothbrush", and "wool coat" on the live platform, and the user can make the following voice "I want to know that the electric toothbrush can be used on a single charge. For how long", information about the power consumption of electric compression can be automatically broadcast to the user. For another example, the user can also issue "how to wash the woolen coat", and then the precautions for cleaning the woolen coat can be automatically broadcast to the user. The embodiments of the present disclosure can automatically answer questions and give relevant suggestions in the process of watching the live broadcast by the user by means of voice interaction, so as to further improve the live broadcast experience.
请参考图7,其示出本公开实施例中一个具体的语音交互场景流程图。语音交互过程至少包括下述步骤:Please refer to FIG. 7 , which shows a flowchart of a specific voice interaction scenario in an embodiment of the present disclosure. The voice interaction process includes at least the following steps:
首先,在用户进入直播间之后询问用户直播间的交互模式。如果用户选择语音交互模式,则为用户提供对应的语音直播间。在该语音直播间中,麦克风(语音输入设备)全程处于唤醒状态。如果麦克风接收到用户发出的第一目标语音,则可以进入下单流程。在下单流程中通过与用户进行语音交互获取订单信息,根据订单信息生成物品订单。然后继续通过与用户进行语音交互获取支付信息,根据支付信息完成支付并生成交易订单,从而成功完成了一次全程语音的电商购物。First, after the user enters the live room, the user is asked about the interaction mode of the live room. If the user selects the voice interaction mode, a corresponding voice live broadcast room is provided for the user. In the voice live broadcast room, the microphone (voice input device) is always awake. If the microphone receives the first target voice sent by the user, the order placement process can be entered. During the ordering process, the order information is obtained by interacting with the user through voice, and the item order is generated according to the order information. Then continue to obtain payment information through voice interaction with the user, complete payment and generate transaction orders according to the payment information, thus successfully completing a full-course voice e-commerce shopping.
图8是根据一示例性实施例示出的一种直播语音交互装置框图。参照图8,该装置包括:Fig. 8 is a block diagram of a live voice interaction apparatus according to an exemplary embodiment. Referring to Figure 8, the device includes:
切换模块10,被配置为执行在直播应用被打开的情况下,响应于接收到语音交互模式唤醒信息的情况,将上述直播应用的工作状态配置为语音交互状态,上述语音交互状态为通过解析语音输入设备接收到的信息确定对象及进行针对上述对象的流通性信息交互的状态;The switching module 10 is configured to perform, in the case that the live broadcast application is opened, in response to the situation of receiving the voice interaction mode wake-up information, configure the working state of the above-mentioned live broadcast application as a voice interaction state, and the above-mentioned voice interaction state is to analyze the voice The information received by the input device determines the object and the status of the exchange of liquidity information for the above object;
语音处理模块20,被配置为执行在上述工作状态为上述语音交互状态的情况下,响应于接收到流通性信息的情况,基于上述流通性信息生成针对目标对象的交互管理信息;以及,在上述工作状态为上述语音交互状态的情况下,响应于接收到交互触发信息的情况,基于上述交互触发信息和上述交互管理信息生成上述目标对象的流通性信息交互内容。The
在一示例性实施方式中,上述语音处理模块,被配置为执行在上述工作状态为上述语音交互状态的情况下,维持上述语音输入设备的唤醒状态。In an exemplary embodiment, the voice processing module is configured to maintain the wake-up state of the voice input device when the working state is the voice interaction state.
在一示例性实施方式中,上述切换模块,被配置为执行在上述工作状态为上述语音交互状态的情况下,响应于接收到语音交互模式关闭信息的情况,将上述工作状态配置为字符交互状态,并将上述语音输入设备配置为非唤醒状态,上述字符交互状态为通过字符输入设备接收到的信息确定对象及进行针对上述对象的流通性信息交互的状态。In an exemplary embodiment, the above-mentioned switching module is configured to perform, in the case that the above-mentioned working state is the above-mentioned voice interactive state, in response to receiving the voice interactive mode closing information, configure the above-mentioned working state as a character interactive state. , and configure the above voice input device to be in a non-awakening state, and the above character interaction state is a state in which an object is determined by the information received by the character input device and the liquidity information exchange for the above object is performed.
在一示例性实施方式中,上述切换模块,被配置为执行:In an exemplary embodiment, the above-mentioned switching module is configured to perform:
响应于上述直播应用被打开的情况,触发语音输出设备播报第一信息并且唤醒上述语音输入设备,上述第一信息用于询问用户是否将上述直播应用配置为上述语音交互状态;In response to the above-mentioned live application being opened, trigger the voice output device to broadcast the first information and wake up the above-mentioned voice input device, and the above-mentioned first information is used to ask the user whether to configure the above-mentioned live application to the above-mentioned voice interaction state;
解析上述语音输入设备接收到的针对上述第一信息的反馈信息,得到第一解析结果;Parsing the feedback information for the first information received by the voice input device to obtain a first parsing result;
在上述第一解析结果表征上述用户存在语音交互意愿的情况下,判定接收到上述语音交互模式唤醒信息。In the case that the above-mentioned first analysis result indicates that the above-mentioned user has a voice interaction willingness, it is determined that the above-mentioned voice interaction mode wake-up information is received.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
响应于直播过程进行到上述目标对象对应的流通性信息交互时段的情况,触发上述语音输出设备播报第二信息,上述第二信息用于询问用户是否关联上述目标对象;In response to the situation in which the live broadcast process reaches the liquidity information interaction period corresponding to the above-mentioned target object, trigger the above-mentioned voice output device to broadcast the second information, and the above-mentioned second information is used to ask the user whether to associate the above-mentioned target object;
解析上述语音输入设备接收到的针对上述第二信息的反馈信息,得到第二解析结果;Parsing the feedback information for the second information received by the voice input device to obtain a second parsing result;
在上述第二解析结果表征上述用户存在关联意愿的情况下,接收上述流通性信息。In the case that the above-mentioned second analysis result indicates that the above-mentioned user has a willingness to associate, the above-mentioned liquidity information is received.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
响应于在直播过程中接收到第一目标语音的情况,对上述第一目标语音进行解析;In response to receiving the first target voice during the live broadcast, parse the above-mentioned first target voice;
响应于解析结果中存在上述目标对象对应的信息的情况,接收上述流通性信息。In response to the fact that the information corresponding to the above-mentioned target object exists in the analysis result, the above-mentioned liquidity information is received.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
获取上述目标对象对应的至少一个属性项;Obtain at least one attribute item corresponding to the above target object;
针对每一属性项,触发上述语音输出设备播报上述属性项对应的第三信息,上述第三信息用于请求用户对上述属性项对应的至少一个属性值进行选择;For each attribute item, trigger the above-mentioned voice output device to broadcast the third information corresponding to the above-mentioned attribute item, and the above-mentioned third information is used to request the user to select at least one attribute value corresponding to the above-mentioned attribute item;
解析上述语音输入设备接收到的针对上述第三信息的反馈信息,得到上述属性项对应的流通性信息项。Parse the feedback information for the third information received by the voice input device to obtain the liquidity information item corresponding to the attribute item.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
接收第二目标语音,对上述第二目标语音进行解析;receiving the second target voice, and analyzing the above-mentioned second target voice;
获取上述目标对象对应的至少一个属性项;Obtain at least one attribute item corresponding to the above target object;
根据解析结果确定每一上述属性项所对应的内容,根据上述内容得到上述属性项对应的流通性信息项。The content corresponding to each attribute item is determined according to the analysis result, and the liquidity information item corresponding to the attribute item is obtained according to the content.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
在生成上述交互管理信息之后,触发上述语音输出设备播报第四信息,上述第四信息用于请求用户的交互触发参数,上述交互触发参数包括下述至少之一:交互触发方式、交互触发账号、交互触发密码、交互触发口令、交互触发生物信息;After the interaction management information is generated, the voice output device is triggered to broadcast fourth information, where the fourth information is used to request interaction trigger parameters of the user, and the interaction trigger parameters include at least one of the following: an interaction trigger mode, an interaction trigger account, Interactive trigger password, interactive trigger password, interactive trigger biological information;
根据接收到的上述交互触发参数得到上述交互触发信息。The interaction trigger information is obtained according to the received interaction trigger parameters.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
触发上述语音输出设备播报第五信息,上述第五信息用于请求用户的地址信息;Trigger the above-mentioned voice output device to broadcast fifth information, and the above-mentioned fifth information is used to request the address information of the user;
对接收到的第三目标语音进行解析,得到上述地址信息;Parsing the received third target voice to obtain the above address information;
根据上述交互管理信息、上述交互触发信息和上述地址信息,生成上述流通性信息交互内容。According to the above-mentioned interaction management information, the above-mentioned interaction trigger information and the above-mentioned address information, the above-mentioned liquidity information interaction content is generated.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
获取地址模板信息;Get address template information;
将针对上述第三目标语音的解析结果与上述地址模板信息进行匹配,得到上述地址信息。The above-mentioned address information is obtained by matching the analysis result for the above-mentioned third target speech with the above-mentioned address template information.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
触发上述语音输出设备播报上述流通性信息中的每一流通性信息项;Trigger the above-mentioned voice output device to broadcast each liquidity information item in the above-mentioned liquidity information;
在接收到针对上述流通性信息中的各上述流通性信息项的确认指令的情况下,基于上述流通性信息生成针对目标对象的交互管理信息。In the case of receiving a confirmation instruction for each of the above-mentioned liquidity information items in the above-mentioned liquidity information, the interaction management information for the target object is generated based on the above-mentioned liquidity information.
在一示例性实施方式中,上述语音处理模块,被配置为执行:In an exemplary embodiment, the above-mentioned speech processing module is configured to perform:
响应于接收到第四目标语音的情况,对上述第四目标语音进行解析;In response to receiving the fourth target voice, parse the above-mentioned fourth target voice;
根据针对上述第四目标语音的解析结果确定与上述目标对象相关的目标内容,触发上述语音输出设备播报上述目标内容对应的关联信息,上述关联信息用于对上述目标内容进行解释或者给出建议。The target content related to the target object is determined according to the analysis result of the fourth target voice, and the voice output device is triggered to broadcast associated information corresponding to the target content, where the associated information is used to explain or provide suggestions for the target content.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Regarding the apparatus in the above-mentioned embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment of the method, and will not be described in detail here.
图9是根据一示例性实施例示出的一种用于直播语音交互的电子设备600的框图。FIG. 9 is a block diagram of an electronic device 600 for live-streaming voice interaction according to an exemplary embodiment.
该电子设备可以是服务器,还可以是终端设备,其内部结构图可以如图9所示。该电子设备包括通过系统总线连接的处理器、存储器和网络接口。其中,该电子设备的处理器用于提供计算和控制能力。该电子设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该电子设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种直播语音交互方法。The electronic device may be a server or a terminal device, and its internal structure diagram may be as shown in FIG. 9 . The electronic device includes a processor, memory, and a network interface connected by a system bus. Among them, the processor of the electronic device is used to provide computing and control capabilities. The memory of the electronic device includes a non-volatile storage medium and an internal memory. The nonvolatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the execution of the operating system and computer programs in the non-volatile storage medium. The network interface of the electronic device is used to communicate with an external terminal through a network connection. The computer program implements a live voice interaction method when executed by the processor.
本领域技术人员可以理解,图9中示出的结构,仅仅是与本公开方案相关的部分结构的框图,并不构成对本公开方案所应用于其上的电子设备的限定,具体的电子设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 9 is only a block diagram of a partial structure related to the solution of the present disclosure, and does not constitute a limitation on the electronic device to which the solution of the present disclosure is applied. The specific electronic device may be Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
在示例性实施例中,还提供了一种电子设备,包括:处理器;用于存储该处理器可执行指令的存储器;其中,该处理器被配置为执行该指令,以实现如本公开实施例中的直播语音交互方法。In an exemplary embodiment, there is also provided an electronic device, comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions to implement the present disclosure The live voice interaction method in the example.
在示例性实施例中,还提供了一种计算机可读存储介质,当该计算机可读存储介质中的指令由电子设备的处理器执行时,使得电子设备能够执行本公开实施例中的直播语音交互方法。In an exemplary embodiment, a computer-readable storage medium is also provided, when the instructions in the computer-readable storage medium are executed by the processor of the electronic device, the electronic device can execute the live voice in the embodiment of the present disclosure interactive method.
在示例性实施例中,还提供了一种计算机程序产品,计算机程序产品包括计算机程序,计算机程序存储在可读存储介质中,计算机设备的至少一个处理器从可读存储介质读取并执行计算机程序,使得计算机设备执行本公开实施例的直播语音交互方法。In an exemplary embodiment, a computer program product is also provided, the computer program product includes a computer program, the computer program is stored in a readable storage medium, and at least one processor of the computer device reads from the readable storage medium and executes the computer The program enables the computer device to execute the live voice interaction method of the embodiment of the present disclosure.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,该计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through a computer program, and the computer program can be stored in a non-volatile computer-readable storage medium , when the computer program is executed, it may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other medium used in the various embodiments provided in this application may include non-volatile and/or volatile memory. Nonvolatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Road (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。Other embodiments of the present disclosure will readily occur to those skilled in the art upon consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the present disclosure that follow the general principles of the present disclosure and include common knowledge or techniques in the technical field not disclosed by the present disclosure . The specification and examples are to be regarded as exemplary only, with the true scope and spirit of the disclosure being indicated by the following claims.
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。It is to be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210592020.7A CN115050361A (en) | 2022-05-27 | 2022-05-27 | Live voice interaction method, device, storage medium and electronic device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210592020.7A CN115050361A (en) | 2022-05-27 | 2022-05-27 | Live voice interaction method, device, storage medium and electronic device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115050361A true CN115050361A (en) | 2022-09-13 |
Family
ID=83159551
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210592020.7A Pending CN115050361A (en) | 2022-05-27 | 2022-05-27 | Live voice interaction method, device, storage medium and electronic device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115050361A (en) |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030078793A1 (en) * | 2001-10-24 | 2003-04-24 | Toth Mark E. | Enhanced customer-centric restaurant system |
CN108876546A (en) * | 2018-06-23 | 2018-11-23 | 广州联汇网络科技有限公司 | A kind of community service system and method based on region chain and barrage technology |
CN111242721A (en) * | 2019-12-30 | 2020-06-05 | 北京百度网讯科技有限公司 | Voice meal ordering method and device, electronic equipment and storage medium |
CN111541904A (en) * | 2020-04-15 | 2020-08-14 | 腾讯科技(深圳)有限公司 | Information prompting method, device, equipment and storage medium in live broadcast process |
CN111741368A (en) * | 2020-02-19 | 2020-10-02 | 北京沃东天骏信息技术有限公司 | Interactive video display and generation method, device, equipment and storage medium |
CN112312211A (en) * | 2020-10-30 | 2021-02-02 | 维沃移动通信有限公司 | Prompting method and device |
CN112581198A (en) * | 2019-09-27 | 2021-03-30 | 志趋汽车科技(上海)有限公司 | Vehicle-mounted instant order generation method |
CN113298585A (en) * | 2020-05-19 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Method and device for providing commodity object information and electronic equipment |
CN113766253A (en) * | 2021-01-04 | 2021-12-07 | 北京沃东天骏信息技术有限公司 | Live broadcast method, device, equipment and storage medium based on virtual anchor |
WO2022037086A1 (en) * | 2020-08-18 | 2022-02-24 | 广州华多网络科技有限公司 | Network live broadcast transaction order execution method and apparatus therefor, network live broadcast transaction order control method and apparatus therefor, and device and medium |
-
2022
- 2022-05-27 CN CN202210592020.7A patent/CN115050361A/en active Pending
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030078793A1 (en) * | 2001-10-24 | 2003-04-24 | Toth Mark E. | Enhanced customer-centric restaurant system |
CN108876546A (en) * | 2018-06-23 | 2018-11-23 | 广州联汇网络科技有限公司 | A kind of community service system and method based on region chain and barrage technology |
CN112581198A (en) * | 2019-09-27 | 2021-03-30 | 志趋汽车科技(上海)有限公司 | Vehicle-mounted instant order generation method |
CN111242721A (en) * | 2019-12-30 | 2020-06-05 | 北京百度网讯科技有限公司 | Voice meal ordering method and device, electronic equipment and storage medium |
CN111741368A (en) * | 2020-02-19 | 2020-10-02 | 北京沃东天骏信息技术有限公司 | Interactive video display and generation method, device, equipment and storage medium |
CN111541904A (en) * | 2020-04-15 | 2020-08-14 | 腾讯科技(深圳)有限公司 | Information prompting method, device, equipment and storage medium in live broadcast process |
CN113298585A (en) * | 2020-05-19 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Method and device for providing commodity object information and electronic equipment |
WO2022037086A1 (en) * | 2020-08-18 | 2022-02-24 | 广州华多网络科技有限公司 | Network live broadcast transaction order execution method and apparatus therefor, network live broadcast transaction order control method and apparatus therefor, and device and medium |
CN112312211A (en) * | 2020-10-30 | 2021-02-02 | 维沃移动通信有限公司 | Prompting method and device |
CN113766253A (en) * | 2021-01-04 | 2021-12-07 | 北京沃东天骏信息技术有限公司 | Live broadcast method, device, equipment and storage medium based on virtual anchor |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9239705B2 (en) | Method and apparatus for customized software development kit (SDK) generation | |
US20190213528A1 (en) | Digital assistant task management | |
KR101793355B1 (en) | Intelligent automated agent for a contact center | |
CN107977236B (en) | Question-answering system generation method, terminal device, storage medium and question-answering system | |
US20170278117A1 (en) | Customer experience personalisation management platform | |
WO2023050730A1 (en) | Object delivery processing method and apparatus | |
CN111028007B (en) | User portrait information prompting method, device and system | |
CN111383094A (en) | Product service full-chain driving method, equipment and readable storage medium | |
US20210406718A1 (en) | Leveraging dialogue history in updated dialogue | |
CN113436622A (en) | Processing method and device of intelligent voice assistant | |
WO2021015284A1 (en) | Interactive input assistance system and interactive input assistance method | |
CN113949885A (en) | Live broadcast processing method and device, electronic equipment and computer readable storage medium | |
TWM554612U (en) | Intelligent online customer service system | |
CA2965457C (en) | Computer-implemented system and method for providing on-demand expert advice to a consumer | |
CN115050361A (en) | Live voice interaction method, device, storage medium and electronic device | |
CN117972673B (en) | Semantic verification code generation method, device, equipment and medium based on style transfer | |
CN114519177A (en) | Verification processing method, device, electronic device and storage medium | |
CN117555659A (en) | Task execution methods, devices, electronic equipment and computer-readable media | |
US11778051B2 (en) | Method and system for generating a data collection process in a user device | |
US20210304230A1 (en) | Method and system for generating a data collection process in a user device | |
Torres-Cruz et al. | Evaluation of Performance of Artificial Intelligence System during Voice Recognition in Social Conversation | |
US20200175230A1 (en) | Method for determining a conversational agent on a terminal | |
CN113342667A (en) | Data processing method, data processing device, electronic equipment and computer readable storage medium | |
CN114023465A (en) | Session processing method, device, equipment and computer readable storage medium | |
KR102742838B1 (en) | Apparatus for generating customer specific contactcenter and method for the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |