CN109086285A - Chinese intelligent processing method and system and device based on morpheme - Google Patents
- Publication number
- CN109086285A (application CN201710857227.1A)
- Authority
- CN
- China
- Prior art keywords
- poem
- morpheme
- data
- word
- chinese
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
Abstract
The present invention provides a morpheme-based intelligent processing method, system, and device for Chinese. The method includes the following steps: collecting poem data with the morpheme as the word-building unit; establishing the fields of a poem database and generating the fields of a relational poem database; adding the collected poem data to the fields of the relational poem database, and establishing data link trees within and between the poem data, thereby generating a relational morpheme database containing the poem data. The invention provides a search function for poems, allowing people to process Chinese poems conveniently, quickly, and accurately.
Description
Technical field
The present invention relates to the field of computer data processing, and in particular to a method, system, and device for intelligently processing Chinese on a computer, especially classical poetry and other classical texts such as the Tang poems, the Song ci, the Three Character Classic, the Records of the Grand Historian, and the Book of Songs.
Background art
Chinese civilization goes back at least 5,000 years. With mainland China's reform, opening-up, and growing strength, it has gradually become better known to people abroad. Its ancient civilization in particular fascinates many foreigners, especially foreign researchers. Classical poetry can fully convey an artistic mood and stir the reader's feelings in just a few short lines.
For example, "Returning Home: An Impromptu Verse" by the Tang poet He Zhizhang: "I left home a mere child and come back an old man; my local accent is unchanged, but the hair at my temples has thinned. The children meet me and do not know me; laughing, they ask where the visitor comes from." This is a reflective poem by someone who has long lived away from home and misses his hometown. The poet finds himself in a native place that is at once familiar and strange, and his mood along the way is far from calm. He left home in the prime of youth and returns with thinning hair, and cannot help sighing with emotion.
However, because Chinese is difficult to learn, many people, especially foreigners and children, cannot learn it well, let alone understand poetry, the essence of ancient Chinese. They therefore cannot fully appreciate classical poems: for example, they cannot retrieve a poem from a few words of a poet's work or of the poem itself, or get at the poem's full text, pronunciation, foreign-language translations, and author. As a result, this poetry cannot make its due contribution to world culture.
Summary of the invention
To overcome the defects of the prior art, the present invention provides a morpheme-based intelligent processing method, system, and device for Chinese. It offers a powerful search function for poems, especially classical poems, so that people, especially foreigners, can process Chinese poems conveniently, quickly, and accurately. This may help increase the use of Chinese at home and abroad and promote Chinese culture within world civilization.
To achieve the object of the invention, a morpheme-based intelligent processing method for Chinese is provided, comprising the following steps:
Collecting poem data, with the morpheme as the word-building unit;
Establishing the fields of a poem database and generating the fields of a relational poem database;
Adding the collected poem data to the fields of the relational poem database, and establishing data link trees within and between the poem data, thereby generating a relational morpheme database containing the poem data.
Preferably, the Chinese intelligent processing method further includes the following step:
Retrieving poem data from the poem database, with the morpheme as the word-building unit of words and phrases.
Preferably, the Chinese intelligent processing method further includes the following step:
Obtaining one or more of the full original text, translation, pronunciation, author, and history from the poem data link tree.
Preferably, the Chinese intelligent processing method further includes the following step:
If the required poem is not retrieved, returning directly; or adding the poem being sought to the relational poem database as new poem data, building its relational data link tree, and then returning and exiting.
Preferably, in the Chinese intelligent processing method, adding the data and establishing relational links for the data includes the following steps:
Adding the collected poem data to the fields of the relational poem database;
Establishing a data link tree with morphemes as the root, single-character morphemes and word morphemes as the branches, and poem data as the leaves;
Establishing the corresponding links between the poem data items of the data link tree.
Preferably, in the Chinese intelligent processing method, the morpheme is the smallest linguistic unit, smaller than a word, and the same character may correspond to multiple morphemes;
The most significant difference between a morpheme and a character is that a morpheme expresses meaning and is neutral with respect to written form: it can be displayed in a variety of different fonts, so its code is called a "neutral code".
The present invention also provides a morpheme-based Chinese intelligent processing system, including computer system software modules implementing the above morpheme-based Chinese intelligent processing method.
Preferably, the Chinese intelligent processing system includes a collection module, a field establishment module, and a relational linking module, wherein:
The collection module is used to collect poem data, especially classical poem data, with the morpheme as the word-building unit;
The field establishment module is used to establish the fields of the poem database and generate the fields of the relational poem database;
The relational linking module is used to add the collected poem data to the fields of the relational poem database and to establish the data link trees within and between the poem data.
The present invention also provides a storage medium storing the computer software program of the morpheme-based Chinese intelligent processing system.
The present invention also provides hardware including a CPU electrically connected to the storage medium;
The CPU loads the computer software program of the Chinese intelligent processing system from the storage medium and executes it.
The morpheme-based Chinese intelligent processing method, system, and device of the present invention have the following advantages:
They provide powerful computer processing of poems, especially classical poems, so that people, especially foreigners, can process Chinese conveniently, quickly, and accurately. After part or all of a poem is obtained by retrieval, intelligent linking yields the translation, pronunciation, author, and so on of that part or of the full text. This may further help increase the use of Chinese at home and abroad and contribute to the promotion of Chinese culture.
Description of the drawings
To explain the specific embodiments of the invention or the technical solutions of the prior art more clearly, the drawings needed to describe the specific embodiments or the prior art are briefly introduced below. The drawings described below show some embodiments of the invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow chart of the morpheme-based Chinese intelligent retrieval and processing method of an embodiment of the invention;
Fig. 2 is a flow chart of one embodiment of step S300 in Fig. 1;
Fig. 3 is a schematic structural diagram of the morpheme-based Chinese processing device of an embodiment of the invention.
Specific embodiment
The invention is described below with reference to Figs. 1 to 3 to make its objects, technical solutions, and advantages clearer, and is described in detail in conjunction with specific embodiments. Descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the invention. These descriptions are exemplary only and are not intended to limit the scope of the invention.
A morpheme-based intelligent processing method for Chinese poems according to an embodiment of the invention, as shown in Fig. 1, includes the following steps:
Step S100: collect poem data, especially classical poem data, with the morpheme as the word-building unit.
A morpheme is the smallest linguistic unit, smaller than a word, and the same character can correspond to multiple morphemes. For example, the character 传 corresponds to two morphemes (English: send, biography; that is, to convey, or a biography); the character 历 corresponds to two morphemes (English: history, calendar); the character 日 corresponds to three morphemes (English: sun, day, Japanese; that is, the sun, a date, or Japan).
A morpheme is characterized by its precision: it has only one pronunciation and one meaning (uniqueness).
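The one-character-to-many-morphemes relationship described above can be sketched as a small lookup table. This is an illustrative Python sketch only, not the patent's implementation: the `Morpheme` class, the `M0001`-style codes, and the specific characters and pinyin are assumptions chosen to match the English glosses in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Morpheme:
    code: str    # hypothetical "neutral code" identifying the morpheme
    glyph: str   # the written character (assumed from the English gloss)
    pinyin: str  # the single pronunciation of this morpheme
    gloss: str   # the single meaning of this morpheme


# Toy table: one character may appear under several morpheme codes.
MORPHEMES = [
    Morpheme("M0001", "日", "ri", "sun"),
    Morpheme("M0002", "日", "ri", "day"),
    Morpheme("M0003", "日", "ri", "Japan"),
    Morpheme("M0004", "传", "chuan", "send"),
    Morpheme("M0005", "传", "zhuan", "biography"),
]


def morphemes_for(glyph):
    """Return every morpheme registered for one written character."""
    return [m for m in MORPHEMES if m.glyph == glyph]
```

Under these assumptions, `morphemes_for("日")` returns three entries, one per meaning, and each entry carries exactly one pronunciation and one gloss.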
Writing is used to keep records. An article is made of sentences, a sentence is made of words and phrases, and words and phrases are made of characters. Unlike the scripts of Western languages, a Chinese character has three attributes at once: form, sound, and meaning, and characters of the same form can have multiple meanings and pronunciations. The ambiguity of Chinese characters hampers automatic information processing, affects big-data analysis, makes retrieval difficult, and makes translation logic complex and error-prone. To address this weakness of Chinese characters, the embodiment of the invention proposes defining words and phrases in terms of morphemes.
The differences among word, morpheme, and character are: (1) a word is the unit of sentence building; (2) a morpheme is the unit of word building; (3) a character is the written symbol that records words and morphemes. The first two belong to the system of linguistic signs and have a meaning attribute; the last belongs to the writing system, and its main attribute is form, while its meaning attribute is fuzzy. The most significant difference between a morpheme and a character is that a morpheme expresses meaning and is neutral with respect to form: it can be displayed in a variety of fonts, so its code can be called a "neutral code". A character represents form, and characters of the same form can have multiple meanings. From ancient times to the present, people have always taken the character as the unit of word building, and all information systems, including search engines, use the character as the basic unit of information processing.
The breakthrough of the embodiment of the invention is to abandon this long-entrenched conventional approach and take the morpheme as the word-building unit. Retrieval, analysis, statistics, big-data processing, artificial-intelligence applications, and so on are thereby expected to improve greatly. Information processing with the morpheme as its core structure is something other language families (including English and French) cannot achieve, as shown in Table 1.
Taking the morpheme as the core of Chinese also means that conversion between simplified and traditional characters no longer requires context analysis. Retrieval is instead driven by the morpheme table, which registers both the simplified and the traditional glyph of each morpheme, so there is no need to identify whether the input is simplified or traditional, and retrieval accuracy can reach essentially 100%.
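A morpheme table that registers both glyph forms might look like the following sketch. The table layout, the codes, and the function name `codes_for` are hypothetical; the point is only that a simplified or a traditional glyph resolves to the same morpheme code without any context analysis.

```python
# Hypothetical morpheme table: each code registers both written forms.
MORPHEME_TABLE = {
    "M0101": {"simplified": "离", "traditional": "離", "gloss": "leave"},
    "M0004": {"simplified": "传", "traditional": "傳", "gloss": "send"},
}

# Reverse index built once: any registered glyph -> set of morpheme codes.
GLYPH_TO_CODES = {}
for code, entry in MORPHEME_TABLE.items():
    for form in ("simplified", "traditional"):
        GLYPH_TO_CODES.setdefault(entry[form], set()).add(code)


def codes_for(glyph):
    """Look up morpheme codes for a glyph, whichever script it is in."""
    return GLYPH_TO_CODES.get(glyph, set())
```

Because the search key is the morpheme code rather than the glyph, a query typed in either script reaches the same records. The sketch does not by itself demonstrate any particular accuracy figure.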
Table 1:
Morphemes, particularly monosyllabic morphemes, are the units from which words or phrases are formed, and they should represent the sound and meaning of words and phrases very accurately. From the perspective of word formation, morphemes are classified into eight major classes: (1) Chinese language morphemes; (2) surname morphemes; (3) given-name morphemes; (4) place-name morphemes; (5) science and technology morphemes; (6) classical Chinese morphemes; (7) meaningless phonetic morphemes; (8) form-representing morphemes. The last two classes are not recognized as true morphemes in the prior art, but in the embodiment of the invention they are also encoded, as required for accurate information retrieval and big-data analysis, and are called "pseudo-morphemes" (French: assimilé, that is, treated as equivalent to a kind of morpheme).
As one embodiment, many characters that form words or phrases (especially loanwords) only represent sound and do not express meaning, for example the characters "horse" and "reach" in 马达 ("motor"), or the characters "snow", "iron", and "dragon" in 雪铁龙 ("Citroën"). These characters, used to transliterate foreign products, trademarks, personal names, and place names, serve only to represent sound; the characters for horse, reach, snow, iron, dragon, and so on have nothing to do with their literal meanings here.
The poem database in the embodiment of the invention uses a "meaningless phonetic morpheme" table to collect all phonetic Chinese characters and assigns a code to each phonetic character, which greatly improves the accuracy of information retrieval and analysis.
The classical poem data include but are not limited to the Tang poems, the Song ci, the Book of Songs, the Records of the Grand Historian, the Shuowen Jiezi ("Origin of Chinese Characters"), the Three Character Classic, the Kangxi Dictionary, and so on.
Step S200: establish the fields of the poem database and generate the relational poem database.
As one embodiment, establishing the poem database means creating a database file with a relational database system, including but not limited to Oracle, SQL (Structured Query Language) databases, Sybase, and ACCESS.
In the embodiment of the invention, a poem database, especially a classical poem database, is established; it includes but is not limited to an original-text field, a translation field, and so on.
A poem database with an original-text field and/or a translation field is established so that people, especially foreigners, learning a poem can retrieve its original text and translation and understand all or part of it;
Preferably, the poem database can also include a Chinese pronunciation field for the original text and a foreign-language pronunciation field, so that after retrieving a poem the user can learn, and even recite, it in Chinese and in the foreign language;
Preferably, the poem database can also include a line field and a foreign-language line field, and an author field and an author-translation field, which encourages poetry lovers and foreign researchers to study Chinese poetry further.
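The fields listed above can be laid out as a relational schema. The sketch below uses Python's built-in `sqlite3` module purely for convenience (the text names Oracle, Sybase, and ACCESS; SQLite is an assumption), and every table and column name is a hypothetical choice.

```python
import sqlite3

# Hypothetical schema: one row per poem, one row per line of a poem.
SCHEMA = """
CREATE TABLE poem (
    poem_id               INTEGER PRIMARY KEY,
    original_text         TEXT NOT NULL,  -- poem original-text field
    translation           TEXT,           -- poem translation field
    pronunciation_zh      TEXT,           -- Chinese pronunciation field
    pronunciation_foreign TEXT,           -- foreign-language pronunciation field
    author                TEXT,           -- author field
    author_translation    TEXT            -- author-translation field
);
CREATE TABLE poem_line (
    line_id          INTEGER PRIMARY KEY,
    poem_id          INTEGER NOT NULL REFERENCES poem(poem_id),
    line_text        TEXT NOT NULL,       -- line field
    line_translation TEXT                 -- foreign-language line field
);
"""


def create_poem_db(path=":memory:"):
    """Create an empty poem database and return the open connection."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def poem_columns(conn):
    """List the column names of the poem table."""
    return [row[1] for row in conn.execute("PRAGMA table_info(poem)")]
```

Splitting lines into their own table is one possible design; it lets a single verse be retrieved and linked independently of the full poem.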
Step S300: add the collected poem data to the fields of the relational poem database, and establish the data link trees within and between the poem data, generating a relational morpheme database containing the poem data.
The data link tree within the poem data is established from the relationships among the poem data.
As shown in Fig. 2, step S300 includes the following steps:
Step S310: add the collected poem data, especially classical poem data, to the fields of the relational poem database;
In the embodiment of the invention, the collected poem data, especially classical poem data, whose words and phrases are built from morphemes, are added to the poem database record by record, producing relational data that can be retrieved by morpheme.
Step S320: establish a data link tree with morphemes as the root, single-character morphemes and word morphemes as the branches, and poem data as the leaves;
Chinese has no more than 1,400 distinct syllables, and the number of morphemes is far larger, because one syllable can represent many different meanings. The syllable xin, for example, can represent the morphemes 辛 (pungent, as in "arduous"), 新 (new, as in "newcomer"), 心 (heart), 锌 (zinc, as in "zinc ore"), 薪 (firewood, as in "salary"), 芯 (core, as in "wick"), 馨 (fragrant), 欣 (glad, as in "joyful"), and others.
As one embodiment, taking the syllable li as the unit, the root of the morpheme tree is traversed to find the corresponding morphemes. Once the single-character morpheme for "leave" is found, "7" is entered to indicate a word morpheme of seven characters; the seven-character word morpheme "left home a mere child and come back an old man" is then looked up as a leaf on the data link tree, and that line of verse is obtained.
Traversing the morpheme root by syllable is only one traversal method of the embodiment of the invention; the root can also be traversed by electronic handwriting input or stroke input.
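The traversal just described (syllable, then single-character morpheme, then character count) can be sketched as a nested index. The `LinkTree` class and its method names are hypothetical, and the Chinese verses are assumed originals of the quoted lines; the sketch covers only the syllable-based traversal.

```python
from collections import defaultdict


class LinkTree:
    """Toy data link tree: syllable -> single-character morpheme -> verses."""

    def __init__(self):
        # Leaves are (verse, poem_id) pairs hanging off a character branch.
        self._root = defaultdict(lambda: defaultdict(list))

    def add_verse(self, syllable, char, verse, poem_id):
        """Attach a verse (a word morpheme) under one of its characters."""
        self._root[syllable][char].append((verse, poem_id))

    def chars_for(self, syllable):
        """List the single-character morphemes reachable from a syllable."""
        return sorted(self._root.get(syllable, {}))

    def find(self, syllable, char, length):
        """Return verses under the character that have the given length."""
        verses = self._root.get(syllable, {}).get(char, [])
        return [(v, pid) for v, pid in verses if len(v) == length]


tree = LinkTree()
tree.add_verse("li", "离", "少小离家老大回", 1)  # seven-character line
tree.add_verse("li", "离", "离离原上草", 2)      # five-character line
```

Calling `tree.find("li", "离", 7)` narrows the candidates to the seven-character line, from which the `poem_id` leads to the rest of the poem's data.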
Step S330: establish the corresponding links between the poem data items of the data link tree.
Related poem data items are linked to one another. For example, poems whose author is Li Bai ("li po") can be linked, which shows how many of Li Bai's poems have been handed down and so gives a rough indication of the poet's standing; links can also be established by a poem's period, such as the Tang dynasty.
Once the word morpheme of a line has been found, the poem's full text, translation, author, history, and other linked content can be found through the relational database.
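Links between related poems can be sketched as a grouping on a shared attribute such as author or dynasty. The records, field names, and helper functions below are illustrative assumptions, not data from the patent.

```python
from collections import defaultdict

# Hypothetical sample records; only the linking attributes are shown.
POEMS = [
    {"id": 1, "title": "Returning Home", "author": "He Zhizhang", "dynasty": "Tang"},
    {"id": 2, "title": "Quiet Night Thought", "author": "Li Bai", "dynasty": "Tang"},
    {"id": 3, "title": "Viewing the Waterfall at Mount Lu", "author": "Li Bai", "dynasty": "Tang"},
]


def build_links(poems, key):
    """Group poem ids by a shared attribute (for example author or dynasty)."""
    groups = defaultdict(list)
    for poem in poems:
        groups[poem[key]].append(poem["id"])
    return dict(groups)


def linked_to(poems, poem_id, key):
    """Return the ids of other poems that share the attribute with poem_id."""
    links = build_links(poems, key)
    value = next(p[key] for p in poems if p["id"] == poem_id)
    return [pid for pid in links[value] if pid != poem_id]
```

With this grouping, the size of an author's group gives the count of that author's poems in the database, which is the rough indicator the text mentions.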
Step S400: retrieve poem data from the poem database with the morpheme as the word-building unit of words and phrases, and obtain the full original text, translation, pronunciation, author, history, and other information from the poem data link tree.
As one embodiment, the poem database is searched by retrieval morphemes, which are divided into single-character morphemes and word morphemes.
A word morpheme has at least two characters, that is, it contains more than one single-character morpheme, and these single-character morphemes combine into a meaningful unit with a fixed meaning.
A single-character morpheme can be entered with any existing Chinese input method, such as handwriting input, pinyin input, phonetic input, or Wubi input;
Word morphemes include but are not limited to idioms in modern Chinese, verses in classical poems, various proper nouns, famous names, and so on;
For example, "People's Republic of China", "execution", "left home a mere child and come back an old man", "penniless", and "Li Bai" are all word morphemes.
For example, to retrieve the line "left home a mere child and come back an old man", the user can first enter the single-character morpheme for "leave" and then the character count "7", indicating that the word morpheme is a seven-character line, and so retrieve the classical poem quickly and easily.
As a preferred embodiment, the morpheme-based Chinese intelligent retrieval method further includes the following step:
Step S500: if the required poem is not retrieved, return directly; or return to step S300, add the poem being sought to the relational poem database as new poem data, build its relational data link tree, and then return and exit.
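The two outcomes of step S500 can be sketched as a single function. The dictionary-based index and the name `retrieve_or_add` are hypothetical stand-ins for the relational database and link tree; the sketch shows only the control flow.

```python
def retrieve_or_add(index, query, new_poem=None):
    """Return (poems, added) for a query against a toy index.

    index    -- dict mapping a retrieval key to a list of poems
    query    -- the retrieval key (for example a word morpheme)
    new_poem -- poem to store if nothing is found; None means "return directly"
    """
    hits = index.get(query, [])
    if hits:
        return hits, False              # poem found: nothing to add
    if new_poem is None:
        return [], False                # not found: return directly
    # Not found: add the poem as new data and link it under the query key.
    index.setdefault(query, []).append(new_poem)
    return [new_poem], True
```

A caller that wants the "return directly" behaviour simply omits `new_poem`; supplying it corresponds to going back to step S300 to add and link the new data.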
As a kind of more preferably embodiment, if not return step S300
The morpheme-based Chinese intelligent processing method, system, and device of the invention can process Chinese poems conveniently, quickly, and accurately. People, especially foreigners, can translate Chinese poems accurately: after retrieving part or all of a poem, they can follow links to the translation, pronunciation, author, and so on of that part or of the full text, and so understand, read, write, and even recite Chinese poems fully and correctly. Chinese poems can also be processed intelligently, which may help increase the use of classical poetry at home and abroad and contribute to the promotion of Chinese culture.
Correspondingly, as shown in Fig. 3, the embodiment of the invention also provides a morpheme-based Chinese intelligent processing system, including system software modules implementing the above morpheme-based Chinese intelligent processing method.
As one embodiment, the morpheme-based Chinese intelligent processing system includes a collection module 10, a field establishment module 20, a relational linking module 30, and a retrieval module 40, wherein:
The collection module 10 is used to collect poem data, especially classical poem data, with the morpheme as the word-building unit;
The field establishment module 20 is used to establish the fields of the poem database and generate the fields of the relational poem database;
The relational linking module 30 is used to add the collected poem data to the fields of the relational poem database and to establish the data link trees within and between the poem data;
The retrieval module 40 is used to retrieve poem data from the poem database with the morpheme as the word-building unit of words and phrases, and to obtain the full original text, translation, pronunciation, author, history, and other information from the poem data link tree.
As a preferred embodiment, the morpheme-based Chinese intelligent processing system further includes a data addition module 50, which, if the required poem is not retrieved, returns to the relational linking module 40, takes that poem as new poem data, adds it to the relational poem database, establishes the data link trees within and between the poem data, and then returns and exits.
As a further preferred embodiment, the relational linking module 40 includes an addition submodule 41, a tree-building submodule 42, and a linking submodule 43, wherein:
The addition submodule 41 is used to add the collected poem data, especially classical poem data, to the fields of the relational poem database;
The tree-building submodule 42 is used to establish a data link tree with morphemes as the root, single-character morphemes and word morphemes as the branches, and poem data as the leaves;
The linking submodule 43 is used to establish the corresponding links between the poem data items of the data link tree.
Based on the Chinese intelligent processing system of the embodiment of the invention, as shown in Fig. 3, the embodiment also provides a storage medium holding the computer software program of the morpheme-based Chinese intelligent processing system. The medium resides independently in hardware such as tablets, mobile phones, and desktop computers, and is electrically connected to a CPU (Central Processing Unit), which loads the computer software program of the Chinese intelligent processing system from the storage medium and executes it.
The steps of the method or algorithm described in connection with the embodiments disclosed herein can be implemented in hardware, in a software module executed by a processor, or in a combination of the two. The storage medium can be random access memory (RAM), main memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the technical field.
The working process of the morpheme-based Chinese intelligent processing system and device of the embodiment of the invention is essentially the same as that of the morpheme-based Chinese intelligent processing method and yields essentially the same beneficial effects, so it is not described again in detail here.
A person of ordinary skill in the art will further appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To illustrate the interchangeability of hardware and software clearly, the composition and steps of each example have been described above in general terms of function. Whether these functions are implemented in hardware or software depends on the specific application and the design constraints of the technical solution. A person of ordinary skill in the art may implement the described functions differently for each specific application, but such implementations should not be considered beyond the scope of the invention.
The specific embodiments described above further explain the objects, technical solutions, and beneficial effects of the invention in detail. It should be understood that the foregoing is only a specific embodiment of the invention and is not intended to limit its scope of protection; any modification, equivalent substitution, or improvement made within the spirit and principles of the invention shall fall within its scope of protection.
Claims (18)
1. A morpheme-based Chinese intelligent processing method, characterized by comprising the following steps:
Collecting poem data, with the morpheme as the word-building unit;
Establishing the fields of a poem database and generating the fields of a relational poem database;
Adding the collected poem data to the fields of the relational poem database, and establishing data link trees within and between the poem data, thereby generating a relational morpheme database containing the poem data.
2. The Chinese intelligent processing method according to claim 1, characterized by further comprising the following step:
Retrieving poem data from the poem database, with the morpheme as the word-building unit of words and phrases.
3. The Chinese intelligent processing method according to claim 2, characterized by further comprising the following step:
Obtaining one or more of the full original text, translation, pronunciation, author, and history from the poem data link tree.
4. The Chinese intelligent processing method according to claim 2 or 3, characterized by further comprising the following step:
If the required poem is not retrieved, returning directly; or adding the poem being sought to the relational poem database as new poem data, building its relational data link tree, and then returning and exiting.
5. The Chinese intelligent processing method according to claim 1, characterized in that adding the data and establishing relational links for the data includes the following steps:
Adding the collected poem data to the fields of the relational poem database;
Establishing a data link tree with morphemes as the root, single-character morphemes and word morphemes as the branches, and poem data as the leaves;
Establishing the corresponding links between the poem data items of the data link tree.
6. The Chinese intelligent processing method according to claim 5, characterized in that the morpheme is the smallest linguistic unit, smaller than a word, and the same character corresponds to multiple morphemes;
The most significant difference between a morpheme and a character is that a morpheme expresses meaning and is neutral with respect to written form: it can be displayed in a variety of different fonts, so its code is called a "neutral code".
7. The Chinese intelligent processing method according to claim 6, characterized in that, from the perspective of word formation, the morphemes are divided into (1) Chinese language morphemes; (2) surname morphemes; (3) given-name morphemes; (4) place-name morphemes; (5) science and technology morphemes; (6) classical Chinese morphemes; (7) meaningless phonetic morphemes; (8) form-representing morphemes.
8. The Chinese intelligent processing method according to claim 6, characterized in that the poem data are classical poem data;
The classical poem data are one or a combination of more than one of the Tang poems, the Song ci, the Book of Songs, the Records of the Grand Historian, the Shuowen Jiezi ("Origin of Chinese Characters"), the Three Character Classic, and the Kangxi Dictionary.
9. The Chinese intelligent processing method according to claim 6, characterized in that the poem database includes a poem original-text field and a poem translation field.
10. The Chinese intelligent processing method according to claim 9, characterized in that the poem database further includes one or a combination of more than one of a Chinese pronunciation field for the original text, a foreign-language pronunciation field, a line field, a foreign-language line field, an author field, and an author-translation field.
11. The Chinese intelligent processing method according to claim 10, characterized in that the poem database is searched by retrieval morphemes;
The retrieval morphemes are divided into single-character morphemes and word morphemes;
A word morpheme has at least two characters, that is, it contains more than one single-character morpheme, and these single-character morphemes combine into a meaningful unit with a fixed meaning.
12. a kind of Chinese intelligent processing system based on morpheme, which is characterized in that including described in any one of claim 1 to 11
The Chinese intelligent processing method based on morpheme computer system software module.
13. The Chinese intelligent processing system according to claim 12, characterized by comprising a collection module, a field establishment module, and a relational link module, wherein:
The collection module is used for collecting poem data, especially classical poetry data, with morphemes as the word-building units;
The field establishment module is used for establishing the fields of the poem database and generating each field of the relational poem database;
The relational link module is used for adding the collected poem data to each field of the relational poem database, and for establishing the data link tree within the poem data and between poem data.
14. The Chinese intelligent processing system according to claim 13, characterized by further comprising a retrieval module for retrieving poem data from the poem database using morphemes, the word-building units of words and phrases, and for obtaining, according to the poem data link tree, one or a combination of more than one of the full original text, the translation, the pronunciation, the author, and the history.
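The retrieval module of claim 14 returns a chosen combination of fields for each matching poem. A minimal sketch follows, assuming the link tree has already been flattened into a dictionary that maps each morpheme to the positions of the poems containing it; the field identifiers, the `retrieve` function, and the sample record are all hypothetical.

```python
# Field identifiers are assumptions; claim 14 lists full original text,
# translation, pronunciation, author and history as retrievable items.
FIELDS = ("original_text", "translation", "pronunciation", "author", "history")


def retrieve(records, index, morpheme, fields=("original_text",)):
    """Look up a morpheme and return only the requested fields of each hit."""
    unknown = set(fields) - set(FIELDS)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    return [
        {field: records[i].get(field) for field in fields}
        for i in index.get(morpheme, [])
    ]


# Illustrative sample data.
records = [
    {
        "original_text": "床前明月光",
        "pronunciation": "chuáng qián míng yuè guāng",
        "author": "李白",
    }
]
index = {"月": [0], "明月": [0]}

result = retrieve(records, index, "明月", fields=("original_text", "author"))
```

A morpheme that is absent from the index simply yields an empty list; fields that a record does not carry come back as `None`.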
15. The Chinese intelligent processing system according to claim 13 or 14, characterized by further comprising a data addition module which, if the required poem is not retrieved, either returns directly, or adds other retrieved poems to the relational poem database as new poem data, builds the relational data link tree for them, and then returns and exits.
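The two outcomes that claim 15 describes for a failed search can be sketched as follows. This is an assumption-laden illustration: the database is a plain list of poem texts, the link structure is reduced to a single-character index, and the names `link_poem` and `handle_miss` are invented for the example.

```python
def link_poem(index, poem_id, text):
    """Link a poem into the index under each distinct character it contains."""
    for ch in set(text):
        index.setdefault(ch, set()).add(poem_id)


def handle_miss(database, index, other_poems=None):
    """Handle a search that found nothing; return the number of poems added.

    With no other poems available the function returns directly. Otherwise
    each new poem is appended to the database and linked before returning.
    """
    if not other_poems:
        return 0
    added = 0
    for text in other_poems:
        if text in database:  # skip poems that are already stored
            continue
        database.append(text)
        link_poem(index, len(database) - 1, text)
        added += 1
    return added


# Illustrative sample data.
database = ["床前明月光"]
index = {}
link_poem(index, 0, database[0])

nothing_added = handle_miss(database, index)               # returns directly
one_added = handle_miss(database, index, ["春眠不觉晓"])    # adds and links
```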
16. The Chinese intelligent processing system according to claim 13 or 14, characterized in that the relational link module comprises an addition submodule, a tree establishment submodule, and a link submodule, wherein:
The addition submodule is used for adding the collected poem data, especially classical poetry data, to each field of the relational poem database;
The tree establishment submodule is used for establishing a data link tree with the morpheme as the root, single-character morphemes and word morphemes as the branches, and poem data as the leaves;
The link submodule is used for establishing the corresponding links between the poem data of the data link tree.
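The tree described in claim 16 (morpheme as root, single-character and word morphemes as branches, poem data as leaves) can be sketched with nested dictionaries. This is one possible reading under stated assumptions: word morphemes are supplied as a known list rather than discovered by segmentation, and two poems are considered linked when they share a branch. The function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_link_tree(poems, word_morphemes):
    """Build the tree: the root dict holds two branch groups, leaves are poem ids."""
    tree = {"single": defaultdict(list), "word": defaultdict(list)}
    for poem_id, text in enumerate(poems):
        # dict.fromkeys removes repeated characters while keeping their order.
        for ch in dict.fromkeys(text):
            tree["single"][ch].append(poem_id)
        for word in word_morphemes:
            if word in text:
                tree["word"][word].append(poem_id)
    return tree


def link_poems(tree):
    """Link every pair of poems that hang off the same branch."""
    links = defaultdict(set)
    for branch_group in tree.values():
        for leaves in branch_group.values():
            for a, b in combinations(leaves, 2):
                links[a].add(b)
                links[b].add(a)
    return links


# Illustrative sample data.
poems = ["床前明月光", "明月松间照", "春眠不觉晓"]
tree = build_link_tree(poems, word_morphemes=["明月"])
links = link_poems(tree)
```

In this sample the first two poems share the branches 明, 月, and 明月, so they are linked to each other, while the third poem shares no morpheme with the others and has no links.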
17. A storage medium, characterized by storing the computer software programs of the morpheme-based Chinese intelligent processing system according to any one of claims 12 to 16.
18. A hardware device comprising a CPU, characterized by further comprising the storage medium according to claim 17, which is electrically connected to the CPU;
The CPU calls the computer software programs of the Chinese intelligent processing system from the storage medium and executes them.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710446493 | 2017-06-14 | ||
CN2017104464935 | 2017-06-14 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109086285A | 2018-12-25 |
CN109086285B | 2021-10-15 |
Family
ID=64839127
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710857227.1A (Active; published as CN109086285B) | Intelligent Chinese processing method, system and device based on morphemes | 2017-06-14 | 2017-09-21 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109086285B (en) |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1777888A (en) * | 2003-04-24 | 2006-05-24 | 禹蕣朝 | Method for sentence structure analysis based on mobile configuration concept and method for natural language search using of it |
US7092870B1 (en) * | 2000-09-15 | 2006-08-15 | International Business Machines Corporation | System and method for managing a textual archive using semantic units |
US7225181B2 (en) * | 2000-02-04 | 2007-05-29 | Fujitsu Limited | Document searching apparatus, method thereof, and record medium thereof |
US20070213974A1 (en) * | 2006-03-09 | 2007-09-13 | Fujitsu Limited | Syntax analysis program, syntax analysis method, syntax analysis device, and computer-readable medium storing syntax analysis program |
CN101059915A (en) * | 2006-10-18 | 2007-10-24 | 杨红春 | A system for foreigner learning the common Chinese |
CN101075252A (en) * | 2007-06-21 | 2007-11-21 | 腾讯科技(深圳)有限公司 | Method and system for searching network |
CN101599078A (en) * | 2009-07-10 | 2009-12-09 | 腾讯科技(深圳)有限公司 | A kind of method of text retrieval and device |
CN102184170A (en) * | 2011-06-17 | 2011-09-14 | 成都成电医星数字健康软件有限公司 | Morpheme-level analyzing method for clinical Chinese language |
CN102375838A (en) * | 2010-08-17 | 2012-03-14 | 富士通株式会社 | Method and device for constructing polarity morpheme database, and method and device for determining polarity of words |
CN102567423A (en) * | 2010-12-31 | 2012-07-11 | 成都致远诺亚舟教育科技有限公司 | Method and system for associated search of poetry |
CN103605665A (en) * | 2013-10-24 | 2014-02-26 | 杭州电子科技大学 | Keyword based evaluation expert intelligent search and recommendation method |
CN105574067A (en) * | 2014-10-31 | 2016-05-11 | 株式会社东芝 | Item recommendation device and item recommendation method |
CN106372039A (en) * | 2016-08-18 | 2017-02-01 | 王欣 | Standard Chinese information ASCII system codes |
Non-Patent Citations (3)
Title |
---|
Y. ZIEMAN et al.: "Semantic labeling - unveiling the main components of meaning of free-text", 《PROCEEDINGS EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL》 * |
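于华: "Analysis and Design of an Intelligent Teaching System for Chinese as a Foreign Language", 《China Master's Theses Full-text Database, Philosophy and Humanities Series》 * |
邢红兵: "Construction of a Morpheme Database Based on the 'Outline of Graded Vocabulary for the Chinese Proficiency Test'", 《Research on Theory and Methods of Digital Teaching of Chinese as a Foreign Language》 * |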
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109871442A (en) * | 2019-01-18 | 2019-06-11 | 程家惠 | A kind of Sino-British rendering method, device, equipment and the medium of Chinese character calligraphy text |
CN109871442B (en) * | 2019-01-18 | 2024-08-09 | 程家惠 | Chinese and English presentation method, device, equipment and medium for Chinese character handwriting characters |
CN109948157A (en) * | 2019-03-13 | 2019-06-28 | 日照职业技术学院 | A kind of poem is collected and data analysing method |
CN112434137A (en) * | 2020-12-11 | 2021-03-02 | 乐山师范学院 | Poetry retrieval method and system based on artificial intelligence |
CN112434137B (en) * | 2020-12-11 | 2023-04-11 | 乐山师范学院 | Poetry retrieval method and system based on artificial intelligence |
Also Published As
Publication number | Publication date |
---|---|
CN109086285B (en) | 2021-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102799577B (en) | A kind of Chinese inter-entity semantic relation extraction method | |
CN110502621A (en) | Answering method, question and answer system, computer equipment and storage medium | |
CN114036930A (en) | Text error correction method, device, equipment and computer readable medium | |
CN106066866A (en) | A kind of automatic abstracting method of english literature key phrase and system | |
CN108984661A (en) | Entity alignment schemes and device in a kind of knowledge mapping | |
WO2017193472A1 (en) | Method of establishing digital dongba ancient text interpretive library | |
CN110457690A (en) | A Method for Judging the Inventiveness of a Patent | |
CN107656921A (en) | A kind of short text dependency analysis method based on deep learning | |
CN107590119B (en) | Method and device for extracting person attribute information | |
CN109086285A (en) | Chinese intelligent processing method and system and device based on morpheme | |
Hämäläinen et al. | Finding Sami cognates with a character-based NMT approach | |
Dundes | On computers and folk tales | |
Yona et al. | A finite-state morphological grammar of Hebrew | |
CN117972025B (en) | Massive text retrieval matching method based on semantic analysis | |
CN109344390A (en) | A method of Cambodian entity recognition based on multi-feature neural network | |
Falahati Qadimi Fumani et al. | Inconsistent transliteration of Iranian university names: a hazard to Iran’s ranking in ISI Web of Science | |
Kilic et al. | Named entity recognition on morphologically rich language: Exploring the performance of BERT with varying training levels | |
CN112667819A (en) | Entity description reasoning knowledge base construction and reasoning evidence quantitative information acquisition method and device | |
Bizzoni et al. | Some steps towards the generation of diachronic WordNets | |
CN107818078B (en) | Semantic association and matching method for Chinese natural language dialogue | |
Ali et al. | Word embedding based new corpus for low-resourced language: Sindhi | |
CN115408532A (en) | Open source information-oriented weapon equipment knowledge graph construction method, system, device and storage medium | |
Cheng et al. | The revised wordframe model for the Filipino language | |
Wang et al. | Chinese Named Entity Recognition Base on Multi-Metadata Embedding and Global Pointer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||