CN109344355A - Automatic regression detection and block matching self-adaption method and device for webpage change
- Publication number
- CN109344355A (application number CN201811124012.XA)
- Authority
- CN
- China
- Prior art keywords
- webpage
- block
- matching
- web evolution
- new
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
- G06F8/71—Version control; Configuration management
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811124012.XA CN109344355B (en) | 2018-09-26 | 2018-09-26 | Automatic regression detection and block matching self-adaption method and device for webpage change |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811124012.XA CN109344355B (en) | 2018-09-26 | 2018-09-26 | Automatic regression detection and block matching self-adaption method and device for webpage change |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109344355A (en) | 2019-02-15 |
CN109344355B (en) | 2022-03-15 |
Family
ID=65306539
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811124012.XA (Active; CN109344355B (en)) | Automatic regression detection and block matching self-adaption method and device for webpage change | 2018-09-26 | 2018-09-26 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109344355B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110968761A (en) * | 2019-11-29 | 2020-04-07 | 福州大学 | A method for adaptive extraction of web page structured data |
CN111079043A (en) * | 2019-12-05 | 2020-04-28 | 北京数立得科技有限公司 | Key content positioning method |
CN111158973A (en) * | 2019-12-05 | 2020-05-15 | 北京大学 | Web application dynamic evolution monitoring method |
CN112115111A (en) * | 2019-06-20 | 2020-12-22 | 上海怀若智能科技有限公司 | OCR-based document version management method and system |
CN112887381A (en) * | 2021-01-15 | 2021-06-01 | 中国地质大学(武汉) | Method and device for detecting and converging new content facing specific network entrance |
CN113626028A (en) * | 2020-05-07 | 2021-11-09 | 腾讯科技(深圳)有限公司 | Page element mapping method and device |
Citations (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2001268674B2 (en) * | 2000-06-22 | 2007-04-26 | Microsoft Technology Licensing, Llc | Distributed computing services platform |
CN101026503A (en) * | 2006-02-24 | 2007-08-29 | 国际商业机器公司 | Unit detection method and apparatus in Web service business procedure |
CN101127044A (en) * | 2007-06-08 | 2008-02-20 | 北京大学 | Blocking Method for Dynamic Web Pages |
CN101141449A (en) * | 2007-10-22 | 2008-03-12 | 珠海金山软件股份有限公司 | Apparatus and method for implementing Web client terminal software self-adaptive running |
CN101174899A (en) * | 2007-11-26 | 2008-05-07 | 中兴通讯股份有限公司 | Automatic test method for service protection and recovery in ASON network |
CN101178708A (en) * | 2006-11-07 | 2008-05-14 | 北京酷讯科技有限公司 | Automatic template information locating method for structured web pages |
CN101207639A (en) * | 2007-12-03 | 2008-06-25 | 华为技术有限公司 | Method and apparatus of classifying for user |
CN101251855A (en) * | 2008-03-27 | 2008-08-27 | 腾讯科技(深圳)有限公司 | Equipment, system and method for cleaning internet web page |
CN101266603A (en) * | 2007-03-12 | 2008-09-17 | 北京搜狗科技发展有限公司 | Webpage information sorting method, system and service system applying the classification |
US20080281834A1 (en) * | 2007-05-09 | 2008-11-13 | Microsoft Corporation | Block tracking mechanism for web personalization |
CN101408877A (en) * | 2007-10-10 | 2009-04-15 | 英业达股份有限公司 | Tree node loading system and method thereof |
CN101477571A (en) * | 2009-01-07 | 2009-07-08 | 华天清 | Method and apparatus for marking network contents semantic structure |
CN101546261A (en) * | 2008-10-10 | 2009-09-30 | 华中科技大学 | Secure web page tag library system supported by multiple strategies |
CN101593184A (en) * | 2008-05-29 | 2009-12-02 | 国际商业机器公司 | The system and method for self-adaptively locating dynamic web page elements |
CN101655862A (en) * | 2009-08-11 | 2010-02-24 | 华天清 | Method and device for searching information object |
US20100287132A1 (en) * | 2009-05-05 | 2010-11-11 | Paul A. Lipari | System, method and computer readable medium for recording authoring events with web page content |
CN102004805A (en) * | 2010-12-30 | 2011-04-06 | 上海交通大学 | Webpage denoising system and method based on maximum similarity matching |
CN102663023A (en) * | 2012-03-22 | 2012-09-12 | 浙江盘石信息技术有限公司 | Implementation method for extracting web content |
CN102662969A (en) * | 2012-03-11 | 2012-09-12 | 复旦大学 | Internet information object positioning method based on webpage structure semantic meaning |
CN102890681A (en) * | 2011-07-20 | 2013-01-23 | 阿里巴巴集团控股有限公司 | Method and system for generating webpage structure template |
CN102955854A (en) * | 2012-11-06 | 2013-03-06 | 北京中娱在线网络科技有限公司 | Webpage presenting method and device based on HTML5 (Hypertext Markup Language 5) protocol |
US20130332451A1 (en) * | 2012-06-06 | 2013-12-12 | Fliptop, Inc. | System and method for correlating personal identifiers with corresponding online presence |
US20140123186A1 (en) * | 2002-05-10 | 2014-05-01 | Convergent Media Solutions Llc | Method and apparatus for browsing using alternative linkbases |
CN108345687A (en) * | 2018-03-09 | 2018-07-31 | 沈文策 | 3D web page display method and apparatus |
- 2018-09-26: application CN201811124012.XA filed in CN (patent CN109344355B, status: Active)
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112115111A (en) * | 2019-06-20 | 2020-12-22 | 上海怀若智能科技有限公司 | OCR-based document version management method and system |
CN110968761A (en) * | 2019-11-29 | 2020-04-07 | 福州大学 | A method for adaptive extraction of web page structured data |
WO2021103557A1 (en) * | 2019-11-29 | 2021-06-03 | 福州大学 | Adaptive extraction method for webpage structured data |
CN110968761B (en) * | 2019-11-29 | 2022-07-08 | 福州大学 | A method for adaptive extraction of web page structured data |
CN111079043A (en) * | 2019-12-05 | 2020-04-28 | 北京数立得科技有限公司 | Key content positioning method |
CN111158973A (en) * | 2019-12-05 | 2020-05-15 | 北京大学 | Web application dynamic evolution monitoring method |
CN111158973B (en) * | 2019-12-05 | 2021-06-18 | 北京大学 | A method for monitoring the dynamic evolution of web applications |
CN111079043B (en) * | 2019-12-05 | 2023-05-12 | 北京数立得科技有限公司 | Key content positioning method |
CN113626028A (en) * | 2020-05-07 | 2021-11-09 | 腾讯科技(深圳)有限公司 | Page element mapping method and device |
CN112887381A (en) * | 2021-01-15 | 2021-06-01 | 中国地质大学(武汉) | Method and device for detecting and converging new content facing specific network entrance |
CN112887381B (en) * | 2021-01-15 | 2022-07-19 | 中国地质大学(武汉) | Method and apparatus for new content detection and aggregation for specific network portals |
Also Published As
Publication number | Publication date |
---|---|
CN109344355B (en) | 2022-03-15 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN109344355A (en) | Automatic regression detection and block matching self-adaption method and device for webpage change | |
US11256856B2 (en) | Method, device, and system, for identifying data elements in data structures | |
US5794257A (en) | Automatic hyperlinking on multimedia by compiling link specifications | |
US8799772B2 (en) | System and method for gathering, indexing, and supplying publicly available data charts | |
US7941420B2 (en) | Method for organizing structurally similar web pages from a web site | |
Kovbasistyi et al. | Method for detection of non-relevant and wrong information based on content analysis of web resources | |
US8359294B2 (en) | Incorrect hyperlink detecting apparatus and method | |
US8676814B2 (en) | Automatic face annotation of images contained in media content | |
WO2008021561A2 (en) | Joint optimization of wrapper generation and template detection | |
US20090125529A1 (en) | Extracting information based on document structure and characteristics of attributes | |
CN102662969B (en) | A Method for Locating Internet Information Objects Based on Webpage Structural Semantics | |
Papadakis et al. | Stavies: A system for information extraction from unknown web data sources through automatic web wrapper generation using clustering techniques | |
CN109325201A (en) | Generation method, device, equipment and the storage medium of entity relationship data | |
CN106960058B (en) | Webpage structure change detection method and system | |
CN111079043A (en) | Key content positioning method | |
US20060200457A1 (en) | Extracting information from formatted sources | |
CN103853738A (en) | Identification method for webpage information related region | |
US20100185684A1 (en) | High precision multi entity extraction | |
JP2019032704A (en) | Table data structuring system and table data structuring method | |
CN111158973B (en) | A method for monitoring the dynamic evolution of web applications | |
US20080015843A1 (en) | Linguistic Image Label Incorporating Decision Relevant Perceptual, Semantic, and Relationships Data | |
CN105279249B (en) | A method and device for determining the confidence of point of interest data in a website | |
CN105160032B (en) | The determination method and device of the confidence level of interest point data in a kind of website | |
CN115186240A (en) | Social network user alignment method, device and medium based on relevance information | |
CN114238735A (en) | Intelligent internet data acquisition method |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
CP01 | Change in the name or title of a patent holder | Address after: No. 826, building 12345, Phoenix legend, Hanbang, Jingyue Development Zone, Changchun City, Jilin Province; Patentee after: Intel Technology Co.,Ltd.; Address before: No. 826, building 12345, Phoenix legend, Hanbang, Jingyue Development Zone, Changchun City, Jilin Province; Patentee before: Changchun interui Software Co.,Ltd. |
CP03 | Change of name, title or address | Address after: No. 826, building 12345, Phoenix legend, Hanbang, Jingyue Development Zone, Changchun City, Jilin Province; Patentee after: Changchun interui Software Co.,Ltd.; Address before: Room 1626, No. 65, North Fourth Ring West Road, Haidian District, Beijing 100080; Patentee before: BEIJING INTERNETWARE Ltd. |
CP03 | Change of name, title or address | Address after: 130117, 30th floor, Building A2, Mingyu Plaza, No. 3777 Ecological Street, Jingyue High tech Industrial Development Zone, Changchun City, Jilin Province; Patentee after: Shenqi Digital Co.,Ltd.; Country or region after: China; Address before: No. 826, building 12345, Phoenix legend, Hanbang, Jingyue Development Zone, Changchun City, Jilin Province; Patentee before: Intel Technology Co.,Ltd.; Country or region before: China |