Welcome![Sign In][Sign Up]
Location:
Downloads Documents Software Engineering
Title: design-documents Download
 Description: Web information collection subsystem: the pages collection: dynamic lookup and real-time analysis of the new website, read pages Replies. Web filtering: to clear advertising pages, navigation information, images, copyright noise data access pages via web cleaning module, extract the relevant page title, text, link address, acquisition time, posts, posting number and other data. Web information preprocessing subsystem: Web review dirty: The main features include the Chinese word segmentation, POS tagging, named entity recognition, new word identification, to establish a database description of each type of thesaurus, and the establishment of a sensitive word thesaurus. Page opinion monitoring: monitoring whether there is a certain influence public opinion, that a reply to the post or support other top reaches a preset threshold public opinion analysis subsystems: summary website public opinion, public opinion monitoring information: summary of the entire site, automatically generate re
 Downloaders recently: [More information of uploader 黄生]
 To Search:
File list (Check if you may need any files):
 

design documents.pdf
    

CodeBus www.codebus.net