Knowledge Discovery from Multi-Sourced Data

· ·
· Springer Nature
eBook
83
페이지
검증되지 않은 평점과 리뷰입니다.  자세히 알아보기

eBook 정보

This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to “label” or tell which data source is more reliable.Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.

저자 정보

Chen Ye is currently an Associate Researcher at the School of Computer Science and Technology, Hangzhou Dianzi University, China. She received the Ph.D. degree in Computer Software and Theory from Harbin Institute of Technology, China. Her current research interests include data repairing, truth discovery, and crowdsourcing. She has won the ACM SIGMOD China Doctoral Dissertation Award in 2020.

Hongzhi Wang is a Professor and Doctoral Supervisor at the School of Computer Science and Technology, Harbin Institute of Technology, China. His research interests include big data management and analysis, data quality, graph data management, and web data management. He has published more than 150 papers, and he is the Primary Investigator of more than 10 projects including three NSFC projects, and co-PI of 973, 863, and NSFC key projects. He was awarded as Microsoft fellowship, China Excellent Database Engineer, and IBM Ph.D. fellowship.

Guojun Dai is now working in the School of Computer Science and Technology of Hangzhou Dianzi University, as the Head of the National Brain-Computer Collaborative Intelligent Technology International Joint Research Center, the director of the Institute of Computer Application Technology. His research interests include Internet of Things, industrial big data, network collaborative manufacturing, edge computing, brain-computer interface, cognitive computing, artificial intelligence. He has published over 50 research papers in top-quality international conferences and journals, particularly, INFOCOM, IEEE Transactions on Industrial Informatics, and IEEE Transactions on Mobile Computing.

이 eBook 평가

의견을 알려주세요.

읽기 정보

스마트폰 및 태블릿
AndroidiPad/iPhoneGoogle Play 북 앱을 설치하세요. 계정과 자동으로 동기화되어 어디서나 온라인 또는 오프라인으로 책을 읽을 수 있습니다.
노트북 및 컴퓨터
컴퓨터의 웹브라우저를 사용하여 Google Play에서 구매한 오디오북을 들을 수 있습니다.
eReader 및 기타 기기
Kobo eReader 등의 eBook 리더기에서 읽으려면 파일을 다운로드하여 기기로 전송해야 합니다. 지원되는 eBook 리더기로 파일을 전송하려면 고객센터에서 자세한 안내를 따르세요.

Chen Ye 작가의 책 더보기

비슷한 eBook