Normal view MARC view ISBD view

Dirty Data Processing for Machine Learning (Record no. 186804)

MARC details
000 -LEADER
fixed length control field	04185nam a22005415i 4500
001 - CONTROL NUMBER
control field	978-981-99-7657-7
003 - CONTROL NUMBER IDENTIFIER
control field	DE-He213
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20240423130251.0
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field	cr nn 008mamaa
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	231129s2024 si \| s \|\|\|\| 0\|eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number	9789819976577
--	978-981-99-7657-7
024 7# - OTHER STANDARD IDENTIFIER
Standard number or code	10.1007/978-981-99-7657-7
Source of number or code	doi
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number	Q336
072 #7 - SUBJECT CATEGORY CODE
Subject category code	UN
Source	bicssc
072 #7 - SUBJECT CATEGORY CODE
Subject category code	COM021000
Source	bisacsh
072 #7 - SUBJECT CATEGORY CODE
Subject category code	UN
Source	thema
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	005.7
Edition number	23
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name	Qi, Zhixin.
Relator term	author.
Relator code	aut
--	http://id.loc.gov/vocabulary/relators/aut
245 10 - TITLE STATEMENT
Title	Dirty Data Processing for Machine Learning
Medium	[electronic resource] /
Statement of responsibility, etc	by Zhixin Qi, Hongzhi Wang, Zejiao Dong.
250 ## - EDITION STATEMENT
Edition statement	1st ed. 2024.
264 #1 -
--	Singapore :
--	Springer Nature Singapore :
--	Imprint: Springer,
--	2024.
300 ## - PHYSICAL DESCRIPTION
Extent	XIII, 133 p. 1 illus.
Other physical details	online resource.
336 ## -
--	text
--	txt
--	rdacontent
337 ## -
--	computer
--	c
--	rdamedia
338 ## -
--	online resource
--	cr
--	rdacarrier
347 ## -
--	text file
--	PDF
--	rda
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note	Chapter 1. Introduction -- Chapter 2. Impacts of Dirty Data on Classification and Clustering Models -- Chapter 3. Dirty-Data Impacts on Regression Models -- Chapter 4. Incomplete Data Classification with View-Based Decision Tree -- Chapter 5. Density-Based Clustering for Incomplete Data -- Chapter 6. Feature Selection on Inconsistent Data -- Chapter 7. Cost-Sensitive Decision Tree Induction on Dirty Data.
520 ## - SUMMARY, ETC.
Summary, etc	In both the database and machine learning communities, data quality has become a serious issue which cannot be ignored. In this context, we refer to data with quality problems as “dirty data.” Clearly, for a given data mining or machine learning task, dirty data in both training and test datasets can affect the accuracy of results. Accordingly, this book analyzes the impacts of dirty data and explores effective methods for dirty data processing. Although existing data cleaning methods improve data quality dramatically, the cleaning costs are still high. If we knew how dirty data affected the accuracy of machine learning models, we could clean data selectively according to the accuracy requirements instead of cleaning all dirty data, which entails substantial costs. However, no book to date has studied the impacts of dirty data on machine learning models in terms of data quality. Filling precisely this gap, the book is intended for a broad audience ranging from researchers inthe database and machine learning communities to industry practitioners. Readers will find valuable takeaway suggestions on: model selection and data cleaning; incomplete data classification with view-based decision trees; density-based clustering for incomplete data; the feature selection method, which reduces the time costs and guarantees the accuracy of machine learning models; and cost-sensitive decision tree induction approaches under different scenarios. Further, the book opens many promising avenues for the further study of dirty data processing, such as data cleaning on demand, constructing a model to predict dirty-data impacts, and integrating data quality issues into other machine learning models. Readers will be introduced to state-of-the-art dirty data processing techniques, and the latest research advances, while also finding new inspirations in this field.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Artificial intelligence
General subdivision	Data processing.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Data mining.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Big data.
650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Data Science.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Data Mining and Knowledge Discovery.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Big Data.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name	Wang, Hongzhi.
Relator term	author.
Relator code	aut
--	http://id.loc.gov/vocabulary/relators/aut
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name	Dong, Zejiao.
Relator term	author.
Relator code	aut
--	http://id.loc.gov/vocabulary/relators/aut
710 2# - ADDED ENTRY--CORPORATE NAME
Corporate name or jurisdiction name as entry element	SpringerLink (Online service)
773 0# - HOST ITEM ENTRY
Title	Springer Nature eBook
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Display text	Printed edition:
International Standard Book Number	9789819976560
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Display text	Printed edition:
International Standard Book Number	9789819976584
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Display text	Printed edition:
International Standard Book Number	9789819976591
856 40 - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier	<a href="https://doi.org/10.1007/978-981-99-7657-7">https://doi.org/10.1007/978-981-99-7657-7</a>
912 ## -
--	ZDB-2-SCS
912 ## -
--	ZDB-2-SXCS
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type	eBooks-CSE-Springer

No items available.

Print
Add to your cart (remove)
Save record
BIBTEX Dublin Core MARCXML MARC (non-Unicode/MARC-8) MARC (Unicode/UTF-8) MARC (Unicode/UTF-8, Standard) MODS (XML) RIS
More searches

Search for this title in:
Other Libraries (WorldCat) Other Databases (Google Scholar) Online Stores (Bookfinder.com) Open Library (openlibrary.org)