Robust Representation for Data Analytics

Robust Representation for Data Analytics PDF

Author: Sheng Li

Publisher: Springer

Published: 2017-08-09

Total Pages: 229

ISBN-13: 3319601768

DOWNLOAD EBOOK →

This book introduces the concepts and models of robust representation learning, and provides a set of solutions to deal with real-world data analytics tasks, such as clustering, classification, time series modeling, outlier detection, collaborative filtering, community detection, etc. Three types of robust feature representations are developed, which extend the understanding of graph, subspace, and dictionary. Leveraging the theory of low-rank and sparse modeling, the authors develop robust feature representations under various learning paradigms, including unsupervised learning, supervised learning, semi-supervised learning, multi-view learning, transfer learning, and deep learning. Robust Representations for Data Analytics covers a wide range of applications in the research fields of big data, human-centered computing, pattern recognition, digital marketing, web mining, and computer vision.

Understanding Robust and Exploratory Data Analysis

Understanding Robust and Exploratory Data Analysis PDF

Author: David C. Hoaglin

Publisher: John Wiley & Sons

Published: 2000-06-02

Total Pages: 484

ISBN-13: 0471384917

DOWNLOAD EBOOK →

Originally published in hardcover in 1982, this book is now offered in a Wiley Classics Library edition. A contributed volume, edited by some of the preeminent statisticians of the 20th century, Understanding of Robust and Exploratory Data Analysis explains why and how to use exploratory data analysis and robust and resistant methods in statistical practice.

Learning Representation for Multi-View Data Analysis

Learning Representation for Multi-View Data Analysis PDF

Author: Zhengming Ding

Publisher: Springer

Published: 2018-12-06

Total Pages: 268

ISBN-13: 3030007340

DOWNLOAD EBOOK →

This book equips readers to handle complex multi-view data representation, centered around several major visual applications, sharing many tips and insights through a unified learning framework. This framework is able to model most existing multi-view learning and domain adaptation, enriching readers’ understanding from their similarity, and differences based on data organization and problem settings, as well as the research goal. A comprehensive review exhaustively provides the key recent research on multi-view data analysis, i.e., multi-view clustering, multi-view classification, zero-shot learning, and domain adaption. More practical challenges in multi-view data analysis are discussed including incomplete, unbalanced and large-scale multi-view learning. Learning Representation for Multi-View Data Analysis covers a wide range of applications in the research fields of big data, human-centered computing, pattern recognition, digital marketing, web mining, and computer vision.

Robustness in Data Analysis

Robustness in Data Analysis PDF

Author: Georgy L. Shevlyakov

Publisher: Walter de Gruyter

Published: 2011-12-07

Total Pages: 325

ISBN-13: 3110936003

DOWNLOAD EBOOK →

The series is devoted to the publication of high-level monographs and surveys which cover the whole spectrum of probability and statistics. The books of the series are addressed to both experts and advanced students.

Robust and Scalable Data Representation and Analysis Leveraging Isometric Transformations and Sparsity

Robust and Scalable Data Representation and Analysis Leveraging Isometric Transformations and Sparsity PDF

Author: Alireza Zaeemzadeh

Publisher:

Published: 2021

Total Pages: 229

ISBN-13:

DOWNLOAD EBOOK →

The main focus of this doctoral thesis is to study the problem of robust and scalable data representation and analysis. The success of any machine learning and signal processing framework relies on how the data is represented and analyzed. Thus, in this work, we focus on three closely related problems: (i) supervised representation learning, (ii) unsupervised representation learning, and (iii) fault tolerant data analysis. For the first task, we put forward new theoretical results on why a certain family of neural networks can become extremely deep and how we can improve this scalability property in a mathematically sound manner. We further investigate how we can employ them to generate data representations that are robust to outliers and to retrieve representative subsets of huge datasets. For the second task, we will discuss two different methods, namely compressive sensing (CS) and nonnegative matrix factorization (NMF). We show that we can employ prior knowledge, such as slow variation in time, to introduce an unsupervised learning component to the traditional CS framework and to learn better compressed representations. Furthermore, we show that prior knowledge and sparsity constraint can be used in the context of NMF, not to find sparse hidden factors, but to enforce other structures, such as piece-wise continuity. Finally, for the third task, we investigate how a data analysis framework can become robust to faulty data and faulty data processors. We employ Bayesian inference and propose a scheme that can solve the CS recovery problem in an asynchronous parallel manner. Furthermore, we show how sparsity can be used to make an optimization problem robust to faulty data measurements. The methods investigated in this work have applications in different practical problems such as resource allocation in wireless networks, source localization, image/video classification, and search engines. A detailed discussion of these practical applications will be presented for each method.

Proceedings of International Conference on Data Analytics and Insights, ICDAI 2023

Proceedings of International Conference on Data Analytics and Insights, ICDAI 2023 PDF

Author: Nabendu Chaki

Publisher: Springer Nature

Published: 2023-07-24

Total Pages: 801

ISBN-13: 9819938783

DOWNLOAD EBOOK →

The book is a collection of peer-reviewed best selected research papers presented at the International Conference on Data Analytics and Insights (ICDAI 2023), organized by Techno International, Kolkata, India, during May 11–13, 2023. The book covers important topics like sensor and network data analytics and insights; big data analytics and insights; biological and biomedical data analysis and insights; optimization techniques, time series analysis and forecasting; power and energy systems data analytics and insights; civil and environmental data analytics and insights; and industry and applications.

Advances in Big Data Analytics

Advances in Big Data Analytics PDF

Author: Yong Shi

Publisher: Springer Nature

Published: 2022-01-13

Total Pages: 733

ISBN-13: 9811636079

DOWNLOAD EBOOK →

Today, big data affects countless aspects of our daily lives. This book provides a comprehensive and cutting-edge study on big data analytics, based on the research findings and applications developed by the author and his colleagues in related areas. It addresses the concepts of big data analytics and/or data science, multi-criteria optimization for learning, expert and rule-based data analysis, support vector machines for classification, feature selection, data stream analysis, learning analysis, sentiment analysis, link analysis, and evaluation analysis. The book also explores lessons learned in applying big data to business, engineering and healthcare. Lastly, it addresses the advanced topic of intelligence-quotient (IQ) tests for artificial intelligence. /divSince each aspect mentioned above concerns a specific domain of application, taken together, the algorithms, procedures, analysis and empirical studies presented here offer a general picture of big data developments. Accordingly, the book can not only serve as a textbook for graduates with a fundamental grasp of training in big data analytics, but can also show practitioners how to use the proposed techniques to deal with real-world big data problems.

Feature Engineering for Machine Learning and Data Analytics

Feature Engineering for Machine Learning and Data Analytics PDF

Author: Guozhu Dong

Publisher: CRC Press

Published: 2018-03-14

Total Pages: 400

ISBN-13: 1351721275

DOWNLOAD EBOOK →

Feature engineering plays a vital role in big data analytics. Machine learning and data mining algorithms cannot work without data. Little can be achieved if there are few features to represent the underlying data objects, and the quality of results of those algorithms largely depends on the quality of the available features. Feature Engineering for Machine Learning and Data Analytics provides a comprehensive introduction to feature engineering, including feature generation, feature extraction, feature transformation, feature selection, and feature analysis and evaluation. The book presents key concepts, methods, examples, and applications, as well as chapters on feature engineering for major data types such as texts, images, sequences, time series, graphs, streaming data, software engineering data, Twitter data, and social media data. It also contains generic feature generation approaches, as well as methods for generating tried-and-tested, hand-crafted, domain-specific features. The first chapter defines the concepts of features and feature engineering, offers an overview of the book, and provides pointers to topics not covered in this book. The next six chapters are devoted to feature engineering, including feature generation for specific data types. The subsequent four chapters cover generic approaches for feature engineering, namely feature selection, feature transformation based feature engineering, deep learning based feature engineering, and pattern based feature generation and engineering. The last three chapters discuss feature engineering for social bot detection, software management, and Twitter-based applications respectively. This book can be used as a reference for data analysts, big data scientists, data preprocessing workers, project managers, project developers, prediction modelers, professors, researchers, graduate students, and upper level undergraduate students. It can also be used as the primary text for courses on feature engineering, or as a supplement for courses on machine learning, data mining, and big data analytics.

Big Data Analytics for Large-Scale Multimedia Search

Big Data Analytics for Large-Scale Multimedia Search PDF

Author: Stefanos Vrochidis

Publisher: John Wiley & Sons

Published: 2019-03-18

Total Pages: 421

ISBN-13: 1119377005

DOWNLOAD EBOOK →

A timely overview of cutting edge technologies for multimedia retrieval with a special emphasis on scalability The amount of multimedia data available every day is enormous and is growing at an exponential rate, creating a great need for new and more efficient approaches for large scale multimedia search. This book addresses that need, covering the area of multimedia retrieval and placing a special emphasis on scalability. It reports the recent works in large scale multimedia search, including research methods and applications, and is structured so that readers with basic knowledge can grasp the core message while still allowing experts and specialists to drill further down into the analytical sections. Big Data Analytics for Large-Scale Multimedia Search covers: representation learning, concept and event-based video search in large collections; big data multimedia mining, large scale video understanding, big multimedia data fusion, large-scale social multimedia analysis, privacy and audiovisual content, data storage and management for big multimedia, large scale multimedia search, multimedia tagging using deep learning, interactive interfaces for big multimedia and medical decision support applications using large multimodal data. Addresses the area of multimedia retrieval and pays close attention to the issue of scalability Presents problem driven techniques with solutions that are demonstrated through realistic case studies and user scenarios Includes tables, illustrations, and figures Offers a Wiley-hosted BCS that features links to open source algorithms, data sets and tools Big Data Analytics for Large-Scale Multimedia Search is an excellent book for academics, industrial researchers, and developers interested in big multimedia data search retrieval. It will also appeal to consultants in computer science problems and professionals in the multimedia industry.

Enhancing Urban Sustainability with Data, Modeling, and Simulation

Enhancing Urban Sustainability with Data, Modeling, and Simulation PDF

Author: National Academies of Sciences, Engineering, and Medicine

Publisher: National Academies Press

Published: 2019-09-24

Total Pages: 109

ISBN-13: 0309494141

DOWNLOAD EBOOK →

On January 30-31, 2019 the Board on Mathematical Sciences and Analytics, in collaboration with the Board on Energy and Environmental Systems and the Computer Science and Telecommunications Board, convened a workshop in Washington, D.C. to explore the frontiers of mathematics and data science needs for sustainable urban communities. The workshop strengthened the emerging interdisciplinary network of practitioners, business leaders, government officials, nonprofit stakeholders, academics, and policy makers using data, modeling, and simulation for urban and community sustainability, and addressed common challenges that the community faces. Presentations highlighted urban sustainability research efforts and programs under way, including research into air quality, water management, waste disposal, and social equity and discussed promising urban sustainability research questions that improved use of big data, modeling, and simulation can help address. This publication summarizes the presentation and discussion of the workshop.