Thorsten Papenbrock Book

Language: en
Pages: 149

Data Profiling

Author(s): Ziawasch Abedjan, Lukasz Golab, Felix Naumann, Thorsten Papenbrock

Categories: Computers

Type: Book
-
Published: 2022-06-01
-
Publisher: Springer Nature

Data profiling refers to the activity of collecting data about data, {i.e.}, metadata. Most IT professionals and researchers who work with data have engaged in data profiling, at least informally, to understand and explore an unfamiliar dataset or to determine whether a new dataset is appropriate for a particular task at hand. Data profiling results are also important in a variety of other situations, including query optimization, data integration, and data cleaning. Simple metadata are statistics, such as the number of rows and columns, schema and datatype information, the number of distinct values, statistical value distributions, and the number of null or empty values in each column. More...

Language: en
Pages: 653

Database Systems for Advanced Applications

Author(s): Matthias Renz, Cyrus Shahabi, Xiaofang Zhou, Muhammad Aamir Cheema

Categories: Computers

Type: Book
-
Published: 2015-04-08
-
Publisher: Springer

This two volume set LNCS 9049 and LNCS 9050 constitutes the refereed proceedings of the 20th International Conference on Database Systems for Advanced Applications, DASFAA 2015, held in Hanoi, Vietnam, in April 2015. The 63 full papers presented were carefully reviewed and selected from a total of 287 submissions. The papers cover the following topics: data mining; data streams and time series; database storage and index; spatio-temporal data; modern computing platform; social networks; information integration and data quality; information retrieval and summarization; security and privacy; outlier and imbalanced data analysis; probabilistic and uncertain data; query processing.

Language: en
Pages: 451

Internet of Things—Applications and Future

Author(s): Atef Zaki Ghalwash, Nashaat El Khameesy, Dalia A. Magdi, Amit Joshi

Categories: Technology & Engineering

Type: Book
-
Published: 2020-04-03
-
Publisher: Springer Nature

This book is a collection of the best research papers presented at the First World Conference on Internet of Things: Applications & Future (ITAF 2019), Sponsored by GR Foundation and French University in Egypt, held at Triumph Luxury Hotel, Cairo, Egypt, on 14–15 October 2019. It includes innovative works from leading researchers, innovators, business executives, and industry professionals that cover the latest advances in and applications for commercial and industrial end users across sectors within the emerging Internet of Things ecosphere. It addresses both current and emerging topics related to the Internet of Things such as big data research, new services and analytics, Internet of Things (IoT) fundamentals, electronic computation and analysis, big data for multi-discipline services, security, privacy and trust, IoT technologies, and open and cloud technologies.

Language: en
Pages: 164

The Four Generations of Entity Resolution

Author(s): George Papadakis, Ekaterini Ioannou, Emanouil Thanos, Themis Palpanas

Categories: Computers

Type: Book
-
Published: 2022-06-01
-
Publisher: Springer Nature

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of the research examines ways for improving its effectiveness and time efficiency. The initial ER methods primarily target Veracity in the context of structured (relational) data that are described by a schema of well-known quality and meaning. To achieve high effectiveness, they leverage schema, expert, and/or external knowledge. Part of these methods are extended to address Volume, processing large datasets through multi-core or massive parallelization approaches, such as the MapReduce paradigm. However, these early schema-based approaches are inapplicable to Web Data, which abound in voluminous, noi...

Language: en
Pages: 433

New Trends in Databases and Information Systems

Author(s): András Benczúr, Bernhard Thalheim, Tomáš Horváth, Silvia Chiusano, Tania Cerquitelli, Csaba Sidló, Peter Z. Revesz

Categories: Computers

Type: Book
-
Published: 2018-08-30
-
Publisher: Springer

This book constitutes the thoroughly refereed short papers, workshops and doctoral consortium papers of the 22th European Conference on Advances in Databases and Information Systems, ADBIS 2018, held in Budapest, Hungary, in September 2018. The 20 full and the 4 short workshop papers as well as the 3 doctoral consortium papers were carefully reviewed and selected from 54 submissions to the workshops and 6 submissions to the doctoral consortium. Furthermore, there are 10 short papers included, which were accepted for the main conference. The papers are organized according to the 6 workshops and the doctoral consortium: ADBIS 2018 short papers; First Workshop on Advances on Big Data Management...

Language: en
Pages: 265

Fault-Tolerant Distributed Transactions on Blockchain

Author(s): Suyash Gupta, Jelle Hellings, Mohammad Sadoghi

Categories: Computers

Type: Book
-
Published: 2022-06-01
-
Publisher: Springer Nature

Since the introduction of Bitcoin—the first widespread application driven by blockchain—the interest of the public and private sectors in blockchain has skyrocketed. In recent years, blockchain-based fabrics have been used to address challenges in diverse fields such as trade, food production, property rights, identity-management, aid delivery, health care, and fraud prevention. This widespread interest follows from fundamental concepts on which blockchains are built that together embed the notion of trust, upon which blockchains are built. 1. Blockchains provide data transparancy. Data in a blockchain is stored in the form of a ledger, which contains an ordered history of all the transa...

Language: en
Pages: 409

Big Data Analytics and Knowledge Discovery

Author(s): Robert Wrembel

Type: Book
-
Published: Unknown
-
Publisher: Springer Nature

None

Language: en
Pages: 134

Transaction Processing on Modern Hardware

Author(s): Mohammad Sadoghi, Spyros Blanas

Categories: Computers

Type: Book
-
Published: 2022-05-31
-
Publisher: Springer Nature

The last decade has brought groundbreaking developments in transaction processing. This resurgence of an otherwise mature research area has spurred from the diminishing cost per GB of DRAM that allows many transaction processing workloads to be entirely memory-resident. This shift demanded a pause to fundamentally rethink the architecture of database systems. The data storage lexicon has now expanded beyond spinning disks and RAID levels to include the cache hierarchy, memory consistency models, cache coherence and write invalidation costs, NUMA regions, and coherence domains. New memory technologies promise fast non-volatile storage and expose unchartered trade-offs for transactional durabi...

Language: en
Pages: 184

Non-Volatile Memory Database Management Systems

Author(s): Joy Arulraj, Andrew Pavlo

Categories: Computers

Type: Book
-
Published: 2022-06-01
-
Publisher: Springer Nature

This book explores the implications of non-volatile memory (NVM) for database management systems (DBMSs). The advent of NVM will fundamentally change the dichotomy between volatile memory and durable storage in DBMSs. These new NVM devices are almost as fast as volatile memory, but all writes to them are persistent even after power loss. Existing DBMSs are unable to take full advantage of this technology because their internal architectures are predicated on the assumption that memory is volatile. With NVM, many of the components of legacy DBMSs are unnecessary and will degrade the performance of data-intensive applications. We present the design and implementation of DBMS architectures that...

Language: en
Pages: 176

Data-Intensive Workflow Management

Author(s): Daniel C. M. de Oliveira, Ji Liu, Esther Pacitti

Categories: Computers

Type: Book
-
Published: 2022-06-01
-
Publisher: Springer Nature

Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environmen...

Welcome to our book review site go-pdf.online!