You may have to Search all our reviewed books and magazines, click the sign up button below to create a free account.
This second issue of Transactions on Large-Scale Data- and Knowledge-Centered Systems consists of journal versions of selected papers from the 11th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2009).
This book constitutes the refereed proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery, DaWak 2009 held in Linz, Austria in August/September 2009. The 36 revised full papers presented were carefully reviewed and selected from 124 submissions. The papers are organized in topical sections on data warehouse modeling, data streams, physical design, pattern mining, data cubes, data mining applications, analytics, data mining, clustering, spatio-temporal mining, rule mining, and OLAP recommendation.
The latest edition of a popular text and reference on database research, with substantial new material and revision; covers classical literature and recent hot topics. Lessons from database research have been applied in academic fields ranging from bioinformatics to next-generation Internet architecture and in industrial uses including Web-based e-commerce and search engines. The core ideas in the field have become increasingly influential. This text provides both students and professionals with a grounding in database research and a technical context for understanding recent innovations in the field. The readings included treat the most important issues in the database area--the basic mater...
Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of the research examines ways for improving its effectiveness and time efficiency. The initial ER methods primarily target Veracity in the context of structured (relational) data that are described by a schema of well-known quality and meaning. To achieve high effectiveness, they leverage schema, expert, and/or external knowledge. Part of these methods are extended to address Volume, processing large datasets through multi-core or massive parallelization approaches, such as the MapReduce paradigm. However, these early schema-based approaches are inapplicable to Web Data, which abound in voluminous, noi...
Software engineering research can trace its roots to a few highly influential individuals. Among that select group is Leon J. Osterweil, who has been a major force in driving software engineering from its infancy to its modern reality. For more than three decades, Prof. Osterweil's work has fundamentally defined or significantly impacted major directions in software analysis, development tools and environments, and software process--all critical parts of software engineering as it is practiced today. His exceptional contributions to the field have been recognized with numerous awards and honors through his career, including the ACM SIGSOFT Outstanding Research Award, in recognition of his ex...
Data profiling refers to the activity of collecting data about data, {i.e.}, metadata. Most IT professionals and researchers who work with data have engaged in data profiling, at least informally, to understand and explore an unfamiliar dataset or to determine whether a new dataset is appropriate for a particular task at hand. Data profiling results are also important in a variety of other situations, including query optimization, data integration, and data cleaning. Simple metadata are statistics, such as the number of rows and columns, schema and datatype information, the number of distinct values, statistical value distributions, and the number of null or empty values in each column. More...
This book is a gentle introduction to dominance-based query processing techniques and their applications. The book aims to present fundamental as well as some advanced issues in the area in a precise, but easy-to-follow, manner. Dominance is an intuitive concept that can be used in many different ways in diverse application domains. The concept of dominance is based on the values of the attributes of each object. An object dominates another object if is better than . This goodness criterion may differ from one user to another. However, all decisions boil down to the minimization or maximization of attribute values. In this book, we will explore algorithms and applications related to dominance-based query processing. The concept of dominance has a long history in finance and multi-criteria optimization. However, the introduction of the concept to the database community in 2001 inspired many researchers to contribute to the area. Therefore, many algorithmic techniques have been proposed for the efficient processing of dominance-based queries, such as skyline queries, -dominant queries, and top- dominating queries, just to name a few.
Increasingly, formal specification is being used by database researchers to describe and understand the systems they are designing and implementing. Similarly, those working on formal specification techniques have recognised that the database field provides a rich context for developing their ideas. However, as experts in one field often have a relatively limited knowledge of the other, there is a growing need for discussion about the relationship between these two fields and how they can be usefully combined. This volume contains the 16 papers which were presented at the International Workshop on Specification on Database Systems, held in Glasgow, 3-5 July 1991. The purpose of the workshop ...
This book constitutes the refereed proceedings of the 13th International Conference on Database Systems for Advanced Applications, DASFAA 2008, held in New Delhi, India, in March 2008. The 30 revised full papers and 27 revised short papers presented together with the abstracts of 3 invited talks as well as 8 demonstration papers and a panel discussion motivation were carefully reviewed and selected from 173 submissions. The papers are organized in topical sections on XML schemas, data mining, spatial data, indexes and cubes, data streams, P2P and transactions, XML processing, complex pattern processing, IR techniques, queries and transactions, data mining, XML databases, data warehouses and industrial applications, as well as mobile and distributed data.