You may have to Search all our reviewed books and magazines, click the sign up button below to create a free account.
For a machine to convert text into sounds that humans can understand as speech requires an enormous range of components, from abstract analysis of discourse structure to synthesis and modulation of the acoustic output. Work in the field is thus inherently interdisciplinary, involving linguistics, computer science, acoustics, and psychology. This collection of articles by leading researchers in each of the fields involved in text-to-speech synthesis provides a picture of recent work in laboratories throughout the world and of the problems and challenges that remain. By providing samples of synthesized speech as well as video demonstrations for several of the synthesizers discussed, the book will also allow the reader to judge what all the work adds up to -- that is, how good is the synthetic speech we can now produce? Topics covered include: Signal processing and source modeling Linguistic analysis Articulatory synthesis and visual speech Concatenative synthesis and automated segmentation Prosodic analysis of natural speech Synthesis of prosody Evaluation and perception Systems and applications.
The following is a passage from our application for NATO sponsorship: "In the main, the participants in this workshop on the Psychophysics of Speech Perception come from two areas of research: - one area is that of speech perception researc,h, in which the perception of speech sounds is investigated; - the other area is that of psychoacoustics, or auditory psychophysics, in which the perception of simple non-speech sounds, such as pure tones or noise bursts, is investigated, in order to determine the properties of the hearing mechanism. Al though there is widespread agreement among both speech researchers and auditory psychophysicists that there should be a great deal of co-operation between...
Speech recognition by machine : a review / D.R. Reddy -- The value of speech recognition systems / W.A. Lea -- Digital representations of speech signals / R.W. Schafer and L.R. Rabiner -- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences / S.B. Davis and P. Mermelstein -- Vector quantization / R.M. Gray -- A joint synchrony-mean-rate model of auditory speech processing / S. Seneff -- Isolated and connected word recognition : theory and selected applications / L.R. Rabiner and S.E. Levinson -- Minimum prediction residual principle applied to speech recognition / F. Itakura -- Dynamic programming algorithm optimization for spoken word recognition / S. Hakoe and S. Chiba -- Speaker-independent recognition of isolated words using clustering techniques / L.R. Rabiner [and others]Two-level DP-matching : a dynamic programming-based pattern matching algorithm for connected word recognition / H. Sakoe -- The use of a one-stage dynamic pr ...
Profoundly influenced by the analyses, of contemporary linguistics, these original contributions bring a number of different views to bear on important issues in a controversial area of study. The linguistic structures and language-related processes the book deals with are for the most part central (syntactic structures, phonological representations, semantic readings) rather than peripheral (acousticphonetic structures and the perception and production of these structures) aspects of language. Each section contains a summarizing introduction. Section I takes up issues at the interface of linguistics and neurology: The Concept of a Mental Organ for Language; Neural Mechanisms, Aphasia, and T...
The last 50 years have witnessed a rapid growth in the understanding of the articulation and the acoustics of vowels. Contemporary theories of speech perception have concentrated on consonant perception, and this volume is intended as a balance to such bias. The authors propose a computational theory of auditory vowel perception, accounting for vowel identification in the face of acoustic differences between speakers and speaking rate and stress. This work lays the foundation for future experimental and computational studies of vowel perception.
This book brings together phonologists working in different areas to explore key questions relating to phonological primitives, the basic building blocks that are at the heart of phonological structure and over which phonological computations are carried out. Whether these units are referred to as features, elements, gestures, or something else entirely, the assumptions that are made about them are fundamental to modern phonological theory. Even so, there is limited consensus on the specifics of those assumptions. The chapters in this book present differing perspectives on phonological primitives and their implications, addressing some of the most pressing issues in the field such as how many features there are; whether those features are privative or binary; and whether segments need to be specified for all features. The studies cover a wide range of methodologies and domains, including experimental work, fieldwork, language acquisition, theory-internal concerns, and many more, and will be of interest to phoneticians and phonologists from all theoretical backgrounds.
"The papers presented within this volume were selected from the fourteenth meeting of the Austronesian Formal Linguistics Association (AFLA XIV), held May 4-6, 2007 at McGill University in Montreal, Quebec, Canada."
This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.
Describes ITU H H.323 and H.324, H.263, ITU-T video, and MPEG-4 standards, systems, and coding; IP and ATM networks; multimedia search and retrieval; image retrieval in digital laboratories; and the status and direction of MPEG-7.