Welcome to our book review site go-pdf.online!

You may have to Search all our reviewed books and magazines, click the sign up button below to create a free account.

Sign up

Text, Speech and Dialogue
  • Language: en
  • Pages: 663

Text, Speech and Dialogue

This book constitutes the refereed proceedings of the 11th International Conference on Text, Speech and Dialogue, TSD 2008, held in Brno, Czech Republic, September 8-12, 2008. The 79 revised full papers presented together with 4 invited papers were carefully reviewed and selected from 173 submissions. The topics of the conference include, but are not limited to, text corpora and tagging; transcription problems in spoken corpora; sense disambiguation; links between text and speech oriented systems; parsing issues; parsing problems in spoken texts; multi-lingual issues; multi-lingual dialogue systems; information retrieval and information extraction; text/topic summarization; machine translation; semantic networks and ontologies; semantic web; speech modeling; speech segmentation; speech recognition; search in speech for IR and IE; text-to-speech synthesis; dialogue systems; development of dialogue strategies; prosody in dialogues; emotions and personality modeling; user modeling; knowledge representation in relation to dialogue systems; assistive technologies based on speech and dialogue; applied systems and software; facial animation; and visual speech synthesis

Advances in Communication and Computing
  • Language: en
  • Pages: 281

Advances in Communication and Computing

  • Type: Book
  • -
  • Published: 2015-06-17
  • -
  • Publisher: Springer

The present volume is a compilation of research work in computation, communication, vision sciences, device design, fabrication, upcoming materials and related process design, etc. It is derived out of selected manuscripts submitted to the 2014 National Workshop on Advances in Communication and Computing (WACC 2014), Assam Engineering College, Guwahati, Assam, India which is emerging out to be a premier platform for discussion and dissemination of knowhow in this part of the world. The papers included in the volume are indicative of the recent thrust in computation, communications and emerging technologies. Certain recent advances in ZnO nanostructures for alternate energy generation provide...

Speech Recognition Using Articulatory and Excitation Source Features
  • Language: en
  • Pages: 92

Speech Recognition Using Articulatory and Excitation Source Features

  • Type: Book
  • -
  • Published: 2017-01-11
  • -
  • Publisher: Springer

This book discusses the contribution of articulatory and excitation source information in discriminating sound units. The authors focus on excitation source component of speech -- and the dynamics of various articulators during speech production -- for enhancement of speech recognition (SR) performance. Speech recognition is analyzed for read, extempore, and conversation modes of speech. Five groups of articulatory features (AFs) are explored for speech recognition, in addition to conventional spectral features. Each chapter provides the motivation for exploring the specific feature for SR task, discusses the methods to extract those features, and finally suggests appropriate models to capture the sound unit specific knowledge from the proposed features. The authors close by discussing various combinations of spectral, articulatory and source features, and the desired models to enhance the performance of SR systems.

Speech Processing in Mobile Environments
  • Language: en
  • Pages: 129

Speech Processing in Mobile Environments

This book focuses on speech processing in the presence of low-bit rate coding and varying background environments. The methods presented in the book exploit the speech events which are robust in noisy environments. Accurate estimation of these crucial events will be useful for carrying out various speech tasks such as speech recognition, speaker recognition and speech rate modification in mobile environments. The authors provide insights into designing and developing robust methods to process the speech in mobile environments. Covering temporal and spectral enhancement methods to minimize the effect of noise and examining methods and models on speech and speaker recognition applications in mobile environments.

Language Identification Using Excitation Source Features
  • Language: en
  • Pages: 119

Language Identification Using Excitation Source Features

  • Type: Book
  • -
  • Published: 2015-04-15
  • -
  • Publisher: Springer

This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Implicit processing of linear prediction (LP) residual and (ii) Explicit parameterization of linear prediction residual. The book discusses how in implicit processing approach, excitation source features are derived from LP residual, Hilbert envelope (magnitude) of LP residual and Phase of LP residual; and in explicit parameterization approach, LP residual signal is processed in spectral domain to extr...

Robust Emotion Recognition using Spectral and Prosodic Features
  • Language: en
  • Pages: 127

Robust Emotion Recognition using Spectral and Prosodic Features

In this brief, the authors discuss recently explored spectral (sub-segmental and pitch synchronous) and prosodic (global and local features at word and syllable levels in different parts of the utterance) features for discerning emotions in a robust manner. The authors also delve into the complementary evidences obtained from excitation source, vocal tract system and prosodic features for the purpose of enhancing emotion recognition performance. Features based on speaking rate characteristics are explored with the help of multi-stage and hybrid models for further improving emotion recognition performance. Proposed spectral and prosodic features are evaluated on real life emotional speech corpus.

Indian Art Music: A Computational Perspective
  • Language: en
  • Pages: 433

Indian Art Music: A Computational Perspective

This monograph presents a diverse collection of articles on Indian Art Music based on analytical work aided by computational tools. The book focuses mainly on the current practices in music and its representation in audio recordings, a perspective that is particularly relevant to oral traditions. It presents a rare and unique example of collaboration between musicians, musicologists, scientists, and engineers. The presentation brings together various aspects of research on Indian art music that benefits from audio processing or computing, ranging from musicology to information retrieval to instrument modeling. It is hoped that the monograph will serve as an accessible introduction to computational approaches for Indian art music in particular, and ethnomusicology more generally.

Speech, Audio, Image and Biomedical Signal Processing using Neural Networks
  • Language: en
  • Pages: 419

Speech, Audio, Image and Biomedical Signal Processing using Neural Networks

Humans are remarkable in processing speech, audio, image and some biomedical signals. Artificial neural networks are proved to be successful in performing several cognitive, industrial and scientific tasks. This peer reviewed book presents some recent advances and surveys on the applications of artificial neural networks in the areas of speech, audio, image and biomedical signal processing. It chapters are prepared by some reputed researchers and practitioners around the globe.

Speech and Computer
  • Language: en
  • Pages: 737

Speech and Computer

This book constitutes the proceedings of the 24th International Conference on Speech and Computer, SPECOM 2022, held as a hybrid event in Gurugram, India, in November 2022. The 51 full and 9 short papers presented in this volume were carefully reviewed and selected from 99 submissions. The papers present current research in the area of computer speech processing including audio signal processing, automatic speech recognition, speaker recognition, computational paralinguistics, speech synthesis, sign language and multimodal processing, and speech and language resources.

Speech and Computer
  • Language: en
  • Pages: 657

Speech and Computer

The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: ​automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.