Home
Journals
Archaeology International
Architecture_MPS
Europe and the World: A law review
Film Education Journal
History Education Research Journal
International Journal of Development Education and Global Learning
International Journal of Social Pedagogy
Jewish Historical Studies: A Journal of English-Speaking Jewry
Journal of Bentham Studies
London Review of Education
Radical Americas
Research for All
The Journal of the Sylvia Townsend Warner Society
The London Journal of Canadian Studies
About
About UCL Press
Who we are
Contact us
My ScienceOpen
Sign in
Register
Dashboard
Search
Home
Journals
Archaeology International
Architecture_MPS
Europe and the World: A law review
Film Education Journal
History Education Research Journal
International Journal of Development Education and Global Learning
International Journal of Social Pedagogy
Jewish Historical Studies: A Journal of English-Speaking Jewry
Journal of Bentham Studies
London Review of Education
Radical Americas
Research for All
The Journal of the Sylvia Townsend Warner Society
The London Journal of Canadian Studies
About
About UCL Press
Who we are
Contact us
My ScienceOpen
Sign in
Register
Dashboard
Search
30
views
0
references
Top references
cited by
2
Cite as...
0 reviews
Review
0
comments
Comment
0
recommends
+1
Recommend
0
collections
Add to
0
shares
Share
Twitter
Sina Weibo
Facebook
Email
686
similar
All similar
Record
: found
Abstract
: not found
Book
: not found
Speech and Computer
other
Editor(s):
Alexey Karpov
,
Rodmonga Potapova
,
Iosif Mporas
Publication date
(Print):
2017
Publisher:
Springer International Publishing
Read this book at
Publisher
Buy book
Review
Review book
Invite someone to review
Bookmark
Cite as...
There is no author summary for this book yet. Authors can add summaries to their books on ScienceOpen to make them more accessible to a non-specialist audience.
Related collections
Laboratory Phonology
Author and book information
Book
ISBN (Print):
978-3-319-66428-6
ISBN (Electronic):
978-3-319-66429-3
Publication date (Print):
2017
DOI:
10.1007/978-3-319-66429-3
SO-VID:
d307f840-b018-41d6-a8a2-3a9db4c54ee7
License:
http://www.springer.com/tdm
History
Data availability:
Comments
Comment on this book
Sign in to comment
Book chapters
pp. 3
Low-Resource Speech Recognition and Keyword-Spotting
pp. 20
Big Data, Deep Learning – At the Edge of X-Ray Speaker Analysis
pp. 37
A Comparison of Covariance Matrix and i-vector Based Speaker Recognition
pp. 46
A Trainable Method for the Phonetic Similarity Search in German Proper Names
pp. 56
Acoustic and Perceptual Correlates of Vowel Articulation in Parkinson’s Disease With and Without Mild Cognitive Impairment: A Pilot Study
pp. 65
Acoustic Cues for the Perceptual Assessment of Surround Sound
pp. 76
Acoustic Modeling in the STC Keyword Search System for OpenKWS 2016 Evaluation
pp. 87
Adaptation Approaches for Pronunciation Scoring with Sparse Training Data
pp. 98
An Algorithm for Detection of Breath Sounds in Spontaneous Speech with Application to Speaker Recognition
pp. 109
An Alternative Approach to Exploring a Video
pp. 119
An Analysis of the RNN-Based Spoken Term Detection Training
pp. 130
Analysis of Interaction Parameter Levels in Interaction Quality Modelling for Human-Human Conversation
pp. 141
Annotation Error Detection: Anomaly Detection vs. Classification
pp. 152
Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations
pp. 162
Assessing Spoken Dialog Services from the End-User Perspective: Usability and Experience
pp. 171
Audio-Replay Attack Detection Countermeasures
pp. 182
Automatic Estimation of Presentation Skills Using Speech, Slides and Gestures
pp. 192
Automatic Phonetic Transcription for Russian: Speech Variability Modeling
pp. 200
Automatic Smoker Detection from Telephone Speech Signals
pp. 211
Bimodal Anti-Spoofing System for Mobile Security
pp. 221
Canadian English Word Stress: A Corpora-Based Study of National Identity in a Multilingual Community
pp. 233
Classification of Formal and Informal Dialogues Based on Turn-Taking and Intonation Using Deep Neural Networks
pp. 244
Clustering Target Speaker on a Set of Telephone Dialogs
pp. 253
Cognitive Entropy in the Perceptual-Auditory Evaluation of Emotional Modal States of Foreign Language Communication Partner
pp. 262
Correlation Normalization of Syllables and Comparative Evaluation of Pronunciation Quality in Speech Rehabilitation
pp. 272
CRF-Based Phrase Boundary Detection Trained on Large-Scale TTS Speech Corpora
pp. 282
Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous Vocoder
pp. 292
Design of Online Echo Canceller in Duplex Mode
pp. 302
Detection of Stance and Sentiment Modifiers in Political Blogs
pp. 312
Digits to Words Converter for Slavic Languages in Systems of Automatic Speech Recognition
pp. 322
Discriminating Speakers by Their Voices — A Fusion Based Approach
pp. 332
Emotional Poetry Generation
pp. 343
End-to-End Large Vocabulary Speech Recognition for the Serbian Language
pp. 353
Examining the Impact of Feature Selection on Sentiment Analysis for the Greek Language
pp. 362
Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition
pp. 370
Exploring Multiparty Casual Talk for Social Human-Machine Dialogue
pp. 379
First Experiments to Detect Anomaly Using Personality Traits vs. Prosodic Features
pp. 389
Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker Verification
pp. 398
Hesitations in Spontaneous Speech: Acoustic Analysis and Detection
pp. 407
Human as Acmeologic Entity in Social Network Discourse (Multidimensional Approach)
pp. 417
Improved Speaker Adaptation by Combining I-vector and fMLLR with Deep Bottleneck Networks
pp. 427
Improving of LVCSR for Causal Czech Using Publicly Available Language Resources
pp. 438
Improving Performance of Speaker Identification Systems Using Score Level Fusion of Two Modes of Operation
pp. 445
Improving Speech-Based Emotion Recognition by Using Psychoacoustic Modeling and Analysis-by-Synthesis
pp. 456
In Search of Sentence Boundaries in Spontaneous Speech
pp. 464
Investigating Acoustic Correlates of Broad and Narrow Focus Perception by Japanese Learners of English
pp. 473
Language Adaptive Multilingual CTC Speech Recognition
pp. 483
Language Model Optimization for a Deep Neural Network Based Speech Recognition System for Serbian
pp. 493
Lexico-Semantical Indices of “Deprivation – Aggression” Modality Correlation in Social Network Discourse
pp. 503
Linguistic Features and Sociolinguistic Variability in Everyday Spoken Russian
pp. 512
Medical Speech Recognition: Reaching Parity with Humans
pp. 525
Microphone Array Post-filter in Frequency Domain for Speech Recognition Using Short-Time Log-Spectral Amplitude Estimator and Spectral Harmonic/Noise Classifier
pp. 535
Multimodal Keyword Search for Multilingual and Mixlingual Speech Corpus
pp. 546
Neural Network Doc2vec in Automated Sentiment Analysis for Short Informal Texts
pp. 555
Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech
pp. 564
Novel Linear Prediction Temporal Phase Based Features for Speaker Recognition
pp. 572
Novel Phase Encoded Mel Cepstral Features for Speaker Verification
pp. 582
On a Way to the Computer Aided Speech Intonation Training
pp. 593
On Residual CNN in Text-Dependent Speaker Verification Task
pp. 602
Perception and Acoustic Features of Speech of Children with Autism Spectrum Disorders
pp. 613
Phase Analysis and Labeling Strategies in a CNN-Based Speaker Change Detection System
pp. 623
Preparing Audio Recordings of Everyday Speech for Prosody Research: The Case of the ORD Corpus
pp. 632
Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features
pp. 642
Relationship Between Perception of Cuteness in Female Voices and Their Durations
pp. 651
Retaining Expression on De-identified Faces
pp. 662
Semi-automatic Facial Key-Point Dataset Creation
pp. 669
Song Emotion Recognition Using Music Genre Information
pp. 680
Spanish Corpus for Sentiment Analysis Towards Brands
pp. 690
Speech Enhancement for Speaker Recognition Using Deep Recurrent Neural Networks
pp. 700
Stance Classification in Texts from Blogs on the 2016 British Referendum
pp. 710
The “Retrospective Commenting” Method for Longitudinal Recordings of Everyday Speech
pp. 719
The 2016 RWTH Keyword Search System for Low-Resource Languages
pp. 731
The Effect of Morphological Factors on Sentence Boundaries in Russian Spontaneous Speech
pp. 741
The Pausing Method Based on Brown Clustering and Word Embedding
pp. 748
Unsupervised Document Classification and Topic Detection
pp. 757
Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions
pp. 767
Utilizing Lipreading in Large Vocabulary Continuous Speech Recognition
pp. 777
Vocal Emotion Conversion Using WSOLA and Linear Prediction
pp. 788
Voice Conversion for TTS Systems with Tuning on the Target Speaker Based on GMM
pp. 799
VoiScan: Telephone Voice Analysis for Health and Biometric Applications
pp. 809
Web Queries Classification Based on the Syntactical Patterns of Search Types
pp. 820
What Speech Recognition Accuracy is Needed for Video Transcripts to be a Useful Search Interface?
Similar content
686
Spectro-temporal acoustical markers differentiate speech from song across cultures
Authors:
Philippe Albouy
,
Samuel A. Mehr
,
Roxane S. Hoyer
…
Joint Khmer Word Segmentation and Part-of-Speech Tagging Using Deep Learning
Authors:
Rina Buoy
,
Nguonly Taing
,
Sokchea Kor
Multi-microphone Speech Dereverberation Using Eigen-decomposition
Authors:
Sharon Gannot
See all similar
Cited by
2
CNN-Based Identification of Parkinson’s Disease from Continuous Speech in Noisy Environments
Authors:
Paul Faragó
,
Sebastian-Aurelian Ștefănigă
,
Claudia-Georgiana Cordoș
…
End-to-End Residual CNN with L-GM Loss Speaker Verification System
Authors:
Xingjian Du
,
Xuan Shi
,
Mengyao Zhu
See all cited by