000 07602nam a22006615i 4500
001 978-3-031-48312-7
003 DE-He213
005 20240423130229.0
007 cr nn 008mamaa
008 231121s2023 sz | s |||| 0|eng d
020 _a9783031483127
_9978-3-031-48312-7
024 7 _a10.1007/978-3-031-48312-7
_2doi
050 4 _aQ334-342
050 4 _aTA347.A78
072 7 _aUYQ
_2bicssc
072 7 _aCOM004000
_2bisacsh
072 7 _aUYQ
_2thema
082 0 4 _a006.3
_223
245 1 0 _aSpeech and Computer
_h[electronic resource] :
_b25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part II /
_cedited by Alexey Karpov, K. Samudravijaya, K. T. Deepak, Rajesh M. Hegde, Shyam S. Agrawal, S. R. Mahadeva Prasanna.
250 _a1st ed. 2023.
264 1 _aCham :
_bSpringer Nature Switzerland :
_bImprint: Springer,
_c2023.
300 _aXXVI, 568 p. 195 illus., 141 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aLecture Notes in Artificial Intelligence,
_x2945-9141 ;
_v14339
505 0 _aIndustrial Speech and Language Technology -- Analysing Breathing Patterns in Reading and Spontaneous Speech -- Audio-Visual Speaker Verification via Joint Cross Attention -- A Novel Scheme to Classify Read and Spontaneous Speech -- Analysis of a Hinglish ASR System’s Performance for Fraud Detection -- Anomaly Detection in Speech: A Comprehensive Approach for Enhanced Speech Analysis -- CAPTuring Accents: An Approach to Personalize Pronunciation Training for Learners with Different L1 Backgrounds -- Speech Technology for Under-Resourced Languages -- Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages -- Phone Durations Modeling for Livvi-Karelian ASR -- Significance of Indic Self-Supervised Speech Representations for Indic Under-Resourced ASR -- Study of Various End-to-End Keyword Spotting Systems on the Bengali language under Low-Resource Condition -- Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language -- Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children’s ASR -- Code-Mixed Text-to-Speech Synthesis under Low-Resource Constraints -- An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language -- An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language -- Cross Lingual Style Transfer using Multiscale Loss Function for Soliga: A Low Resource Tribal Language -- Preliminary Analysis of Lambani Vowels and Vowel Classification using Acoustic Feature -- Curriculum Learning based Approach for Faster Convergence of TTS Model -- Rhythm Measures and Language Endangerment: the Case of Deori -- Konkani Phonetic Transcription System 1.0 -- Speech Analysis and Synthesis -- E-TTS: Expressive Text-to-Speech Synthesis for Hindi using Data Augmentation -- Direct vs Cascaded Speech-to-Speech Translation using Transformer -- Deep Learning based Speech Quality Assessment Focusing on Noise Effects -- Quantifying the Emotional Landscape of Music with Three Dimensions -- Analysis of Mandarin vs. English Language for Emotional Voice Conversion -- Audio DeepFake Detection Employing Multiple Parametric Exponential Linear Units -- A Comparison of Learned Representations with Jointly Optimized VAE and DNN for Syllable Stress Detection -- On the Asymptotic Behaviour of the Speech Signal -- Improvement of Audio-Visual Keyword Spotting System Accuracy using Excitation Source Feature -- Developing a Question Answering System on the material of Holocaust survivors’ testimonies in Russian -- Enhancing Children’s Short Utterance based ASV using Data Augmentation Techniques and Feature Concatenation Approach -- Studying the Effectiveness of Data Augmentation and Frequency-DomainLinear Prediction Coefficients in Children’s Speaker Verification under Low-Resource Conditions -- Constant-Q based Harmonic and Pitch Features for Normal vs Pathological Infant Cry Classification -- Robustness of Whisper Features for Infant Cry Classification -- Speaker and Language Identification, Verification, and Diarization -- I-MSV 2022: Indic-Multilingual and Multi-Sensor Speaker Verification Challenge -- Multi-Task Learning over Mixup Variants for the Speaker Verification Task -- Exploring the Impact of Different Approaches for Spoken Dialect Identification of Konkani Language -- Adversarially Trained Hierarchical Attention Network for Domain-Invariant Spoken Language Identification -- Ensemble of Incremental System Enhancements for Robust Speaker Diarization in Code-Switched Real-Life Audios -- Enhancing Language Identification in Indian Context through Exploiting Learned Features with Wav2Vec2.0 -- Design and Development of Voice OTP Authentication System -- End-to-End Native Language Identification using a Modified Vision Transformer(ViT) from L2 English Speech -- Dialect Identification in Ao using Modulation-based Representation -- Self-Supervised Speaker Verification Employing Augmentation Mix and Self-Augmented Training-based Clustering. .
520 _aThe two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
650 0 _aArtificial intelligence.
650 0 _aImage processing
_xDigital techniques.
650 0 _aComputer vision.
650 0 _aComputer engineering.
650 0 _aComputer networks .
650 0 _aApplication software.
650 1 4 _aArtificial Intelligence.
650 2 4 _aComputer Imaging, Vision, Pattern Recognition and Graphics.
650 2 4 _aComputer Engineering and Networks.
650 2 4 _aComputer and Information Systems Applications.
700 1 _aKarpov, Alexey.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aSamudravijaya, K.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aDeepak, K. T.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aHegde, Rajesh M.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aAgrawal, Shyam S.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aPrasanna, S. R. Mahadeva.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
710 2 _aSpringerLink (Online service)
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9783031483110
776 0 8 _iPrinted edition:
_z9783031483134
830 0 _aLecture Notes in Artificial Intelligence,
_x2945-9141 ;
_v14339
856 4 0 _uhttps://doi.org/10.1007/978-3-031-48312-7
912 _aZDB-2-SCS
912 _aZDB-2-SXCS
912 _aZDB-2-LNC
942 _cSPRINGER
999 _c186415
_d186415