Associate Professor Vidhyasaharan Sethu

Associate Professor

PhD (UNSW), MEngSc in Signal Processing (UNSW), BE in Electronics and Communication Engineering (Anna University)

Engineering

Electrical Engineering and Telecommunications

Vidhyasaharan Sethu is an Associate Professor with the School of Electrical Engineering and Telecommunications. His primary research interests are in the field of speech signal processing. Particularly in the application of machine learning techniques for addressing speech processing tasks. His research interests include speech based emotion and mental state recognition systems, affective computing, voice biometrics and more broadly the overlap between machine learning and signal processing.

Phone

+61 2 9385 7737

E-mail

v.sethu@unsw.edu.au

Location

Room 442, EE&T Building (G17), UNSW Sydney

Book Chapters | 2015

Sethu V; Epps J; Ambikairajah E, 2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228,

Book Chapters | 2014

Ambikairajah E; Sethu V; Eaton R; Sheng M, 2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment, Reporting, and Teaching Practices in Engineering Education, pp. 241 - 258,
Journal articles | 2025

Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, 'Should Audio Front-Ends be Adaptive? Comparing Learnable and Adaptive Front-Ends', IEEE Transactions on Audio, Speech and Language Processing, 33, pp. 998 - 1010,

Journal articles | 2024

Bose D; Sethu V; Ambikairajah E, 2024, 'Continuous Emotion Ambiguity Prediction: Modeling with Beta Distributions', IEEE Transactions on Affective Computing, 15, pp. 1684 - 1695,

Journal articles | 2024

Haghshenas Y; Wong WP; Gunawan D; Khataee A; Keyikoğlu R; Razmjou A; Kumar PV; Toe CY; Masood H; Amal R; Sethu V; Teoh WY, 2024, 'Predicting the rates of photocatalytic hydrogen evolution over cocatalyst-deposited TiO2 using machine learning with active photon flux as a unifying feature', EES Catalysis, 2, pp. 612 - 623,

Journal articles | 2024

Haghshenas Y; Wong WP; Sethu V; Amal R; Kumar PV; Teoh WY, 2024, 'Full prediction of band potentials in semiconductor materials', Materials Today Physics, 46, pp. 101519,

Journal articles | 2024

Hong X; Gong Y; Sethu V; Dang T, 2024, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', CoRR, abs/2409.18339

Journal articles | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', Interspeech 2024, pp. 4323 - 4327,

Journal articles | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework.', CoRR, abs/2409.15357

Journal articles | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6495 - 6499,

Journal articles | 2024

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction', Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3185 - 3189,

Journal articles | 2023

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, 'Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio.', CoRR, abs/2310.10922

Journal articles | 2023

Masood H; Sirojan T; Toe CY; Kumar PV; Haghshenas Y; Sit PHL; Amal R; Sethu V; Teoh WY, 2023, 'Enhancing prediction accuracy of physical band gaps in semiconductor materials', Cell Reports Physical Science, 4,

Journal articles | 2023

Wickramasinghe B; Ambikairajah E; Sethu V; Epps J; Li H; Dang T, 2023, 'DNN controlled adaptive front-end for replay attack detection systems', Speech Communication, 154,

Journal articles | 2023

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', IEEE Transactions on Affective Computing, 14, pp. 2089 - 2101,

Journal articles | 2022

Ramandi HL; Irtza S; Sirojan T; Naman A; Mathew R; Sethu V; Roshan H; Lamei Ramandi H, 2022, 'FracDetect: A novel algorithm for 3D fracture detection in digital fractured rocks', Journal of Hydrology, 607, pp. 127482,

Journal articles | 2021

Aboutanios E; Sethu V; Ambikairajah E; Taubman DS; Epps J, 2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, 38, pp. 133 - 143,

Journal articles | 2021

Gunendradasan T; Ambikairajah E; Epps J; Sethu V; Li H, 2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, 132, pp. 114 - 122,

Journal articles | 2021

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information.', CoRR, abs/2108.04605

Journal articles | 2021

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'Multimodal Affect Models: An Investigation of Relative Salience of Audio and Visual Cues for Emotion Prediction', Frontiers in Computer Science, 3,

Journal articles | 2020

Cummins N; Sethu V; Epps J; Williamson JR; Quatieri TF; Krajewski J, 2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, 11, pp. 272 - 283,

Journal articles | 2020

Huang Z; Epps J; Joachim D; Sethu V, 2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, 14, pp. 435 - 448,

Journal articles | 2020

Suthokumar G; Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2020, 'An analysis of speaker dependent models in replay detection', APSIPA Transactions on Signal and Information Processing, 9,

Journal articles | 2019

Brown S; Sethu V; Taubman D, 2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, 145, pp. 2254 - 2264,

Journal articles | 2019

Masood H; Toe CY; Teoh WY; Sethu V; Amal R, 2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, 9, pp. 11774 - 11787,

Journal articles | 2019

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan SS, 2019, 'The Ambiguous World of Emotion Representation.', CoRR, abs/1909.00360

Journal articles | 2019

Vukovic M; Sethu V; Parker J; Cavedon L; Lech M; Thangarajah J, 2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, 124, pp. 116 - 133,

Journal articles | 2018

Dang T; Sethu V; Ambikairajah E, 2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15,

Journal articles | 2018

Fernando S; Sethu V; Ambikairajah E, 2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, 54, pp. 173 - 175,

Journal articles | 2018

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'Using language cluster models in hierarchical language identification', Speech Communication, 100, pp. 30 - 40,

Journal articles | 2018

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, 25, pp. 1775 - 1779,

Journal articles | 2017

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, 53, pp. 405 - 407,

Journal articles | 2017

Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, 11, pp. 632 - 643,

Journal articles | 2015

Cummins N; Sethu V; Epps J; Schnieder S; Krajewski J, 2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, 75, pp. 27 - 49,

Journal articles | 2015

Thiruvaran T; Sethu V; Ambikairajah E; Li H, 2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters,

Journal articles | 2013

Sethu V; Ambikairajah E; Epps J, 2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio, Speech, and Music Processing, 2013,

Journal articles | 2011

Ambikairajah E; Li H; Wang L; Yin B; Sethu V, 2011, 'Language Identification: A Tutorial', Circuits and Systems Magazine, IEEE, 11, pp. 82 - 108,

Journal articles | 2011

Le NP; Ambikairajah E; Epps JR; Sethu V; Choi E, 2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, 53, pp. 540 - 551,

Journal articles | 2008

Sethu V; Ambikairajah E; Ge L, 2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, pp. 78092 - 78099

Journal articles | 2007

Meng D; Sethu V; Ambikairajah E; Ge L, 2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, 4, pp. 226 - 230,
Working Papers | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework.,

Working Papers | 2023

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio.,
Preprints | 2025

Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends,

Conference Papers | 2024

Ambikairajah E; Sirojan T; Sethu V; Mishra D, 2024, 'Aligning Tiered Assessments With Course Learning Outcomes', in 2024 IEEE International Conference on Teaching, Assessment and Learning for Engineering, TALE 2024 - Proceedings,

Conference Papers | 2024

Ambikairajah E; Sirojan T; Thiruvaran T; Sethu V, 2024, 'ChatGPT in the Classroom: A Shift in Engineering Design Education', in IEEE Global Engineering Education Conference, EDUCON,

Conference Papers | 2024

Ambikairajah E; Thiruvaran T; Sethu V; Mishra D; Sirojan T, 2024, 'A Tiered Learning Framework for Self-Guided Engineering Design Education', in IEEE Global Engineering Education Conference, EDUCON,

Conference Papers | 2024

Hong X; Gong Y; Sethu V; Dang T, 2024, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', in CoRR

Preprints | 2024

Hong X; Gong Y; Sethu V; Dang T, 2024, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models,

Conference Papers | 2024

Jing M; Sethu V; Ahmed B, 2024, 'A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5340 - 5344,

Conference Papers | 2024

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2024, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features.', in CoRR

Preprints | 2024

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features,

Preprints | 2024

Meng H; Sethu V; Ambikairajah E, 2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions,

Conference Papers | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4323 - 4327,

Preprints | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, Binaural Selective Attention Model for Target Speaker Extraction,

Conference Papers | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, pp. 6495 - 6499, presented at ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 14 April 2024 - 19 April 2024,

Conference Papers | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling.', in ICASSP, IEEE, pp. 6495 - 6499,

Preprints | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework,

Preprints | 2024

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction,

Conference Papers | 2024

Wu YT; Wu J; Sethu V; Lee CC, 2024, 'Can Modelling Inter-Rater Ambiguity Lead To Noise-Robust Continuous Emotion Predictions?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3714 - 3718,

Conference Papers | 2023

Dang T; Dimitriadis A; Wu J; Sethu V; Ambikairajah E, 2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,

Preprints | 2023

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio,

Conference Papers | 2023

Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 2898 - 2902,

Conference Papers | 2023

Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.', in Harte N; Carson-Berndsen J; Jones G (eds.), INTERSPEECH, ISCA, pp. 2898 - 2902,

Preprints | 2023

Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling,

Conference Papers | 2023

Shahin M; Nan Z; Sethu V; Ahmed B, 2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4119 - 4123,

Conference Papers | 2023

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023,

Conference Papers | 2023

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1843 - 1847,

Conference Papers | 2022

Wu J; Dang T; Sethu V; Ambikairajah E, 2022, 'A NOVEL SEQUENTIAL MONTE CARLO FRAMEWORK FOR PREDICTING AMBIGUOUS EMOTION STATES', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 8567 - 8571,

Conference Papers | 2021

Ahmed B; Ballard K; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin M; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An auditory-visual corpus of 3-to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4351 - 4355,

Conference Papers | 2021

Ahmed B; Ballard KJ; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin MA; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children's Speech.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 3680 - 3684,

Conference Papers | 2021

Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 576 - 580,

Conference Papers | 2021

Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 4498 - 4502,

Preprints | 2021

Dang T; Sethu V; Ambikairajah E; Epps J; Li H, 2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models,

Preprints | 2021

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information,

Conference Papers | 2020

Ambikairajah E; Sethu V, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 9229 - 9233,

Conference Papers | 2020

Suthokumar G; Sethu V; Sriskandaraja K; Ambikairajah E, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6609 - 6613,

Conference Papers | 2019

Atcheson M; Sethu V; Epps J, 2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019,

Conference Papers | 2019

Bose D; Dang T; Sethu V; Ambikairajah E; Fernando S, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019,

Conference Papers | 2019

Ouyang A; Dang T; Sethu V; Ambikairajah E, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019,

Preprints | 2019

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan S, 2019, The Ambiguous World of Emotion Representation,

Conference Papers | 2019

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6106 - 6110,

Conference Papers | 2019

Wickramasinghe B; Ambikairajah E; Epps J; Sethu V; Li H, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6011 - 6015,

Conference Papers | 2018

Atcheson M; Sethu V; Epps J, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3668 - 3672,

Conference Papers | 2018

Dang T; Sethu V; Ambikairajah E, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4929 - 4933,

Conference Papers | 2018

Fernando S; Irtza S; Sethu V; Ambikairajah E, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018,

Conference Papers | 2018

Fernando S; Sethu V; Ambikairajah E; Li H, 2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1440 - 1447,

Conference Papers | 2018

Fernando S; Sethu V; Ambikairajah E, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5204 - 5208,

Conference Papers | 2018

Fernando S; Sethu V; Ambikairajah E, 2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1818 - 1822,

Conference Papers | 2018

Gamage KW; Dang T; Sethu V; Epps J; Ambikairajah E, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018, pp. 47 - 55,

Conference Papers | 2018

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5199 - 5203,

Conference Papers | 2018

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5264 - 5268,

Conference Papers | 2018

Sriskandaraja K; Sethu V; Ambikairajah E, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 671 - 675,

Conference Papers | 2018

Suthokumar G; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 691 - 695,

Conference Papers | 2018

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E; Li H, 2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1038 - 1046,

Conference Papers | 2018

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018,

Conference Papers | 2017

Atcheson M; Sethu V; Epps J, 2017, 'Gaussian Process Regression for Continuous Emotion Recognition with Global Temporal Invariance.', in Lawrence N; Reid M (ed.), AffComp@IJCAI, PMLR, pp. 34 - 44,

Conference Papers | 2017

Cetin E; Abewardana Wijenayake C; Sethu V; Ambikairajah E, 2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017,

Conference Papers | 2017

Dang T; Atcheson M; Stasak B; Hayat M; Goecke R; Huang Z; Le P; Epps J; Jayawardena S; Sethu V, 2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in Ringeval F; Schuller BW; Valstar MF; Gratch J; Cowie R; Pantic M (eds.), AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Association for Computing Machinery (ACM), Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017,

Conference Papers | 2017

Dang T; Sethu V; Epps J; Ambikairajah E, 2017, 'An investigation of emotion prediction uncertainty using Gaussian Mixture Regression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1248 - 1252, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017,

Conference Papers | 2017

Fernando S; Sethu V; Ambikairajah E; Epps J, 2017, 'Bidirectional modelling for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2809 - 2813, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017,

Conference Papers | 2017

Gamage KW; Sethu V; Ambikairajah E, 2017, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction, ACII 2017, pp. 518 - 523,

Conference Papers | 2017

Gamage KW; Sethu V; Ambikairajah E, 2017, 'Salience based lexical features for emotion recognition', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5830 - 5834,

Conference Papers | 2017

Irtza S; Sethu V; Ambikairajah E; Li H, 2017, 'Investigating scalability in hierarchical language identification system', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2581 - 2585, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017,

Conference Papers | 2017

Lee KA; Hautamäki V; Kinnunen T; Larcher A; Zhang C; Nautsch A; Stafylakis T; Liu G; Rouvier M; Rao W; Alegre F; Ma J; Mak MW; Sarkar AK; Delgado H; Saeidi R; Aronowitz H; Sizov A; Sun H; Nguyen TH; Wang G; Ma B; Vestman V; Sahidullah M; Halonen M; Kanervisto A; Le Lan G; Bahmaninezhad F; Isadskiy S; Rathgeb C; Busch C; Tzimiropoulos G; Qian Q; Wang Z; Zhao Q; Wang T; Li H; Xue J; Zhu S; Jin R; Zhao T; Bousquet PM; Ajili M; Kheder WB; Matrouf D; Lim ZH; Xu C; Xu H; Xiao X; Chng ES; Fauve B; Sriskandaraja K; Sethu V; Lin WW; Thomsen DAL; Tan ZH; Todisco M; Evans N; Li H; Hansen JHL; Bonastre JF; Ambikairajah E, 2017, 'The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1328 - 1332, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017,

Conference Papers | 2017

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Incorporating local acoustic variability information into short duration speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1502 - 1506, presented at Interspeech 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017,

Conference Papers | 2017

Sriskandaraja K; Suthokumar G; Sethu V; Ambikairajah E, 2017, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, pp. 1195 - 1198,

Conference Papers | 2017

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2017, 'Independent modelling of high and low energy speech frames for spoofing detection', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2606 - 2610, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017,

Conference Papers | 2016

Dang T; Sethu V; Ambikairajah E, 2016, 'Factor analysis based speaker normalisation for continuous emotion prediction', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 913 - 917,

Conference Papers | 2016

Fernando S; Sethu V; Ambikairajah E, 2016, 'A feature normalisation technique for PLLR based language identification systems', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, CA, USA, pp. 2925 - 2929, presented at Interspeech 2016, San Francisco, CA, USA, 08 September 2016 - 12 September 2016,

Conference Papers | 2016

Huang Z; Stasak B; Dang T; Gamage KW; Le P; Sethu V; Epps J, 2016, 'Staircase regression in OA RVM, data selection and gender dependency in AVEC 2016', in AVEC 2016 - Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, co-located with ACM Multimedia 2016, ASSOC COMPUTING MACHINERY, Amsterdam, NETHERLANDS, pp. 19 - 26, presented at 6th International Workshop on Audio-Visual Emotion Recognition Challenge - Depression, Mood, and Emotion (AVEC), Amsterdam, NETHERLANDS, 16 October 2016 - 16 October 2016,

Conference Papers | 2016

Irtza S; Sethu V; Bavattichalil H; Ambikairajah E; Li H, 2016, 'A hierarchical framework for language identification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Shanghai, China, pp. 5820 - 5824, presented at 2016 IEEE International Conference on, Shanghai, China, 20 March 2016 - 25 March 2016,

Conference Papers | 2016

Irtza S; Sethu V; Fernando S; Ambikairajah E; Li H, 2016, 'Out of set language modelling in Hierarchical language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3270 - 3274,

Conference Papers | 2016

Ma J; Irtza S; Sriskandaraja K; Sethu V; Ambikairajah E, 2016, 'Parallel speaker and content modelling for text-dependent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 435 - 439, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016,

Conference Papers | 2016

Ma J; Sethu V; Ambikairajah E; Lee KA, 2016, 'Twin model G-PLDA for duration mismatch compensation in text-independent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1853 - 1857, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016,

Conference Papers | 2016

Sethu V; Fernando S; Ambikairajah E, 2016, 'Eigenfeatures: An alternative to Shifted Delta Coefficients for Language Identification', in SST2016, ASSTA, Parramatta, Australia, pp. 253 - 256, presented at 16th Speech Science and Technology Conference (SST2016), Parramatta, Australia, 06 December 2016 - 09 December 2017,

Conference Papers | 2016

Sriskandaraja K; Sethu V; Le PN; Ambikairajah E, 2016, 'Investigation of sub-band discriminative information between spoofed and genuine speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1710 - 1714, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016,

Conference Papers | 2015

Cummins N; Epps J; Sethu V; Krajewski J, 2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4779 - 4783,

Conference Papers | 2015

Cummins N; Sethu V; Epps J; Krajewski J, 2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015,

Conference Papers | 2015

Epps J; Sethu V; Eaton R; Ambikairajah E, 2015, 'High Definition Multi-View Video Guidance for Self-Directed Learning and More Effective Engineering Laboratories', Geelong,Australia, presented at Australasian Association for Engineering Education, Geelong,Australia, 06 December 2015 - 09 December 2015,

Conference Papers | 2015

Gamage KW; Sethu V; Le P; Ambikairajah E, 2015, 'An i-vector GPLDA System for Speech based Emotion Recognition', in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015,

Conference Papers | 2015

Hines C; Sethu V; Epps J, 2015, 'Twitter: A new online source of automatically tagged data for conversational speech emotion recognition', in ASM 2015 - Proceedings of the 1st International Workshop on Affect and Sentiment in Multimedia, co-located with ACM MM 2015, pp. 9 - 14,

Conference Papers | 2015

Huang Z; Dang T; Cummins N; Stasak B; Le P; Sethu V; Epps J, 2015, 'An investigation of annotation delay compensation and output-associative fusion for multimodal continuous emotion prediction', in AVEC 2015 - Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, co-Located with MM 2015, pp. 41 - 48,

Conference Papers | 2015

Irtza S; Bavattichalil H; Sethu V; Ambikairajah E, 2015, 'Scalable I-vector Concatenation for PLDA based Language Identification System', in The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015,

Conference Papers | 2015

Irtza S; Sethu V; Le P; Ambikairajah E; Li H, 2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015

Conference Papers | 2015

Khlif A; Sethu V, 2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015, pp. 330 - 335

Conference Papers | 2015

Sriskandaraja K; Sethu V; Le P; Ambikairajah E, 2015, 'A Model Based Voice Activity Detector for Noisy Environments', Dresden, Germany, presented at Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015,

Conference Papers | 2014

Cummins N; Epps J; Sethu V; Krajewski J, 2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 970 - 974,

Conference Papers | 2014

Cummins N; Sethu V; Epps J; Krajewski J, 2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1238 - 1242

Conference Papers | 2014

Kua JMK; Sethu V; Le P; Ambikairajah E, 2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 746 - 750

Conference Papers | 2013

Cummins N; Epps J; Sethu V; Breakspear M; Goecke R, 2013, 'Modeling Spectral Variability for the Classification of Depressed Speech', in INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at 14th Annual Conference of the International Speech Communication Association Interspeech2013, Lyon, France, 25 August 2013 - 29 August 2013

Conference Papers | 2013

Cummins N; Joshi J; Dhall A; Sethu V; Goecke R; Epps J, 2013, 'Diagnosis of depression by behavioural signals: A multimodal approach', in AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge, pp. 11 - 20,

Conference Papers | 2013

Sethu V; Epps J; Ambikairajah E, 2013, 'GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge', in CERISARA C (ed.), INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at INTERSPEECH 2013 14thAnnual Conference of the International Speech Communication Association, Lyon, France, 25 August 2013 - 29 August 2013

Conference Papers | 2013

Sethu V; Epps J; Ambikairajah E, 2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 7522 - 7525,

Conference Papers | 2012

Ambikairajah E; Kua JM; Sethu V; Li H, 2012, 'PNCC-ivector-SRC based Speaker Verification', in 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012, APSIPA, Hollywood, California, USA, presented at Asia Pacific Signal and Information Processing Association, Hollywood, California, USA, 03 December 2012 - 06 December 2012

Conference Papers | 2012

Ding N; Sethu V; Epps JR; Ambikairajah E, 2012, 'Speaker variability in emotion recognition - An adaptation based approach', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., Piscataway, NJ, pp. 5101 - 5104, presented at 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, Japan, 25 March 2012 - 30 March 2012,

Conference Papers | 2011

Le PN; Sethu V; Ambikairajah E; Kua JMK, 2011, 'Investigation of the robustness of a non-uniform filterbank for cognitive load classification', in ICICS 2011 - 8th International Conference on Information, Communications and Signal Processing,

Conference Papers | 2010

Ambikairajah E; Ibrahim RK; Sethu V, 2010, 'Novel delta zero crossing regression features for gait pattern classification', IEEE, Beunos Aires, presented at Proceedings of the 32nd Annual International Conference of the IEEE EMBS, Beunos Aires, 31 August 2010 - 04 September 2010

Conference Papers | 2010

Le NP; Epps JR; Ambikairajah E; Sethu V, 2010, 'Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach', in The Proceedings of APSIPA ASC 2010, Asia-Pacific Signal Processing Association, Hong Kong, presented at Asia-Pacific Signal Processing Association Conf., Singapore, 14 December 2010 - 17 December 2010

Conference Papers | 2009

Sethu V; Ambikairajah E; Epps JR, 2009, 'Pitch Contour Prameterisation based on Linear Stylisation for Emotion Recognition', in Interspeech 2012, Curran Associates, Inc, Brighton, UK, presented at Interspeech 2009 Speech and Intelligence, Brighton, UK, 06 September 2009 - 10 September 2009

Conference Papers | 2009

Sethu V; Ambikairajah E; Epps JR, 2009, 'SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19 April 2009 - 24 April 2009

Conference Papers | 2009

Sethu V; Ambikairajah E; Epps J, 2009, 'Pitch contour parameterisation based on linear stylisation for emotion recognition', in Interspeech 2009, ISCA, presented at Interspeech 2009,

Conference Papers | 2008

Le NP; Ambikairajah E; Sethu V, 2008, 'Speech enhancement based on empirical mode decomposition', in Modelling, Identification and Control 2008, Innsbruck, Austria, presented at 5th IASTED International Conference on Signal Processing, Pattern Recognition and Applications 2008, Innsbruck, Austria, 13 February 2008 - 15 February 2008

Conference Papers | 2008

Sethu V; Ambikairajah E; Epps JR, 2008, 'Empirical mode decomposition based weighted frequency feature for speech-based emotion classification', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 31 March 2008 - 04 April 2008

Conference Papers | 2008

Sethu V; Ambikairajah E; Epps JR, 2008, 'Phonetic and speaker variations in automatic emotion classification', in Interspeech 2012, Curran Associates, Inc, Brisbane Australia, presented at Interspeech 2008, Brisbane Australia, 22 September 2008 - 26 September 2008

Conference Papers | 2007

Sethu V; Ambikairajah E; Epps JR, 2007, 'Group Delay Features for Emotion Detection', in INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECHCOMMUNICATION ASSOCIATION, VOLS 1-4, Isca-Inst Speech Communication Assoc, Baixas

Conference Papers | 2007

Sethu V; Ambikairajah E; Epps JR, 2007, 'Speaker normalisation for speech-based emotion detection', in 2007 15th International Conference on Digital Signal Processing, Wales, UK, presented at 15th International Conference on Digital Signal Processing 2007, Wales, UK, 01 July 2007 - 04 July 2007

Conference Papers | 2007

Wang Y; An J; Sethu V; Ambikairajah E, 2007, 'Perceptually motivated pre-filter for speech enhancement using Kalman filtering', in 2007 6th International Conference on Information, Communications and Signal Processing, ICICS,

Conference Papers | 2007

Sethu V; Ambikairajah E; Epps J, 2007, 'Group delay features for emotion detection', in Interspeech 2007, ISCA, presented at Interspeech 2007,

Conference Papers | 2006

Ambikairajah E; Sethu V; Ge L, 2006, 'Noise reduction in SAR interferograms using undecimated wavelet transform', in 2nd international symposium on Geo-information for Disaster Management, Goa, India, presented at 2nd international symposium on Geo-information for Disaster Management, Goa, India, 25 September 2006 - 26 September 2006

ARC Discovery Project (2020)
ARC Discovery Project (2019)
ARC LIEF Grant (2019)
UNSW Research Infrastructure (2019)
UNSW Faculty of Engineering Research Infrastructure (2018)
Huawei Innovation Research Program (2018)
UNSW SEIF Grant (2018)
ARC Linkage (2017)
UNSW Faculty of Engineering Silverstar (2016)
UNSW Strategic Educational Development Grant (2014)
NICTA International Postgraduate Award (2006-2009)

Research Interests include:

Artificial Emotional Intelligence and Speech based Emotion Recognition
Computational models of cochlear signal processing
Speaker recognition/Voice biometrics
Application of machine learning to signal processing tasks

My Teaching

I currently teach or have previously taught the following courses at UNSW:

Data Science for Electrical Engineers (ELEC9741)
Speech Processing (ELEC9723)
Digital Signal Processing (ELEC3104)
Electrical Systems Design (ELEC2117)
Design Proficiency (ELEC/TELE/PHTN4123)

91���˰涶��

Follow

Associate Professor Vidhyasaharan Sethu

91��˰涶��