François Germain

François Germain
  • Biography

    During his graduate studies, François worked on advancing the state of the art in efficient modelling of analog audio systems. Concurrently, he made important contributions to audio signal processing and spatial audio rendering during internships at Adobe Research, Dolby Laboratories and Intel Labs. Before joining MERL, he led research on music source separation and speech enhancement at iZotope. His research interests focus on efficient and robust signal processing and machine learning methods applied to speech, music, and audio content in general.

  • Recent News & Events

    •  NEWS    MERL Researchers to Present 2 Conference and 11 Workshop Papers at NeurIPS 2024
      Date: December 10, 2024 - December 15, 2024
      Where: Advances in Neural Processing Systems (NeurIPS)
      MERL Contacts: Petros T. Boufounos; Matthew Brand; Ankush Chakrabarty; Anoop Cherian; François Germain; Toshiaki Koike-Akino; Christopher R. Laughman; Jonathan Le Roux; Jing Liu; Suhas Lohit; Tim K. Marks; Yoshiki Masuyama; Kieran Parsons; Kuan-Chuan Peng; Diego Romeres; Pu (Perry) Wang; Ye Wang; Gordon Wichern
      Research Areas: Artificial Intelligence, Communications, Computational Sensing, Computer Vision, Control, Data Analytics, Dynamical Systems, Machine Learning, Multi-Physical Modeling, Optimization, Robotics, Signal Processing, Speech & Audio, Human-Computer Interaction, Information Security
      Brief
      • MERL researchers will attend and present the following papers at the 2024 Advances in Neural Processing Systems (NeurIPS) Conference and Workshops.

        1. "RETR: Multi-View Radar Detection Transformer for Indoor Perception" by Ryoma Yataka (Mitsubishi Electric), Adriano Cardace (Bologna University), Perry Wang (Mitsubishi Electric Research Laboratories), Petros Boufounos (Mitsubishi Electric Research Laboratories), Ryuhei Takahashi (Mitsubishi Electric). Main Conference. https://neurips.cc/virtual/2024/poster/95530

        2. "Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads" by Anoop Cherian (Mitsubishi Electric Research Laboratories), Kuan-Chuan Peng (Mitsubishi Electric Research Laboratories), Suhas Lohit (Mitsubishi Electric Research Laboratories), Joanna Matthiesen (Math Kangaroo USA), Kevin Smith (Massachusetts Institute of Technology), Josh Tenenbaum (Massachusetts Institute of Technology). Main Conference, Datasets and Benchmarks track. https://neurips.cc/virtual/2024/poster/97639

        3. "Probabilistic Forecasting for Building Energy Systems: Are Time-Series Foundation Models The Answer?" by Young-Jin Park (Massachusetts Institute of Technology), Jing Liu (Mitsubishi Electric Research Laboratories), François G Germain (Mitsubishi Electric Research Laboratories), Ye Wang (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories), Gordon Wichern (Mitsubishi Electric Research Laboratories), Navid Azizan (Massachusetts Institute of Technology), Christopher R. Laughman (Mitsubishi Electric Research Laboratories), Ankush Chakrabarty (Mitsubishi Electric Research Laboratories). Time Series in the Age of Large Models Workshop.

        4. "Forget to Flourish: Leveraging Model-Unlearning on Pretrained Language Models for Privacy Leakage" by Md Rafi Ur Rashid (Penn State University), Jing Liu (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories), Shagufta Mehnaz (Penn State University), Ye Wang (Mitsubishi Electric Research Laboratories). Workshop on Red Teaming GenAI: What Can We Learn from Adversaries?

        5. "Spatially-Aware Losses for Enhanced Neural Acoustic Fields" by Christopher Ick (New York University), Gordon Wichern (Mitsubishi Electric Research Laboratories), Yoshiki Masuyama (Mitsubishi Electric Research Laboratories), François G Germain (Mitsubishi Electric Research Laboratories), Jonathan Le Roux (Mitsubishi Electric Research Laboratories). Audio Imagination Workshop.

        6. "FV-NeRV: Neural Compression for Free Viewpoint Videos" by Sorachi Kato (Osaka University), Takuya Fujihashi (Osaka University), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories), Takashi Watanabe (Osaka University). Machine Learning and Compression Workshop.

        7. "GPT Sonography: Hand Gesture Decoding from Forearm Ultrasound Images via VLM" by Keshav Bimbraw (Worcester Polytechnic Institute), Ye Wang (Mitsubishi Electric Research Laboratories), Jing Liu (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories). AIM-FM: Advancements In Medical Foundation Models: Explainability, Robustness, Security, and Beyond Workshop.

        8. "Smoothed Embeddings for Robust Language Models" by Hase Ryo (Mitsubishi Electric), Md Rafi Ur Rashid (Penn State University), Ashley Lewis (Ohio State University), Jing Liu (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories), Kieran Parsons (Mitsubishi Electric Research Laboratories), Ye Wang (Mitsubishi Electric Research Laboratories). Safe Generative AI Workshop.

        9. "Slaying the HyDRA: Parameter-Efficient Hyper Networks with Low-Displacement Rank Adaptation" by Xiangyu Chen (University of Kansas), Ye Wang (Mitsubishi Electric Research Laboratories), Matthew Brand (Mitsubishi Electric Research Laboratories), Pu Wang (Mitsubishi Electric Research Laboratories), Jing Liu (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories). Workshop on Adaptive Foundation Models.

        10. "Preference-based Multi-Objective Bayesian Optimization with Gradients" by Joshua Hang Sai Ip (University of California Berkeley), Ankush Chakrabarty (Mitsubishi Electric Research Laboratories), Ali Mesbah (University of California Berkeley), Diego Romeres (Mitsubishi Electric Research Laboratories). Workshop on Bayesian Decision-Making and Uncertainty. Lightning talk spotlight.

        11. "TR-BEACON: Shedding Light on Efficient Behavior Discovery in High-Dimensions with Trust-Region-based Bayesian Novelty Search" by Wei-Ting Tang (Ohio State University), Ankush Chakrabarty (Mitsubishi Electric Research Laboratories), Joel A. Paulson (Ohio State University). Workshop on Bayesian Decision-Making and Uncertainty.

        12. "MEL-PETs Joint-Context Attack for the NeurIPS 2024 LLM Privacy Challenge Red Team Track" by Ye Wang (Mitsubishi Electric Research Laboratories), Tsunato Nakai (Mitsubishi Electric), Jing Liu (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories), Kento Oonishi (Mitsubishi Electric), Takuya Higashi (Mitsubishi Electric). LLM Privacy Challenge. Special Award for Practical Attack.

        13. "MEL-PETs Defense for the NeurIPS 2024 LLM Privacy Challenge Blue Team Track" by Jing Liu (Mitsubishi Electric Research Laboratories), Ye Wang (Mitsubishi Electric Research Laboratories), Toshiaki Koike-Akino (Mitsubishi Electric Research Laboratories), Tsunato Nakai (Mitsubishi Electric), Kento Oonishi (Mitsubishi Electric), Takuya Higashi (Mitsubishi Electric). LLM Privacy Challenge. Won 3rd Place Award.

        MERL members also contributed to the organization of the Multimodal Algorithmic Reasoning (MAR) Workshop (https://marworkshop.github.io/neurips24/). Organizers: Anoop Cherian (Mitsubishi Electric Research Laboratories), Kuan-Chuan Peng (Mitsubishi Electric Research Laboratories), Suhas Lohit (Mitsubishi Electric Research Laboratories), Honglu Zhou (Salesforce Research), Kevin Smith (Massachusetts Institute of Technology), Tim K. Marks (Mitsubishi Electric Research Laboratories), Juan Carlos Niebles (Salesforce AI Research), Petar Veličković (Google DeepMind).
    •  
    •  EVENT    MERL Contributes to ICASSP 2024
      Date: Sunday, April 14, 2024 - Friday, April 19, 2024
      Location: Seoul, South Korea
      MERL Contacts: Petros T. Boufounos; François Germain; Chiori Hori; Sameer Khurana; Toshiaki Koike-Akino; Jonathan Le Roux; Hassan Mansour; Kieran Parsons; Joshua Rapp; Anthony Vetro; Pu (Perry) Wang; Gordon Wichern
      Research Areas: Artificial Intelligence, Computational Sensing, Machine Learning, Robotics, Signal Processing, Speech & Audio
      Brief
      • MERL has made numerous contributions to both the organization and technical program of ICASSP 2024, which is being held in Seoul, Korea from April 14-19, 2024.

        Sponsorship and Awards

        MERL is proud to be a Bronze Patron of the conference and will participate in the student job fair on Thursday, April 18. Please join this session to learn more about employment opportunities at MERL, including openings for research scientists, post-docs, and interns.

        MERL is pleased to be the sponsor of two IEEE Awards that will be presented at the conference. We congratulate Prof. Stéphane G. Mallat, the recipient of the 2024 IEEE Fourier Award for Signal Processing, and Prof. Keiichi Tokuda, the recipient of the 2024 IEEE James L. Flanagan Speech and Audio Processing Award.

        Jonathan Le Roux, MERL Speech and Audio Senior Team Leader, will also be recognized during the Awards Ceremony for his recent elevation to IEEE Fellow.

        Technical Program

        MERL will present 13 papers in the main conference on a wide range of topics including automated audio captioning, speech separation, audio generative models, speech and sound synthesis, spatial audio reproduction, multimodal indoor monitoring, radar imaging, depth estimation, physics-informed machine learning, and integrated sensing and communications (ISAC). Three workshop papers have also been accepted for presentation on audio-visual speaker diarization, music source separation, and music generative models.

        Perry Wang is the co-organizer of the Workshop on Signal Processing and Machine Learning Advances in Automotive Radars (SPLAR), held on Sunday, April 14. It features keynote talks from leaders in both academia and industry, peer-reviewed workshop papers, and lightning talks from ICASSP regular tracks on signal processing and machine learning for automotive radar and, more generally, radar perception.

        Gordon Wichern will present an invited keynote talk on analyzing and interpreting audio deep learning models at the Workshop on Explainable Machine Learning for Speech and Audio (XAI-SA), held on Monday, April 15. He will also appear in a panel discussion on interpretable audio AI at the workshop.

        Perry Wang also co-organizes a two-part special session on Next-Generation Wi-Fi Sensing (SS-L9 and SS-L13) which will be held on Thursday afternoon, April 18. The special session includes papers on PHY-layer oriented signal processing and data-driven deep learning advances, and supports upcoming 802.11bf WLAN Sensing Standardization activities.

        Petros Boufounos is participating as a mentor in ICASSP’s Micro-Mentoring Experience Program (MiME).

        About ICASSP

        ICASSP is the flagship conference of the IEEE Signal Processing Society, and the world's largest and most comprehensive technical conference focused on the research advances and latest technological development in signal and information processing. The event attracts more than 3000 participants.
    •  

    See All News & Events for François
  • Awards

    •  AWARD    MERL team wins the Listener Acoustic Personalisation (LAP) 2024 Challenge
      Date: August 29, 2024
      Awarded to: Yoshiki Masuyama, Gordon Wichern, Francois G. Germain, Christopher Ick, and Jonathan Le Roux
      MERL Contacts: François Germain; Jonathan Le Roux; Gordon Wichern; Yoshiki Masuyama
      Research Areas: Artificial Intelligence, Machine Learning, Speech & Audio
      Brief
      • MERL's Speech & Audio team ranked 1st out of 7 teams in Task 2 of the 1st SONICOM Listener Acoustic Personalisation (LAP) Challenge, which focused on "Spatial upsampling for obtaining a high-spatial-resolution HRTF from a very low number of directions". The team was led by Yoshiki Masuyama, and also included Gordon Wichern, Francois Germain, MERL intern Christopher Ick, and Jonathan Le Roux.

        The LAP Challenge workshop and award ceremony was hosted by the 32nd European Signal Processing Conference (EUSIPCO 24) on August 29, 2024 in Lyon, France. Yoshiki Masuyama presented the team's method, "Retrieval-Augmented Neural Field for HRTF Upsampling and Personalization", and received the award from Prof. Michele Geronazzo (University of Padova, IT, and Imperial College London, UK), Chair of the Challenge's Organizing Committee.

        The LAP challenge aims to explore challenges in the field of personalized spatial audio, with the first edition focusing on the spatial upsampling and interpolation of head-related transfer functions (HRTFs). HRTFs with dense spatial grids are required for immersive audio experiences, but their recording is time-consuming. Although HRTF spatial upsampling has recently shown remarkable progress with approaches involving neural fields, HRTF estimation accuracy remains limited when upsampling from only a few measured directions, e.g., 3 or 5 measurements. The MERL team tackled this problem by proposing a retrieval-augmented neural field (RANF). RANF retrieves a subject whose HRTFs are close to those of the target subject at the measured directions from a library of subjects. The HRTF of the retrieved subject at the target direction is fed into the neural field in addition to the desired sound source direction. The team also developed a neural network architecture that can handle an arbitrary number of retrieved subjects, inspired by a multi-channel processing technique called transform-average-concatenate.
    •  
    •  AWARD    MERL team wins the Audio-Visual Speech Enhancement (AVSE) 2023 Challenge
      Date: December 16, 2023
      Awarded to: Zexu Pan, Gordon Wichern, Yoshiki Masuyama, Francois Germain, Sameer Khurana, Chiori Hori, and Jonathan Le Roux
      MERL Contacts: François Germain; Chiori Hori; Sameer Khurana; Jonathan Le Roux; Gordon Wichern; Yoshiki Masuyama
      Research Areas: Artificial Intelligence, Machine Learning, Speech & Audio
      Brief
      • MERL's Speech & Audio team ranked 1st out of 12 teams in the 2nd COG-MHEAR Audio-Visual Speech Enhancement Challenge (AVSE). The team was led by Zexu Pan, and also included Gordon Wichern, Yoshiki Masuyama, Francois Germain, Sameer Khurana, Chiori Hori, and Jonathan Le Roux.

        The AVSE challenge aims to design better speech enhancement systems by harnessing the visual aspects of speech (such as lip movements and gestures) in a manner similar to the brain’s multi-modal integration strategies. MERL’s system was a scenario-aware audio-visual TF-GridNet, that incorporates the face recording of a target speaker as a conditioning factor and also recognizes whether the predominant interference signal is speech or background noise. In addition to outperforming all competing systems in terms of objective metrics by a wide margin, in a listening test, MERL’s model achieved the best overall word intelligibility score of 84.54%, compared to 57.56% for the baseline and 80.41% for the next best team. The Fisher’s least significant difference (LSD) was 2.14%, indicating that our model offered statistically significant speech intelligibility improvements compared to all other systems.
    •  
    •  AWARD    Joint CMU-MERL team wins DCASE2023 Challenge on Automated Audio Captioning
      Date: June 1, 2023
      Awarded to: Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-weon Jung, Francois Germain, Jonathan Le Roux, Shinji Watanabe
      MERL Contacts: François Germain; Jonathan Le Roux; Gordon Wichern
      Research Areas: Artificial Intelligence, Machine Learning, Speech & Audio
      Brief
      • A joint team consisting of members of CMU Professor and MERL Alumn Shinji Watanabe's WavLab and members of MERL's Speech & Audio team ranked 1st out of 11 teams in the DCASE2023 Challenge's Task 6A "Automated Audio Captioning". The team was led by student Shih-Lun Wu and also featured Ph.D. candidate Xuankai Chang, Postdoctoral research associate Jee-weon Jung, Prof. Shinji Watanabe, and MERL researchers Gordon Wichern, Francois Germain, and Jonathan Le Roux.

        The IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE Challenge), started in 2013, has been organized yearly since 2016, and gathers challenges on multiple tasks related to the detection, analysis, and generation of sound events. This year, the DCASE2023 Challenge received over 428 submissions from 123 teams across seven tasks.

        The CMU-MERL team competed in the Task 6A track, Automated Audio Captioning, which aims at generating informative descriptions for various sounds from nature and/or human activities. The team's system made strong use of large pretrained models, namely a BEATs transformer as part of the audio encoder stack, an Instructor Transformer encoding ground-truth captions to derive an audio-text contrastive loss on the audio encoder, and ChatGPT to produce caption mix-ups (i.e., grammatical and compact combinations of two captions) which, together with the corresponding audio mixtures, increase not only the amount but also the complexity and diversity of the training data. The team's best submission obtained a SPIDEr-FL score of 0.327 on the hidden test set, largely outperforming the 2nd best team's 0.315.
    •  
    See All Awards for MERL
  • Research Highlights

  • MERL Publications

    •  Ick, Christopher, Wichern, Gordon, Masuyama, Yoshiki, Germain, François G, Le Roux, Jonathan, "Spatially-Aware Losses for Enhanced Neural Acoustic Fields", Tech. Rep. TR2024-169, Mitsubishi Electric Research Laboratories, Cambridge, MA, December 2024.
      BibTeX TR2024-169 PDF
      • @techreport{MERL_TR2024-169,
      • author = {Ick, Christopher; Wichern, Gordon; Masuyama, Yoshiki; Germain, François G; Le Roux, Jonathan},
      • title = {Spatially-Aware Losses for Enhanced Neural Acoustic Fields},
      • institution = {MERL - Mitsubishi Electric Research Laboratories},
      • address = {Cambridge, MA 02139},
      • number = {TR2024-169},
      • month = dec,
      • year = 2024,
      • url = {https://www.merl.com/publications/TR2024-169/}
      • }
    •  Ick, C., Wichern, G., Masuyama, Y., Germain, F.G., Le Roux, J., "Spatially-Aware Losses for Enhanced Neural Acoustic Fields", NeurIPS 2024 Audio Imagination Workshop, December 2024.
      BibTeX
      • @inproceedings{Ick2024dec,
      • author = {{Ick, Christopher and Wichern, Gordon and Masuyama, Yoshiki and Germain, François G and Le Roux, Jonathan}},
      • title = {Spatially-Aware Losses for Enhanced Neural Acoustic Fields},
      • booktitle = {NeurIPS 2024 Audio Imagination Workshop},
      • year = 2024,
      • month = dec
      • }
    •  Saijo, K., Ebbers, J., Germain, F.G., Wichern, G., Le Roux, J., "Task-Aware Unified Source Separation", arXiv, October 2024.
      BibTeX arXiv
      • @article{Saijo2024oct,
      • author = {Saijo, Kohei and Ebbers, Janek and Germain, François G and Wichern, Gordon and Le Roux, Jonathan}},
      • title = {Task-Aware Unified Source Separation},
      • journal = {arXiv},
      • year = 2024,
      • month = oct,
      • url = {https://arxiv.org/abs/2410.23987v1}
      • }
    •  Saijo, K., Ebbers, J., Germain, F.G., Khurana, S., Wichern, G., Le Roux, J., "Leveraging Audio-Only Data for Text-Queried Target Sound Extraction", arXiv, September 2024.
      BibTeX arXiv
      • @article{Saijo2024sep3,
      • author = {{Saijo, Kohei and Ebbers, Janek and Germain, François G and Khurana, Sameer and Wichern, Gordon and Le Roux, Jonathan}},
      • title = {Leveraging Audio-Only Data for Text-Queried Target Sound Extraction},
      • journal = {arXiv},
      • year = 2024,
      • month = sep,
      • url = {https://arxiv.org/abs/2409.13152v1}
      • }
    •  Saijo, K., Wichern, G., Germain, F.G., Pan, Z., Le Roux, J., "TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement", International Workshop on Acoustic Signal Enhancement (IWAENC), DOI: 10.1109/​IWAENC61483.2024.10694313, September 2024, pp. 205-209.
      BibTeX TR2024-126 PDF Software
      • @inproceedings{Saijo2024sep2,
      • author = {Saijo, Kohei and Wichern, Gordon and Germain, François G and Pan, Zexu and Le Roux, Jonathan}},
      • title = {TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement},
      • booktitle = {International Workshop on Acoustic Signal Enhancement (IWAENC)},
      • year = 2024,
      • pages = {205--209},
      • month = sep,
      • doi = {10.1109/IWAENC61483.2024.10694313},
      • issn = {2835-3439},
      • isbn = {979-8-3503-6185-8},
      • url = {https://www.merl.com/publications/TR2024-126}
      • }
    See All MERL Publications for François
  • Other Publications

    •  François G. Germain, "Periodic Analysis of Nonlinear Virtual Analog Models", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), October 2021, pp. 321-325.
      BibTeX
      • @Inproceedings{Germain:PeriodicAnalysisNonlinear:2021,
      • author = {Germain, Fran\c{c}ois G.},
      • title = {Periodic Analysis of Nonlinear Virtual Analog Models},
      • booktitle = {{IEEE} Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
      • year = 2021,
      • pages = {321--325},
      • month = oct
      • }
    •  François G. Germain, "Practical Virtual Analog Modeling Using Möbius Transforms", International Conference on Digital Audio Effects (DAFx), September 2021, pp. 49-56.
      BibTeX
      • @Inproceedings{Germain:PracticalVirtualAnalog:2021,
      • author = {Germain, Fran\c{c}ois G.},
      • title = {Practical Virtual Analog Modeling Using Möbius Transforms},
      • booktitle = {International Conference on Digital Audio Effects (DAFx)},
      • year = 2021,
      • pages = {49--56},
      • month = sep
      • }
    •  Kurt James Werner, Francois G. Germain and Cory S. Goldsmith, "Energy-preserving Time-varying Schroeder Allpass Filters and Multichannel Extensions", Journal of the Audio Engineering Society (AES), Vol. 69, No. 7/8, pp. 465-485, 2021.
      BibTeX
      • @Article{WernerGermainGoldsmith:EnergypreservingTime:2021,
      • author = {Werner, Kurt James and Germain, Francois G. and Goldsmith, Cory S.},
      • title = {Energy-preserving Time-varying Schroeder Allpass Filters and Multichannel Extensions},
      • journal = {Journal of the Audio Engineering Society (AES)},
      • year = 2021,
      • volume = 69,
      • number = {7/8},
      • pages = {465--485}
      • }
    •  François G. Germain, "Non-oversampled Physical Modeling for Virtual Analog Simulations", 2019, Stanford University.
      BibTeX
      • @Phdthesis{Germain:NonoversampledPhysical:2019,
      • author = {Germain, Fran\c{c}ois G.},
      • title = {Non-oversampled Physical Modeling for Virtual Analog Simulations},
      • school = {{S}tanford University},
      • year = 2019
      • }
    •  Francois G. Germain, Qifeng Chen and Vladlen Koltun, "Speech Denoising with Deep Feature Losses", INTERSPEECH Conference, September 2018, pp. 2723-2727.
      BibTeX
      • @Inproceedings{GermainChenKoltun:SpeechDenoisingDeep:2018,
      • author = {Germain, Francois G. and Chen, Qifeng and Koltun, Vladlen},
      • title = {Speech Denoising with Deep Feature Losses},
      • booktitle = {{INTERSPEECH} Conference},
      • year = 2018,
      • pages = {2723--2727},
      • month = sep
      • }
    •  François G. Germain and Kurt James Werner, "Optimizing Differentiated Discretization for Audio Circuits beyond Driving Point Transfer Functions", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), October 2017, pp. 384-388.
      BibTeX
      • @Inproceedings{GermainWerner:OptimizingDifferentiatedDiscretization:2017,
      • author = {Germain, Fran\c{c}ois G. and Werner, Kurt James},
      • title = {Optimizing Differentiated Discretization for Audio Circuits beyond Driving Point Transfer Functions},
      • booktitle = {{IEEE} Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
      • year = 2017,
      • pages = {384--388},
      • month = oct
      • }
    •  François G. Germain, "Fixed-rate Modeling of Audio Lumped Systems: A Comparison between Trapezoidal and Implicit Midpoint Methods", International Conference on Digital Audio Effects (DAFx), September 2017, pp. 168-75.
      BibTeX
      • @Inproceedings{Germain:FixedrateModeling:2017,
      • author = {Germain, Fran\c{c}ois G.},
      • title = {Fixed-rate Modeling of Audio Lumped Systems: A Comparison between Trapezoidal and Implicit Midpoint Methods},
      • booktitle = {International Conference on Digital Audio Effects (DAFx)},
      • year = 2017,
      • pages = {168--75},
      • month = sep
      • }
    •  Michael Jørgen Olsen, Kurt James Werner and François G. Germain, "Network Variable Preserving Step-size Control in Wave Digital Filters", International Conference on Digital Audio Effects (DAFx), September 2017, pp. 200-207.
      BibTeX
      • @Inproceedings{OlsenWernerGermain:NetworkVariablePreserving:2017,
      • author = {Olsen, Michael J{\o}rgen and Werner, Kurt James and Germain, Fran{\c{c}}ois G.},
      • title = {Network Variable Preserving Step-size Control in Wave Digital Filters},
      • booktitle = {International Conference on Digital Audio Effects (DAFx)},
      • year = 2017,
      • pages = {200--207},
      • month = sep
      • }
    •  François G. Germain and Kurt James Werner, "Joint Parameter Optimization of Differentiated Discretization Schemes for Audio Circuits", Audio Engineering Society (AES) Convention, May 2017.
      BibTeX
      • @Inproceedings{GermainWerner:JointParameterOptimization:2017,
      • author = {Germain, Fran\c{c}ois G. and Werner, Kurt James},
      • title = {Joint Parameter Optimization of Differentiated Discretization Schemes for Audio Circuits},
      • booktitle = {Audio Engineering Society (AES) Convention},
      • year = 2017,
      • month = may
      • }
    •  Kurt James Werner, W. Ross Dunkel and François G. Germain, "A Computational Model of the Hammond Organ Vibrato/chorus Using Wave Digital Filters", International Conference on Digital Audio Effects (DAFx), September 2016, pp. 271-278.
      BibTeX
      • @Inproceedings{WernerDunkelGermain:ComputationalModelHammond:2016,
      • author = {Werner, Kurt James and Dunkel, W. Ross and Germain, Fran{\c{c}}ois G.},
      • title = {A Computational Model of the Hammond Organ Vibrato/chorus Using Wave Digital Filters},
      • booktitle = {International Conference on Digital Audio Effects (DAFx)},
      • year = 2016,
      • pages = {271--278},
      • month = sep
      • }
    •  François G. Germain, Gautham J. Mysore and Takako Fujioka, "Equalization Matching of Speech Recordings in Real-world Environments", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2016, pp. 609-613.
      BibTeX
      • @Inproceedings{GermainMysoreFujioka:EqualizationMatchingSpeech:2016,
      • author = {Germain, Fran\c{c}ois G. and Mysore, Gautham J. and Fujioka, Takako},
      • title = {Equalization Matching of Speech Recordings in Real-world Environments},
      • booktitle = {{IEEE} International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
      • year = 2016,
      • pages = {609--613},
      • month = mar
      • }
    •  Kurt James Werner and François Georges Germain, "Sinusoidal Parameter Estimation Using Quadratic Interpolation around Power-scaled Magnitude Spectrum Peaks", Applied Sciences, Vol. 6, No. 10, pp. 306, 2016.
      BibTeX
      • @Article{WernerGermain:SinusoidalParameterEstimation:2016,
      • author = {Werner, Kurt James and Germain, Fran{\c{c}}ois Georges},
      • title = {Sinusoidal Parameter Estimation Using Quadratic Interpolation around Power-scaled Magnitude Spectrum Peaks},
      • journal = {Applied Sciences},
      • year = 2016,
      • volume = 6,
      • number = 10,
      • pages = 306,
      • publisher = {MDPI}
      • }
    •  François G. Germain and Kurt James Werner, "Design Principles for Lumped Model Discretization Using Möbius Transforms", International Conference on Digital Audio Effects (DAFx), December 2015, pp. 371-378.
      BibTeX
      • @Inproceedings{GermainWerner:DesignPrinciplesLumped:2015,
      • author = {Germain, Fran\c{c}ois G. and Werner, Kurt James},
      • title = {Design Principles for Lumped Model Discretization Using Möbius Transforms},
      • booktitle = {International Conference on Digital Audio Effects (DAFx)},
      • year = 2015,
      • pages = {371--378},
      • month = dec
      • }
    •  François G. Germain and Gautham J. Mysore, "Speaker and Noise Independent Online Single-channel Speech Enhancement", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), April 2015, pp. 71-75.
      BibTeX
      • @Inproceedings{GermainMysore:SpeakerNoiseIndependent:2015,
      • author = {Germain, Fran{\c{c}}ois G. and Mysore, Gautham J.},
      • title = {Speaker and Noise Independent Online Single-channel Speech Enhancement},
      • booktitle = {{IEEE} International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
      • year = 2015,
      • pages = {71--75},
      • month = apr
      • }
    •  Francois G. Germain, Iretiayo A. Akinola, Qiyuan Tian, Steven P. Lansel and Brian A. Wandell, "Efficient Illuminant Correction in the Local, Linear, Learned (L3) Method", Digital Photography XI, February 2015, vol. 9404, pp. 24-30.
      BibTeX
      • @Inproceedings{GermainAkinolaTianEtAl:EfficientIlluminantCorrection:2015,
      • author = {Germain, Francois G. and Akinola, Iretiayo A. and Tian, Qiyuan and Lansel, Steven P. and Wandell, Brian A.},
      • title = {Efficient Illuminant Correction in the Local, Linear, Learned (L3) Method},
      • booktitle = {Digital Photography XI},
      • year = 2015,
      • volume = 9404,
      • pages = {24--30},
      • month = feb
      • }
    •  François G. Germain and Gautham J. Mysore, "Stopping Criteria for Non-negative Matrix Factorization Based Supervised and Semi-supervised Source Separation", IEEE Signal Processing Letters, Vol. 21, No. 10, pp. 1284-1288, 2014.
      BibTeX
      • @Article{GermainMysore:StoppingCriteriaNon:2014,
      • author = {Germain, Fran\c{c}ois G. and Mysore, Gautham J.},
      • title = {Stopping Criteria for Non-negative Matrix Factorization Based Supervised and Semi-supervised Source Separation},
      • journal = {{IEEE} Signal Processing Letters},
      • year = 2014,
      • volume = 21,
      • number = 10,
      • pages = {1284--1288},
      • publisher = {IEEE}
      • }
    •  Zafar Rafii, François G. Germain, Dennis L. Sun and Gautham J. Mysore, "Combining Modeling of Singing Voice and Background Music for Automatic Separation of Musical Mixtures", Internation Society for Music Information Retrieval (ISMIR) Conference, November 2013, pp. 41-46.
      BibTeX
      • @Inproceedings{RafiiGermainSunEtAl:CombiningModelingSinging:2013,
      • author = {Rafii, Zafar and Germain, Fran{\c{c}}ois G. and Sun, Dennis L. and Mysore, Gautham J.},
      • title = {Combining Modeling of Singing Voice and Background Music for Automatic Separation of Musical Mixtures},
      • booktitle = {Internation Society for Music Information Retrieval (ISMIR) Conference},
      • year = 2013,
      • pages = {41--46},
      • month = nov
      • }
    •  François G. Germain, Dennis L. Sun and Gautham J. Mysore, "Speaker and Noise Independent Voice Activity Detection", INTERSPEECH Conference, August 2013, pp. 732-736.
      BibTeX
      • @Inproceedings{GermainSunMysore:SpeakerNoiseIndependent:2013,
      • author = {Germain, François G. and Sun, Dennis L. and Mysore, Gautham J.},
      • title = {Speaker and Noise Independent Voice Activity Detection},
      • booktitle = {{INTERSPEECH} Conference},
      • year = 2013,
      • pages = {732--736},
      • month = aug
      • }
    •  François G. Germain, Jonathan S. Abel, Philippe Depalle and Marcelo M. Wanderley, "Uniform Noise Sequencers for Nonlinear System Identification", International Conference on Digital Audio Effects (DAFx), September 2012, pp. 241-244.
      BibTeX
      • @Inproceedings{GermainAbelDepalleEtAl:UniformNoiseSequencers:2012,
      • author = {Germain, Fran\c{c}ois G. and Abel, Jonathan S. and Depalle, Philippe and Wanderley, Marcelo M.},
      • title = {Uniform Noise Sequencers for Nonlinear System Identification},
      • booktitle = {International Conference on Digital Audio Effects (DAFx)},
      • year = 2012,
      • pages = {241--244},
      • address = {York, United Kingdom},
      • month = sep
      • }
    •  François Georges Germain, "A Nonlinear Analysis Framework for Electronic Synthesizer Circuits", October 2011, McGill University.
      BibTeX
      • @Mastersthesis{Germain:NonlinearAnalysisFramework:2011,
      • author = {Germain, Fran\c{c}ois Georges},
      • title = {A Nonlinear Analysis Framework for Electronic Synthesizer Circuits},
      • school = {McGill University},
      • year = 2011,
      • address = {Montr{\'e}al, Canada},
      • month = oct
      • }
    •  Vincent Freour, Gary P. Scavone, Antoine Lefebvre and François Germain, "Acoustical Properties of the Vocal-tract in Trombone Performance", Forum Acusticum, June 2011, pp. 625-630.
      BibTeX
      • @Inproceedings{FreourScavoneLefebvreEtAl:AcousticalPropertiesVocal:2011,
      • author = {Freour, Vincent and Scavone, Gary P. and Lefebvre, Antoine and Germain, Fran{\c{c}}ois},
      • title = {Acoustical Properties of the Vocal-tract in Trombone Performance},
      • booktitle = {Forum Acusticum},
      • year = 2011,
      • pages = {625--630},
      • month = jun
      • }
    •  François Germain and Gianpaolo Evangelista, "Synthesis of Guitar by Digital Waveguides: Modeling the Plectrum in the Physical Interaction of the Player with the Instrument", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), October 2009, pp. 25-28.
      BibTeX
      • @Inproceedings{GermainEvangelista:SynthesisGuitarDigital:2009,
      • author = {Germain, Fran{\c{c}}ois and Evangelista, Gianpaolo},
      • title = {Synthesis of Guitar by Digital Waveguides: Modeling the Plectrum in the Physical Interaction of the Player with the Instrument},
      • booktitle = {{IEEE} Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
      • year = 2009,
      • pages = {25--28},
      • month = oct
      • }
  • Software & Data Downloads