Game Audio Craft & Adaptive-Audio Tech

144 items · default last 14 days

Task-Vector Arithmetic for Emotional Expressivity Control in Language-Model-Based Text-to-Speech arXiv — Sound (cs.SD) 8h
nnAudio 2: Overcoming Dynamic Compilation Barriers and Transform Inconsistencies arXiv — Sound (cs.SD) 8h
Exploring LLMs for South Asian Music Understanding and Generation arXiv — Sound (cs.SD) 8h
Probing Spatial Structure in Pretrained Audio Representations arXiv — Sound (cs.SD) 8h
Sound Effects Dataset Unification With the Universal Category System arXiv — Sound (cs.SD) 8h
SB-RF: Schr\"odinger Bridge Rectified Flow for One-Step Robust Speech Enhancement arXiv — Sound (cs.SD) 8h
Beyond Waveform Robustness: Robust Feature-Vocoder Adversarial Attacks on Automatic Speech Recognition arXiv — Sound (cs.SD) 8h
Do speech foundation models perceive speaker similarity as humans do? arXiv — Sound (cs.SD) 8h
SagnacAssisted Enhanced OTDR for Distributed Acoustic Sensing: A Standardized Benchmark and Engineering Evaluation Framework arXiv — Sound (cs.SD) 8h
UniVoice: A Unified Model for Speech and Singing Voice Generation arXiv — Sound (cs.SD) 8h
GLASS: GRPO-Trained LoRA for Acoustic Style Steering in Zero-Shot Text-to-Speech arXiv — Sound (cs.SD) 8h
Beyond WER: A Paired Acoustic Stress Test for Ambient Clinical Scribes arXiv — Sound (cs.SD) 8h
DBHN-Net: Dual-Branch Hybrid Neural Network For Low-Complexity Monaural Speech Enhancement arXiv — Sound (cs.SD) 8h
SpeechJBB: Probing Safety Alignment and Comprehension in Large Audio Language Models under Code-Switched Speech arXiv — Sound (cs.SD) 8h
Learning Emotion-discriminative Representations for Zero-Shot Cross-lingual Speech Emotion Recognition arXiv — Sound (cs.SD) 8h
Behind the Hand-Made Sound of ‘MOUSE: P.I. For Hire’ – with Damian Czajka, Lukasz Koscielny, and Patryk Scelina A Sound Effect — Blog yest
The Audio Podcast Alliance page has a brand new look! A Sound Effect — Blog yest
Channel-Oriented Design for EEG-to-Music Reconstruction arXiv — Sound (cs.SD) yest
The Differentiable Auditory Loop (DAL): An ML Framework for Hyper-Personalized Hearing Aids arXiv — Sound (cs.SD) yest
Feasibility of Time-Domain DNN-Based Speech Enhancement on Embedded FPGA for Hearing Aid arXiv — Sound (cs.SD) yest
Gauss Circle Lattices with Geometric Convolutions for Synthesizing High Dimensional Image-Source Room Impulse Responses arXiv — Sound (cs.SD) yest
CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding arXiv — Sound (cs.SD) yest
A Second-Order Cepstral Signature of Contact-Vibration Sounds Reproduced by Laptop Loudspeakers: A Synthetic Case Study arXiv — Sound (cs.SD) yest
Flow-HOA: Generative Joint Optimization for Ambisonics Encoding via Flow Matching arXiv — Sound (cs.SD) yest
SHB-AE: Spherical harmonic beamforming based Ambisonics encoding and upscaling method for smartphone microphone array arXiv — Sound (cs.SD) yest
Drift-Augmented Scoring: Text-Derived Noise Robustness for Zero-Shot Audio-Language Classification arXiv — Sound (cs.SD) yest
SURF: Separation via Unsupervised Remixing Flow arXiv — Sound (cs.SD) yest
FoeGlass: Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors arXiv — Sound (cs.SD) yest
Audio Interaction Model arXiv — Sound (cs.SD) yest
Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models arXiv — Sound (cs.SD) yest
DetectZoo: A Unified Toolkit for AI-Generated Content Detection Across Text, Audio, and Image Modalities arXiv — Sound (cs.SD) yest
Representation Matters in Randomized Smoothing for Audio Classification arXiv — Sound (cs.SD) yest
SegTune: Structured and Fine-Grained Control for Song Generation arXiv — Sound (cs.SD) Jun 3
EntangleCodec: A Unified Discrete Audio Tokenizer via Semantic-Acoustic Entanglement arXiv — Sound (cs.SD) Jun 3
A Training-Efficient Transformer-Based Anti-Spoofing Network for Logical Access in ASVspoof 5 arXiv — Sound (cs.SD) Jun 3
Audio Spotforming via Post-Filtering Using Cross-Array Non-target Estimates arXiv — Sound (cs.SD) Jun 3
SketchSong: Hierarchical Song Generation with Sketch Planning and Fine-Grained Multi-Track Modeling arXiv — Sound (cs.SD) Jun 3
Speech Emotion Recognition using Attention-based LSTM-Network with Residual Connection arXiv — Sound (cs.SD) Jun 3
Tonal parsimony in chord-sequence analysis: combining modulation cost and tonal vocabulary arXiv — Sound (cs.SD) Jun 3
Foley-Omni: A Unified Multimodal Generation Model from Task-Level Audio Synthesis to Complete Video Soundtrack Generation arXiv — Sound (cs.SD) Jun 3
LiveBand: Live Accompaniment Generation in the Audio Domain arXiv — Sound (cs.SD) Jun 3
FSA-GRPO: Teaching Auditory LLMs to Use Few-shot Demonstrations arXiv — Sound (cs.SD) Jun 3
Wavelet as Tokenizer: Preliminary Results on a Shared Wavelet Token Schema for Natural Signals arXiv — Sound (cs.SD) Jun 3
SVHalluc: Benchmarking Speech-Vision Hallucination in Audio-Visual Large Language Models arXiv — Sound (cs.SD) Jun 3
Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals arXiv — Sound (cs.SD) Jun 3
A Comparison of Generative and Discriminative Methods for Speech Enhancement: Robustness, Complexity, and Hallucination arXiv — Sound (cs.SD) Jun 3
AnyAudio-Judge: A Dynamic Rubric-Based Benchmark and Evaluator for Audio Instruction Following arXiv — Sound (cs.SD) Jun 3
DUET: Unified Dual-Space Emotion Control for Diffusion and Flow-Matching Driven Text-to-Speech arXiv — Sound (cs.SD) Jun 2
Quality Audio Prototyping: a prototype system for unified sound retrieval and procedural generation arXiv — Sound (cs.SD) Jun 2
Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty arXiv — Sound (cs.SD) Jun 2
Sympatheia: Emotionally Adaptive Voice Assistant with Continuous Affect Conditioning arXiv — Sound (cs.SD) Jun 2
MelT: GEMM-Native NDFT for Efficient Single-Stage Audio Frontends on Modern Accelerators arXiv — Sound (cs.SD) Jun 2
A Lightweight Slot-Attention Framework for Multi-Instrument Multi-Pitch Estimation arXiv — Sound (cs.SD) Jun 2
UniVocal: Unified Speech-Singing Code-Switching Synthesis arXiv — Sound (cs.SD) Jun 2
HAIM: Human-AI Music Datasets for AI Music Production Tracking Benchmark arXiv — Sound (cs.SD) Jun 2
JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions arXiv — Sound (cs.SD) Jun 2
MOSS-Audio Technical Report arXiv — Sound (cs.SD) Jun 2
Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space arXiv — Sound (cs.SD) Jun 2
C2GA: A Class-Controllable Generative Augmentation Framework for Respiratory Sound Classification arXiv — Sound (cs.SD) Jun 2
Parameter-efficient Dual-encoder Architecture with Differentiable Choquet Integral Fusion for Underwater Acoustic Classification arXiv — Sound (cs.SD) Jun 2
DAStatFormer: A Hybrid Multibranch Transformer with Statistical Feature Integration for DAS-Based Pattern Recognitions arXiv — Sound (cs.SD) Jun 2
Local Diagnostics of Continuous Normalizing Flow for Out-of-Distribution Detection arXiv — Sound (cs.SD) Jun 2
28 great new sound effects libraries: Mechs & robots, howler monkeys, vintage sirens, 192 kHz gore & more A Sound Effect — Blog Jun 1
Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation arXiv — Sound (cs.SD) Jun 1
3DAE: Binaural Quality Assessment for Audio Novel View Synthesis with Spatial Maps and Benchmark arXiv — Sound (cs.SD) Jun 1
Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS arXiv — Sound (cs.SD) Jun 1
AnchorSteer: Self-Discovered Concept Injection for Structure-Preserving Music Editing arXiv — Sound (cs.SD) Jun 1
Sound effects in media:A comparative analysis of recorded and synthetic samples in live-action and animation arXiv — Sound (cs.SD) Jun 1
MindVoice: Reconstructing Intelligible Speech from Non-invasive Neural Signals with Pretrained Priors arXiv — Sound (cs.SD) Jun 1
Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation arXiv — Sound (cs.SD) Jun 1
Escaping the Linearity Trap: Manifold Detours for Black-Box Adversarial Attacks on Singing Audio Deepfake Detection arXiv — Sound (cs.SD) Jun 1
Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors arXiv — Sound (cs.SD) Jun 1
GaMi: Geometry-Agnostic Material Identification via Cross-Modal Subtractive Disentanglement arXiv — Sound (cs.SD) Jun 1
A Unified and Reproducible Experimentation Framework for Speech Understanding arXiv — Sound (cs.SD) Jun 1
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer arXiv — Sound (cs.SD) Jun 1
DOA: Training-Free Decoder-Only Attention Policy for Long-Form Simultaneous Translation with SpeechLLMs arXiv — Sound (cs.SD) Jun 1
Scaling Conversational Hungarian ASR: The BEA-Dialogue+ Corpus arXiv — Sound (cs.SD) Jun 1
UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception arXiv — Sound (cs.SD) Jun 1
AirCon26 is live! Attend here (and get a free sound effects library!) A Sound Effect — Blog May 21
GameSoundCon Game Audio Industry Survey 2025: Game Music and Sound Design Jobs Data GameSoundCon Blog & Industry Survey May 31
Personal Finance Essentials for Composers GameSoundCon Blog & Industry Survey May 31
Game Audio Job Skills - How to Get Hired as a Game Sound Designer GameSoundCon Blog & Industry Survey May 31
Hexany Audio's Hiring Pipeline: One Company's Approach to Hiring a Game Sound Designer GameSoundCon Blog & Industry Survey May 31
Game Audio Industry Survey 2019 GameSoundCon Blog & Industry Survey May 31
Can Video Game Composers Get Royalties? GameSoundCon Blog & Industry Survey May 31
18 great new sound effects libraries: 1955 Citroën DS, jungle ambisonics, retro sci-fi, stadium chants & more A Sound Effect — Blog May 19
The insane amount of work behind Airwiggles Audio Conference A Sound Effect — Blog May 18
Crafting Spell-binding Sound for ‘Lee Cronin’s The Mummy’ – with Peter Albrechtsen, Gabriel Gutiérrez and Garret Farrell A Sound Effect — Blog May 13
How Game Music Is Made A Sound Effect — Blog May 13
The AirCon26 Raffle – $15,000 Of Prizes To Be Won! A Sound Effect — Blog May 13
28 great new sound effects libraries: Snow footsteps, fantasy UI, subsurface vibrations, a vintage fire truck & tons more A Sound Effect — Blog May 11
Microsoft and Valve sued, PlayStation's first party downturn, and could a UK social media ban impact games? - Patch Notes #55 Game Developer — Audio section 2h
Nintendo to debut a Switch 2 model with replaceable batteries in the EU Game Developer — Audio section 18h
How the Replaced devs fixed display bloat with a stylish gadget Game Developer — Audio section 22h
Amazon suggests IO Interactive might not return for 007 First Light sequel Game Developer — Audio section yest
PlayerUnknown Productions is laying off staff and halting development on Go Wayback Game Developer — Audio section yest
Team17 has laid off members of its marketing and communications department Game Developer — Audio section yest
Total consumer spending on games topped $60B in the US in 2025 Game Developer — Audio section yest
Mina the Hollower sells 300,000 copies in three days Game Developer — Audio section Jun 3
Game Audio Explained – your guide to making sound for games A Sound Effect — Blog May 7
Tekken 8 game director Kohei Ikeda has left Bandai Namco Game Developer — Audio section Jun 2
PlayStation first-party game sales show a decline since 2020 Game Developer — Audio section Jun 2
The Symbolism of Recurrence: The Music of Assassin’s Creed Liberation Designing Music NOW (Winifred Phillips) Jun 2
How to direct unconventional games like Control Resonant Game Developer — Audio section Jun 2
Bungie experiments by adding casual PvP-lite mode to Marathon Game Developer — Audio section Jun 2
UK social media ban could impact video game platforms Game Developer — Audio section Jun 1
Atari to acquire Crossy Road developer Hipster Whale Game Developer — Audio section Jun 1
Fable reboot delayed until February 2027 Game Developer — Audio section Jun 1
The /r/GameAudio Community Career Corner June, 2026 - for job seekers, help wanted, new career queries, and evaluation requests r/GameAudio Jun 1
The GameAudio Share Mine June, 2026 - Use this post to link to / discuss your site, works, product, business or anything you created or are affiliated with r/GameAudio Jun 1
Crafting a Complex Reality for ‘The Night Manager’ – with Oriol Tarragó A Sound Effect — Blog May 6
handling runtime asset layout layout or timeline sync based on audio transients? r/GameAudio May 31
Has anyone successfully left the audio industry after building a long career in it? r/GameAudio May 31
Middleware r/GameAudio May 31
16 great new sound effects libraries: Robotics, dogs, detective voices, Greek loops & more A Sound Effect — Blog May 5
Need help recreating that PPC / heavy energy weapon sound r/GameAudio May 31
QA in games looking to into Cinematic & Game Sound Design: how should I build a portfolio? r/GameAudio May 30
Wwise custom plugins, what language? r/GameAudio May 29
Crafting Clair Obscur: Expedition 33's mournful tale ft. Jennifer Svedeberg-Yen Game Developer — Audio section May 29
Balatro publisher Playstack is being sold to GameSpot and Fandom parent company Game Developer — Audio section May 29
007 First Light has topped 1.5 million sales Game Developer — Audio section May 29
Micro-grants for indies, another hardware price hike, and Valnet sinks to new lows - Patch Notes #54 Game Developer — Audio section May 29
Remedy Entertainment CEO: Alan Wake and Control should have sold more Game Developer — Audio section May 28
The Rockstar Game Workers Union breaks cover Game Developer — Audio section May 28
Mastering Game Music Question r/GameAudio May 28
Star Trek: Voyager - Across the Unknown reinvents TV storytelling for survival strategy - Narrative Notebook #3 Game Developer — Audio section May 28
Global Game Jam launches micro-grant program to support indies and emerging talent Game Developer — Audio section May 28
Valve raises Steam Deck OLED prices by over $200 Game Developer — Audio section May 28
Report: Payment for some TheGamer staff now tied to per-article sessions Game Developer — Audio section May 27
The Witcher 3: Wild Hunt is getting a new expansion 12 years after launch Game Developer — Audio section May 27
PS5 exclusive Destruction AllStars shut down after five years Game Developer — Audio section May 27
Report: Latest Nintendo Switch 2 production estimates contrast hardware forecast Game Developer — Audio section May 26
How Brazil's government boosts the local game industry Game Developer — Audio section May 26
Microsoft to pay $250M to settle lawsuit filed by aggrieved Activision Blizzard shareholders Game Developer — Audio section May 26
Will PRS registration affect streamers/content-creators? r/GameAudio May 26
Are stem separation tools becoming normal in game audio workflows now? r/GameAudio May 25
Honest opinion on my case r/GameAudio May 25
Music as part of UX r/GameAudio May 24
Music producer looking to dive into Game Audio. Any advice from industry pros? r/GameAudio May 18
Beards, Cats, And Indie Game Audio EP 68 - Elise Kates Beards, Cats and Indie Game Audio May 2
Pause event except for one track (FMOD) r/GameAudio May 13
My FMOD repeats this windows r/GameAudio May 12
Melodies as Symbols: The Music of Assassin’s Creed Liberation Designing Music NOW (Winifred Phillips) May 5
The GameAudio Share Mine May, 2026 - Use this post to link to / discuss your site, works, product, business or anything you created or are affiliated with r/GameAudio May 1

Keyboard

j / k
move between items
Space
expand / collapse
o
open original
s
save / unsave
m
mark read
/
focus search
?
this help