Generative Audio & Music Models

429 items · default last 14 days

Homecoming: bringing the ElevenLabs Summit to Warsaw ElevenLabs Blog Jun 1
Revolut selects ElevenLabs Agents to bolster customer support ElevenLabs Blog Jun 1
ElevenLabs crosses $500M ARR as it welcomes new investors including BlackRock, NVIDIA, Jamie Foxx and Eva Longoria ElevenLabs Blog May 31
Introducing Dubbing v2 ElevenLabs Blog May 31
Introducing Stan Lee on ElevenLabs ElevenLabs Blog May 31
Introducing Music v2 ElevenLabs Blog May 31
ElevenLabs expands presence in Australia and New Zealand ElevenLabs Blog May 31
Introducing ElevenMusic ElevenLabs Blog May 31
Bringing voice AI into the classroom with ElevenLabs ElevenLabs Blog May 31
ElevenLabs is expanding in Spain ElevenLabs Blog May 31
Honoring Eric Dane’s Legacy at SXSW: Advancing 1 Million Voices ElevenLabs Blog May 31
Introducing ElevenLabs for Government ElevenLabs Blog May 31
ElevenLabs raises $500M Series D at $11B valuation ElevenLabs Blog May 31
Klarna reduces Time to Resolution by 10X with ElevenAgents ElevenLabs Blog May 31
How T-pop is blowing up the global charts Music Ally now
Threads launches music stickers to all users Music Ally now
How artists are riding the Off Campus sync wave Music Ally now
Embracing the youth social media bans Music Ally now
Taco Bell adds 100 more artists to its ‘Feed The Beat’ scheme Music Ally now
Instagram details the features in its new ‘Plus’ subscription Music Ally now
Tidal aims to help DIY artists sell music directly to fans Music Ally 1h
Chordal reveals 21 new partners for its sync-licensing network Music Ally 1h
New tools for touring artists: Bandsintown Boost and Laylo’s AI ticket-sales agent Music Ally 1h
TikTok wants to teach its creators how to write micro-dramas Music Ally 2h
Cisac launches Paris Commitment while Human Artistry Campaign takes to skies with Suno protest Music Ally 2h
Everything that happened at Spotify India’s first Equal Day Music Ally 5h
Indian artists did win at the 2026 Grammys Awards after all Music Ally 5h
PaRa Music is a new Indian “AI-powered” label — that says it won’t be using AI to create songs Music Ally 5h
Age-Aware Adapter Tuning for Children's Speech Recognition arXiv eess.AS (Audio and Speech Processing) 8h
Enhancing Audio Captioning with Auxiliary AudioSet Semantics arXiv eess.AS (Audio and Speech Processing) 8h
M2S-AVSR: Modality-aware Multi-view Self-supervised Representation for Robust Audio-Visual Speech Recognition arXiv eess.AS (Audio and Speech Processing) 8h
An Ultra-Low-Bitrate Neural Speech Codec with Plain-to-Pseudo Synergistic Vector Quantization arXiv eess.AS (Audio and Speech Processing) 8h
VoCodec: A Low-bitrate Streamable Neural Speech Codec with Voicing-driven Quantization arXiv eess.AS (Audio and Speech Processing) 8h
CoSTA: Cognitive-State-Conditioned TTS Data Augmentation Using ASR Transcripts for Alzheimer's Disease Detection arXiv eess.AS (Audio and Speech Processing) 8h
Revisiting Lexicon Evaluation in Unsupervised Word Discovery arXiv eess.AS (Audio and Speech Processing) 8h
USAD 2.0: Scaling Representation Distillation for Universal Audio Understanding arXiv eess.AS (Audio and Speech Processing) 8h
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models arXiv eess.AS (Audio and Speech Processing) 8h
Task-Vector Arithmetic for Emotional Expressivity Control in Language-Model-Based Text-to-Speech arXiv eess.AS (Audio and Speech Processing) 8h
nnAudio 2: Overcoming Dynamic Compilation Barriers and Transform Inconsistencies arXiv eess.AS (Audio and Speech Processing) 8h
Exploring LLMs for South Asian Music Understanding and Generation arXiv eess.AS (Audio and Speech Processing) 8h
Probing Spatial Structure in Pretrained Audio Representations arXiv eess.AS (Audio and Speech Processing) 8h
Domain-Aware Mispronunciation Detection and Diagnosis Using Language-Specific Statistical Graphs arXiv eess.AS (Audio and Speech Processing) 8h
Sound Effects Dataset Unification With the Universal Category System arXiv eess.AS (Audio and Speech Processing) 8h
Task-Vector Arithmetic for Emotional Expressivity Control in Language-Model-Based Text-to-Speech arXiv cs.SD (Sound) 8h
nnAudio 2: Overcoming Dynamic Compilation Barriers and Transform Inconsistencies arXiv cs.SD (Sound) 8h
Exploring LLMs for South Asian Music Understanding and Generation arXiv cs.SD (Sound) 8h
Probing Spatial Structure in Pretrained Audio Representations arXiv cs.SD (Sound) 8h
Sound Effects Dataset Unification With the Universal Category System arXiv cs.SD (Sound) 8h
SB-RF: Schr\"odinger Bridge Rectified Flow for One-Step Robust Speech Enhancement arXiv cs.SD (Sound) 8h
Beyond Waveform Robustness: Robust Feature-Vocoder Adversarial Attacks on Automatic Speech Recognition arXiv cs.SD (Sound) 8h
Do speech foundation models perceive speaker similarity as humans do? arXiv cs.SD (Sound) 8h
SagnacAssisted Enhanced OTDR for Distributed Acoustic Sensing: A Standardized Benchmark and Engineering Evaluation Framework arXiv cs.SD (Sound) 8h
UniVoice: A Unified Model for Speech and Singing Voice Generation arXiv cs.SD (Sound) 8h
GLASS: GRPO-Trained LoRA for Acoustic Style Steering in Zero-Shot Text-to-Speech arXiv cs.SD (Sound) 8h
Beyond WER: A Paired Acoustic Stress Test for Ambient Clinical Scribes arXiv cs.SD (Sound) 8h
DBHN-Net: Dual-Branch Hybrid Neural Network For Low-Complexity Monaural Speech Enhancement arXiv cs.SD (Sound) 8h
SpeechJBB: Probing Safety Alignment and Comprehension in Large Audio Language Models under Code-Switched Speech arXiv cs.SD (Sound) 8h
Learning Emotion-discriminative Representations for Zero-Shot Cross-lingual Speech Emotion Recognition arXiv cs.SD (Sound) 8h
Canadian government rows back on ‘streaming tax’ for DSPs Music Ally 23h
These Indian acts are part of YouTube Music’s Foundry Class of 2026 Music Ally yest
Producer Abhijit Vaghani launches music institute The Swamp Academy Music Ally yest
Representation Matters in Randomized Smoothing for Audio Classification arXiv eess.AS (Audio and Speech Processing) yest
Masked Wavelet Scattering Transform Neural Field for Sound Field Reconstruction arXiv eess.AS (Audio and Speech Processing) yest
Read What You Hear: Reference-Free Hypotheses Evaluation with Acoustic Discrepancy arXiv eess.AS (Audio and Speech Processing) yest
UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning arXiv eess.AS (Audio and Speech Processing) yest
Differentiable Articulatory Copy-Synthesis of Biphonic Singing arXiv eess.AS (Audio and Speech Processing) yest
Channel-Oriented Design for EEG-to-Music Reconstruction arXiv eess.AS (Audio and Speech Processing) yest
The Differentiable Auditory Loop (DAL): An ML Framework for Hyper-Personalized Hearing Aids arXiv eess.AS (Audio and Speech Processing) yest
Feasibility of Time-Domain DNN-Based Speech Enhancement on Embedded FPGA for Hearing Aid arXiv eess.AS (Audio and Speech Processing) yest
Gauss Circle Lattices with Geometric Convolutions for Synthesizing High Dimensional Image-Source Room Impulse Responses arXiv eess.AS (Audio and Speech Processing) yest
CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding arXiv eess.AS (Audio and Speech Processing) yest
Entity Binding Failures in Speech LLM Reasoning: Diagnosis and Chain-of-Thought Intervention arXiv eess.AS (Audio and Speech Processing) yest
Multilingual Long-Form Speech Instruction Following: KIT's Submission to IWSLT 2026 arXiv eess.AS (Audio and Speech Processing) yest
SURF: Separation via Unsupervised Remixing Flow arXiv eess.AS (Audio and Speech Processing) yest
Audio Interaction Model arXiv eess.AS (Audio and Speech Processing) yest
A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References arXiv eess.AS (Audio and Speech Processing) yest
Channel-Oriented Design for EEG-to-Music Reconstruction arXiv cs.SD (Sound) yest
The Differentiable Auditory Loop (DAL): An ML Framework for Hyper-Personalized Hearing Aids arXiv cs.SD (Sound) yest
Feasibility of Time-Domain DNN-Based Speech Enhancement on Embedded FPGA for Hearing Aid arXiv cs.SD (Sound) yest
Gauss Circle Lattices with Geometric Convolutions for Synthesizing High Dimensional Image-Source Room Impulse Responses arXiv cs.SD (Sound) yest
CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding arXiv cs.SD (Sound) yest
A Second-Order Cepstral Signature of Contact-Vibration Sounds Reproduced by Laptop Loudspeakers: A Synthetic Case Study arXiv cs.SD (Sound) yest
Flow-HOA: Generative Joint Optimization for Ambisonics Encoding via Flow Matching arXiv cs.SD (Sound) yest
SHB-AE: Spherical harmonic beamforming based Ambisonics encoding and upscaling method for smartphone microphone array arXiv cs.SD (Sound) yest
Drift-Augmented Scoring: Text-Derived Noise Robustness for Zero-Shot Audio-Language Classification arXiv cs.SD (Sound) yest
SURF: Separation via Unsupervised Remixing Flow arXiv cs.SD (Sound) yest
FoeGlass: Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors arXiv cs.SD (Sound) yest
Audio Interaction Model arXiv cs.SD (Sound) yest
Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models arXiv cs.SD (Sound) yest
DetectZoo: A Unified Toolkit for AI-Generated Content Detection Across Text, Audio, and Image Modalities arXiv cs.SD (Sound) yest
Representation Matters in Randomized Smoothing for Audio Classification arXiv cs.SD (Sound) yest
Meet the three startups demoing at our NYC meetup next week Water & Music yest
Sandbox Guide :: Embracing the youth social media bans Music Ally yest
2.5bn people are now using Google’s AI overviews in search Music Ally yest
Meta’s latest tests focus on episodic reels and teen wellbeing Music Ally yest
New next-gen labels roundup: Volyum, Home Hits and Orinda Music Ally yest
Federal appeals court reverses 2 Live Crew catalogue ruling Music Ally yest
SoundExchange and IFPI make it easier to get ISRCs assigned Music Ally yest
Music Technology Australia VMDO panel report: ‘No one’s made metadata fun!’ Music Ally yest
ARIA boss blasts latest call for pro-AI copyright reforms Music Ally Jun 3
International Entrepreneur of the Year 2026 nominees: Leo Ballesteros (Atonemo) Music Ally Jun 3
International Entrepreneur of the Year 2026 nominees: Dr. Moti Margalit (SonicEdge) Music Ally Jun 3
FSA-GRPO: Teaching Auditory LLMs to Use Few-shot Demonstrations arXiv eess.AS (Audio and Speech Processing) Jun 3
Wavelet as Tokenizer: Preliminary Results on a Shared Wavelet Token Schema for Natural Signals arXiv eess.AS (Audio and Speech Processing) Jun 3
SVHalluc: Benchmarking Speech-Vision Hallucination in Audio-Visual Large Language Models arXiv eess.AS (Audio and Speech Processing) Jun 3
A Comparison of Generative and Discriminative Methods for Speech Enhancement: Robustness, Complexity, and Hallucination arXiv eess.AS (Audio and Speech Processing) Jun 3
AnyAudio-Judge: A Dynamic Rubric-Based Benchmark and Evaluator for Audio Instruction Following arXiv eess.AS (Audio and Speech Processing) Jun 3
SpeakerCard-1M: An Evidence-Grounded Speaker Card Corpus for In-the-Wild Speaker Verification arXiv eess.AS (Audio and Speech Processing) Jun 3
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling arXiv eess.AS (Audio and Speech Processing) Jun 3
Stable Hybrid Cross-Attention Fusion for Audio-Visual Event Recognition arXiv eess.AS (Audio and Speech Processing) Jun 3
In-the-Loop Training of Deep Feedback Cancellation for Hearing Aids arXiv eess.AS (Audio and Speech Processing) Jun 3
SegTune: Structured and Fine-Grained Control for Song Generation arXiv eess.AS (Audio and Speech Processing) Jun 3
Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals arXiv eess.AS (Audio and Speech Processing) Jun 3
EntangleCodec: A Unified Discrete Audio Tokenizer via Semantic-Acoustic Entanglement arXiv eess.AS (Audio and Speech Processing) Jun 3
CoughSense: Five-Class Respiratory Disease Classification via Whisper Encoder Fine-Tuning and Dual-Encoder Cross-Attention Fusion with Balanced Contrastive Learning arXiv eess.AS (Audio and Speech Processing) Jun 3
Inference-Time Scaling for Joint Audio-Video Generation arXiv eess.AS (Audio and Speech Processing) Jun 3
Benchmarking Speech-to-Speech Translation Models arXiv eess.AS (Audio and Speech Processing) Jun 3
SegTune: Structured and Fine-Grained Control for Song Generation arXiv cs.SD (Sound) Jun 3
EntangleCodec: A Unified Discrete Audio Tokenizer via Semantic-Acoustic Entanglement arXiv cs.SD (Sound) Jun 3
A Training-Efficient Transformer-Based Anti-Spoofing Network for Logical Access in ASVspoof 5 arXiv cs.SD (Sound) Jun 3
Audio Spotforming via Post-Filtering Using Cross-Array Non-target Estimates arXiv cs.SD (Sound) Jun 3
SketchSong: Hierarchical Song Generation with Sketch Planning and Fine-Grained Multi-Track Modeling arXiv cs.SD (Sound) Jun 3
Speech Emotion Recognition using Attention-based LSTM-Network with Residual Connection arXiv cs.SD (Sound) Jun 3
Tonal parsimony in chord-sequence analysis: combining modulation cost and tonal vocabulary arXiv cs.SD (Sound) Jun 3
Foley-Omni: A Unified Multimodal Generation Model from Task-Level Audio Synthesis to Complete Video Soundtrack Generation arXiv cs.SD (Sound) Jun 3
LiveBand: Live Accompaniment Generation in the Audio Domain arXiv cs.SD (Sound) Jun 3
FSA-GRPO: Teaching Auditory LLMs to Use Few-shot Demonstrations arXiv cs.SD (Sound) Jun 3
Wavelet as Tokenizer: Preliminary Results on a Shared Wavelet Token Schema for Natural Signals arXiv cs.SD (Sound) Jun 3
SVHalluc: Benchmarking Speech-Vision Hallucination in Audio-Visual Large Language Models arXiv cs.SD (Sound) Jun 3
Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals arXiv cs.SD (Sound) Jun 3
A Comparison of Generative and Discriminative Methods for Speech Enhancement: Robustness, Complexity, and Hallucination arXiv cs.SD (Sound) Jun 3
AnyAudio-Judge: A Dynamic Rubric-Based Benchmark and Evaluator for Audio Instruction Following arXiv cs.SD (Sound) Jun 3
International Entrepreneur of the Year 2026 nominees: Vishruti Bindal and Bharavi (Linear Festivals) Music Ally Jun 3
AI music-videos startup Neural Frames hits $5m annual run rate Music Ally Jun 2
Roli’s latest music-education move is a partnership with Casio Music Ally Jun 2
French project ReDisco has recycled more than 1.2m records Music Ally Jun 2
Bank of America is ‘Side by Side’ with Tomorrow’s Warriors Music Ally Jun 2
TikTok hails 11bn views (and streaming success) for ‘Self Aware’ Music Ally Jun 2
Serato kicks off Pride Month with ‘Paradise’ on Apple Music Music Ally Jun 2
Amazon Music is getting pricier in India — but will also be available for free Music Ally Jun 2
Nina to shut down over sustainable revenue challenges Music Ally Jun 2
Privacy-preserving Prosody Representation Learning arXiv eess.AS (Audio and Speech Processing) Jun 2
Local Diagnostics of Continuous Normalizing Flow for Out-of-Distribution Detection arXiv eess.AS (Audio and Speech Processing) Jun 2
Context-aware child-directed speech detection from long-form recordings arXiv eess.AS (Audio and Speech Processing) Jun 2
Description and Discussion on DCASE 2026 Challenge Task 2: Noise-aware Unsupervised Anomalous Sound Detection for Machine Condition Monitoring arXiv eess.AS (Audio and Speech Processing) Jun 2
RRP-Voice: A Longitudinal Dataset and Benchmark for Recurrent Respiratory Papillomatosis Detection arXiv eess.AS (Audio and Speech Processing) Jun 2
Kinship Verification Using Voice arXiv eess.AS (Audio and Speech Processing) Jun 2
SpeechEditBench: A Bilingual Multi-Attribute Benchmark for Instruction-Guided Speech Editing arXiv eess.AS (Audio and Speech Processing) Jun 2
Advancing Electrolaryngeal Speech Enhancement Through Speech-Text Representation Learning arXiv eess.AS (Audio and Speech Processing) Jun 2
Localizing broadband noise sources using the Lo\`eve spectrum and a 2.5D approach arXiv eess.AS (Audio and Speech Processing) Jun 2
Domain-Agnostic Incremental Learning for Sound Classification. A DCASE 2026 Challenge task arXiv eess.AS (Audio and Speech Processing) Jun 2
Breaking the Pair: Evaluating Dyadic Interaction via Speaker Switching arXiv eess.AS (Audio and Speech Processing) Jun 2
SiamCTC: Learning Speech Representations through Monotonic Temporal Alignment arXiv eess.AS (Audio and Speech Processing) Jun 2
Exploiting Noise Inseparability for Weakly-Supervised Discriminative Speech Denoising Using Noisy Targets arXiv eess.AS (Audio and Speech Processing) Jun 2
SoulX-Transcriber: A Robust End-to-End Framework for Multi-Speaker Speech Transcription arXiv eess.AS (Audio and Speech Processing) Jun 2
DUET: Unified Dual-Space Emotion Control for Diffusion and Flow-Matching Driven Text-to-Speech arXiv eess.AS (Audio and Speech Processing) Jun 2
DUET: Unified Dual-Space Emotion Control for Diffusion and Flow-Matching Driven Text-to-Speech arXiv cs.SD (Sound) Jun 2
Quality Audio Prototyping: a prototype system for unified sound retrieval and procedural generation arXiv cs.SD (Sound) Jun 2
Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty arXiv cs.SD (Sound) Jun 2
Sympatheia: Emotionally Adaptive Voice Assistant with Continuous Affect Conditioning arXiv cs.SD (Sound) Jun 2
MelT: GEMM-Native NDFT for Efficient Single-Stage Audio Frontends on Modern Accelerators arXiv cs.SD (Sound) Jun 2
A Lightweight Slot-Attention Framework for Multi-Instrument Multi-Pitch Estimation arXiv cs.SD (Sound) Jun 2
UniVocal: Unified Speech-Singing Code-Switching Synthesis arXiv cs.SD (Sound) Jun 2
HAIM: Human-AI Music Datasets for AI Music Production Tracking Benchmark arXiv cs.SD (Sound) Jun 2
JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions arXiv cs.SD (Sound) Jun 2
MOSS-Audio Technical Report arXiv cs.SD (Sound) Jun 2
Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space arXiv cs.SD (Sound) Jun 2
C2GA: A Class-Controllable Generative Augmentation Framework for Respiratory Sound Classification arXiv cs.SD (Sound) Jun 2
Parameter-efficient Dual-encoder Architecture with Differentiable Choquet Integral Fusion for Underwater Acoustic Classification arXiv cs.SD (Sound) Jun 2
DAStatFormer: A Hybrid Multibranch Transformer with Statistical Feature Integration for DAS-Based Pattern Recognitions arXiv cs.SD (Sound) Jun 2
Local Diagnostics of Continuous Normalizing Flow for Out-of-Distribution Detection arXiv cs.SD (Sound) Jun 2
International Entrepreneur of the Year 2026 nominees: Max Busin (Gotobeat) Music Ally Jun 1
Musixmatch responds to latest legal filing in LyricFind battle Music Ally Jun 1
MTUK report warns of big decline in UK music-tech funding Music Ally Jun 1
Brenda Lee’s full catalogue is finally coming to streaming services Music Ally Jun 1
Boards of Canada hit out at White House over use of their music Music Ally Jun 1
Suno is riding the wave of TikTok’s new ‘text to song’ trend Music Ally Jun 1
UMG joins Bolloré Group in rejecting Pershing Square offer Music Ally Jun 1
Extracting accent features in spoken Brazilian Portuguese without sociolinguistic labels arXiv eess.AS (Audio and Speech Processing) Jun 1
FiPA-SR -- FiLM-Conditioned Perceptually Informed Audio Super-Resolution arXiv eess.AS (Audio and Speech Processing) Jun 1
OpenSTBench: Beyond Semantic Evaluation for Speech Translation arXiv eess.AS (Audio and Speech Processing) Jun 1
A Unified and Reproducible Experimentation Framework for Speech Understanding arXiv eess.AS (Audio and Speech Processing) Jun 1
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer arXiv eess.AS (Audio and Speech Processing) Jun 1
ImmersiveTTS: Environment-Aware Text-to-Speech with Multimodal Diffusion Transformer and Domain-Specific Representation Alignment arXiv eess.AS (Audio and Speech Processing) Jun 1
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue arXiv eess.AS (Audio and Speech Processing) Jun 1
On the Use of Dereverberation for Acoustic Feedback Cancellation arXiv eess.AS (Audio and Speech Processing) Jun 1
Improving acoustic drone detection generalization through pretraining and data augmentation arXiv eess.AS (Audio and Speech Processing) Jun 1
UNISON: A Unified Sound Generation and Editing Framework via Deep LLM Fusion arXiv eess.AS (Audio and Speech Processing) Jun 1
Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation arXiv eess.AS (Audio and Speech Processing) Jun 1
Escaping the Linearity Trap: Manifold Detours for Black-Box Adversarial Attacks on Singing Audio Deepfake Detection arXiv eess.AS (Audio and Speech Processing) Jun 1
Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS arXiv eess.AS (Audio and Speech Processing) Jun 1
Scaling Conversational Hungarian ASR: The BEA-Dialogue+ Corpus arXiv eess.AS (Audio and Speech Processing) Jun 1
Acoustic Simulation Framework for Multi-channel Replay Speech Detection arXiv eess.AS (Audio and Speech Processing) Jun 1
Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation arXiv cs.SD (Sound) Jun 1
3DAE: Binaural Quality Assessment for Audio Novel View Synthesis with Spatial Maps and Benchmark arXiv cs.SD (Sound) Jun 1
Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS arXiv cs.SD (Sound) Jun 1
AnchorSteer: Self-Discovered Concept Injection for Structure-Preserving Music Editing arXiv cs.SD (Sound) Jun 1
Sound effects in media:A comparative analysis of recorded and synthetic samples in live-action and animation arXiv cs.SD (Sound) Jun 1
MindVoice: Reconstructing Intelligible Speech from Non-invasive Neural Signals with Pretrained Priors arXiv cs.SD (Sound) Jun 1
Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation arXiv cs.SD (Sound) Jun 1
Escaping the Linearity Trap: Manifold Detours for Black-Box Adversarial Attacks on Singing Audio Deepfake Detection arXiv cs.SD (Sound) Jun 1
Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors arXiv cs.SD (Sound) Jun 1
GaMi: Geometry-Agnostic Material Identification via Cross-Modal Subtractive Disentanglement arXiv cs.SD (Sound) Jun 1
A Unified and Reproducible Experimentation Framework for Speech Understanding arXiv cs.SD (Sound) Jun 1
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer arXiv cs.SD (Sound) Jun 1
DOA: Training-Free Decoder-Only Attention Policy for Long-Form Simultaneous Translation with SpeechLLMs arXiv cs.SD (Sound) Jun 1
Scaling Conversational Hungarian ASR: The BEA-Dialogue+ Corpus arXiv cs.SD (Sound) Jun 1
UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception arXiv cs.SD (Sound) Jun 1
Sync’s new frontiers: clearance, creators, boutiques and brand intelligence Music Ally May 29
YouTube reveals latest changes to its AI-content labels system Music Ally May 29
LabelWorx launches $10m fund for indie electronic-music labels Music Ally May 29
LyricFind files updated complaint in battle with Musixmatch Music Ally May 29
Bloomberg questions whether TikTok is leaving labels behind Music Ally May 29
Epidemic Sound study examines creator attitudes towards AI Music Ally May 29
How to content batch social media posts Music Ally May 29
TikTok ‘In The Mix’ podcast starts second season Music Ally May 29
2 Chainz and OussiFooty launch World Cup series Music Ally May 29
The perfect pivot Music Ally May 29
A tale of two US events: Freedom 250 and Power to the People Music Ally May 29
Regional-language music streaming service Damroo raises investment from HT Media Music Ally May 29
!K7 calls for indie labels to measure EDI as well as their climate impact Music Ally May 29
Why We Built an AI Music Video Generator That Listens to Songs Music Ally May 29
Gaana’s subscribers are growing at the rate of 15% per year, says CEO Music Ally May 28
[Alt-Pop] Freedom's Expensive by Real Enough r/SunoAI now
I want to create 10 kids music videos for free r/SunoAI now
How do you prevent it from going over the top? r/SunoAI now
Pump.fun's Latest Experiment Is Already Getting Weird Decrypt — AI / Music coverage now
[Elec Pop] Frozen Time r/SunoAI now
[Organic House] Ashes to Eden (Organic Remix) by Atonstar Music Universe: Pushing the boundaries of high-end professional sound with Suno v5.5! r/SunoAI 1h
ZEC Crashes 38% as Zcash Discloses ‘Critical Counterfeiting Vulnerability’ Decrypt — AI / Music coverage 1h
[Dark R&B] Not My Ex by Quantum Melody: Using AI vocal processing layers on top of a fully human-made FL Studio beat. How does the texture feel? r/SunoAI 2h
[Dance] My Name Is Suno r/SunoAI 2h
[Phonk] Papai - Curse r/SunoAI 2h
A scoop on Udio's upcoming app, Starstruck Water & Music May 21
Can't make Amapiano songs r/SunoAI 3h
[Nintendo Core / Rage Core] I Came Into Your World by Arthur Wegley (ARed) r/SunoAI 6h
Why your AI songs are getting ignored on Reddit r/SunoAI 9h
AI Is Already Developing AI, Says Anthropic—And Humans May Be Slowing Things Down Decrypt — AI / Music coverage 14h
Republican Lawmaker Plans to Add Prediction Markets to Congressional Stock Ban Bill Decrypt — AI / Music coverage 15h
'Looksmaxxing' Trend Spawns $100M Gray Market Fueled By Bitcoin, Stablecoins: Chainalysis Decrypt — AI / Music coverage 16h
Important information about copyright blocks r/SunoAI 16h
Google DeepMind CEO Says AGI Is Coming Fast: 'We Don't Have Long to Prepare' Decrypt — AI / Music coverage 17h
Strategy's Michael Saylor Blames 'Capital Rotation' Into AI as Bitcoin Dives 13% Decrypt — AI / Music coverage 17h
Crypto Billionaires Donate $9.4M to Farage’s Reform UK in Q1 Decrypt — AI / Music coverage 18h
Bitcoin Miners Emerge as 'Power Landlords' of AI Boom—And Revenue Will Surge: Bernstein Decrypt — AI / Music coverage 19h
DOJ Task Force Freezes $3.8M in Illicit Crypto—With Help From Coinbase, SpaceX and Meta Decrypt — AI / Music coverage 20h
Fannie Mae-Backed Bitcoin Home Mortgages Are Finally Here, Coinbase Says Decrypt — AI / Music coverage 20h
New AI song promotion post r/SunoAI 20h
British Teen Sanctioned By Russia After Alleging Crypto Use to Evade Sanctions Decrypt — AI / Music coverage 21h
As BTC Tests $62,000, How Low Can Bitcoin Go? Decrypt — AI / Music coverage 22h
Morning Minute: Crypto Crashes, New Lows In Sight Decrypt — AI / Music coverage 23h
US Bitcoin Reserve Moving Ahead at ‘Deliberate Speed’: Bessent Decrypt — AI / Music coverage 23h
[Circus Electro Swing] Show Hand by Sheer's Dream Music r/SunoAI yest
[fan made FIFA song] Thunder Grassfire r/SunoAI yest
A complaint I have not yet seen much about... r/SunoAI yest
[Kpop/Emotional/Techno] AMOR FATI | Preview2 r/SunoAI yest
Coinbase Launches Pre-IPO Perps, Starting with Elon Musk's SpaceX Decrypt — AI / Music coverage yest
Tom Lee’s BitMine Plans $300M Preferred Stock Sale for ETH Treasury Push Decrypt — AI / Music coverage yest
[Dark Trap] Philistone - Same As You r/SunoAI yest
How do you make sleep ambiance? r/SunoAI yest
Backup Your Library r/SunoAI yest
Music Distributors r/SunoAI yest
SUNO, Don't Back Down: Creativity Needs Freedom r/SunoAI yest
Suno just valued at 5.4 Billion r/SunoAI yest
The Best AI Models Still Encourage 'Harmful Intimacy' With Chatbots, Study Funds Decrypt — AI / Music coverage yest
Cardano Slumps to 5-Year Low Price as Charles Hoskinson Warns of 'Wave of Failures' Decrypt — AI / Music coverage yest
Tether Debuts Tokenized Gold Stablecoin Visa Card That Pays Out Crypto Rewards Decrypt — AI / Music coverage yest
What are we even doing? 🤨 r/SunoAI yest
AI Lawyers Are Already Better Than Law Professors at Reasoning—Say Law Professors Decrypt — AI / Music coverage yest
Hermes Ends AI Agent Terminal Era With Release of Official Desktop App Decrypt — AI / Music coverage yest
Someone Just Redeemed a 15-Year-Old Physical Bitcoin, Scoring $1.78 Million in BTC Decrypt — AI / Music coverage yest
Perplexity Wants Your Laptop to Do Part of the AI Work—So It Doesn't Have To Decrypt — AI / Music coverage yest
In the coming months, we’ll begin rolling out our first music model developed in partnership with the music industry. - June 3, 2026 r/SunoAI yest
Why Ethereum Could Tank Another 25% Before Finding a Bottom: Analysis Decrypt — AI / Music coverage yest
As Oil Moves Higher, Bitcoin Sinks to Lowest Price Since March Decrypt — AI / Music coverage yest
World Cup Crypto Scams Are Targeting Soccer Fans, Law Enforcement Warns Decrypt — AI / Music coverage yest
Walrus Memory Enables AI Agents to ‘Actually Learn About Us’: Mysten Labs Co-Founder Decrypt — AI / Music coverage yest
Mastercard Expands Stablecoin Settlement via Circle's USDC, Ripple's RLUSD and Beyond Decrypt — AI / Music coverage yest
Stripe Millionaire Loses Bid for Congress to Candidate Backed by Ripple Co-Founder Decrypt — AI / Music coverage yest
I spent a full day running controlled tests on Suno v5.5 output quality. Two things I found that nobody seems to be talking about. [RESEARCH] r/SunoAI yest
What tools do you use to clean up AI-generated songs before release? r/SunoAI yest
Zcash Completes 'Most Ambitious' Network Upgrade as ZEC Resumes Recent Surge Decrypt — AI / Music coverage yest
Poll on Song Topic for 1st Community Picked Theme r/SunoAI yest
[indie] Finally figured out how to get Suno to output authentic, warm bedroom pop (no digital artifacts, 950-character styling breakdown inside) r/SunoAI yest
Suno is indeed down and seems to be going though an existential crisis of sorts r/SunoAI yest
Give us your trash r/SunoAI yest
⚠️ Suno Down ⚠️ r/SunoAI yest
Suno down? r/SunoAI yest
MoonPay Brings Crypto Transactions to Claude and Codex With MoonAgents Desktop App Decrypt — AI / Music coverage yest
Morning Minute: Bitcoin Falls Below $67k as MSTR Plummets Decrypt — AI / Music coverage yest
Worst costumer service / nightmare r/SunoAI yest
Trezor Reveals Hardware Wallet Vulnerability, But Funds 'Safe' Decrypt — AI / Music coverage yest
[RAP] HELL by KND r/SunoAI yest
Let's pick some specific topic to write songs about, and then come back here in two days and see what everyone's got r/SunoAI yest
George Santos Referred to DOJ, CFTC Over State of the Union Kalshi Trades: Report Decrypt — AI / Music coverage Jun 3
UK Regulator Warns Soccer Clubs Over Unauthorized Crypto Sponsorship Deals Decrypt — AI / Music coverage Jun 3
[Cinematic western ballad] The Best Tracker in the West ( r/SunoAI Jun 3
EuroReVision: SUMMERHIT 2026 r/SunoAI Jun 3
People are following me on Suno without listening, liking or commenting on any songs… Why? r/SunoAI Jun 3
Microsoft Reveals '1,000x More Reliable' Quantum Chip as Bitcoin Threat Draws Nearer Decrypt — AI / Music coverage Jun 3
Cardsmiths' New America250 Trading Cards Have Real Bitcoin, Dogecoin Up for Grabs Decrypt — AI / Music coverage Jun 3
US Treasury Sanctions Iranian Crypto Exchanges Including Nobitex for Terrorist Financing Decrypt — AI / Music coverage Jun 2
As Bitcoin Sinks, It's Time for Ethereum to Outperform: Standard Chartered Decrypt — AI / Music coverage Jun 2
Microsoft Says Latest AI Models Beat Claude, Google's Nano Banana Decrypt — AI / Music coverage Jun 2
Strategy Shares Fall for Second Straight Day After $56 Billion Bitcoin Giant Sells BTC Decrypt — AI / Music coverage Jun 2
Microsoft Turns OpenClaw Into an Enterprise AI Agent With Scout Decrypt — AI / Music coverage Jun 2
Bernie Sanders, Elizabeth Warren Urge Labor Department to Drop Bitcoin, Crypto 401K Plan Decrypt — AI / Music coverage Jun 2
Where Does Bitcoin Go From Here? This Is What the Charts Say Decrypt — AI / Music coverage Jun 2
Difficulty cloning my voice on Suno? r/SunoAI Jun 2
[Cabaret] Fine! Un-Fuck You! r/SunoAI Jun 2
Andrew Yang's Noble Acquires Crypto-Fueled Helium Mobile Service Decrypt — AI / Music coverage Jun 2
Songs start over in the middle of the song???? r/SunoAI Jun 2
[trance] Where the Ocean Speaks by Bearded_man r/SunoAI Jun 2
[Vintage Pop-Rock] Quality Time r/SunoAI Jun 2
[RAP] Backwood Blunts ocfogger r/SunoAI Jun 2
Nobody Will Care How the Music Was Made r/SunoAI Jun 2
Mt. Gox Moves $739M in Bitcoin as Repayment Deadline Looms Decrypt — AI / Music coverage Jun 2
suno removed the lyrics model selection?? r/SunoAI Jun 2
NEAR, Worldcoin Post Double-Digit Gains as Market Sees $714M in Liquidations Decrypt — AI / Music coverage Jun 2
Your lyrics steer the melody more than the style box — and you can control dissonance with word texture r/SunoAI Jun 2
Morning Minute: Saylor Sells Bitcoin for First Time Since 2022 Decrypt — AI / Music coverage Jun 2
[Post-Punk / Britpop] The Script Was Rigged — HAL_9001 r/SunoAI Jun 2
Suno Learning Your Voice? r/SunoAI Jun 2
It's over. Here are my suggestions. r/SunoAI Jun 2
5.0 is better than 5.5 r/SunoAI Jun 2
Nvidia Releases Its Best Open AI Model Yet—But Still Lags Behind China Decrypt — AI / Music coverage Jun 1
DuckDuckGo Launched Duck AI. Now Their Hit Product is 'No AI' Decrypt — AI / Music coverage Jun 1
TON Price Pumps After Telegram CEO Says Token Will Be Rebranded to Gram Decrypt — AI / Music coverage Jun 1
Elon Musk's SpaceX Warns $1.75 Billion IPO Investors of Potential Future Share Dilution Decrypt — AI / Music coverage Jun 1
Trump’s Business Partner Teases Future Meme Coin Plans: 'We’re The Biggest Brand on Earth' Decrypt — AI / Music coverage Jun 1
Strategy's Bitcoin Sale Timing Throws $50 Million Polymarket Bet Into Dispute Decrypt — AI / Music coverage Jun 1
Kalshi Eyes Perpetual Futures for XRP, Solana, Dogecoin—And These Altcoins Decrypt — AI / Music coverage Jun 1
Florida Sues OpenAI, Sam Altman Over ChatGPT Safety Claims Decrypt — AI / Music coverage Jun 1
Sell Coinbase Before Derivatives Squeeze Crypto Giant, Says Compass Point Decrypt — AI / Music coverage Jun 1
[Chill EDM] Smoerebroet - Perfect Green Hell r/SunoAI Jun 1
Tom Lee's BitMine Buys $52 Million in Ethereum as Strategy Sells Bitcoin Decrypt — AI / Music coverage Jun 1
[Metal] The Last Bridge - The Last Ascension r/SunoAI Jun 1
[EDM] Umbrella | Electronic Dance | Late Night Anthems By Mwenu r/SunoAI Jun 1
[Club] I Want More (Remix) r/SunoAI Jun 1
[Dark Soul-Blues / Cinematic Noir] Time Wasted — Looking for Honest Feedback r/SunoAI Jun 1
AI Giant Anthropic Files to Go Public After Nearing $1 Trillion Valuation Decrypt — AI / Music coverage Jun 1
[CINEMATIC Hip Hop] L’essor-Chute by D3G3N3RATE r/SunoAI Jun 1
[Future Bass] The Dream Between Us r/SunoAI Jun 1
Strategy Shares Slide Following Bitcoin Sale—Will It Dump More BTC Ahead? Decrypt — AI / Music coverage Jun 1
Whitehat Helps Recover $2M in ETH Stuck Since 2016 ICO Decrypt — AI / Music coverage Jun 1
Sui Blames Last Week's Trio of Network Outages on Gas and Validator Bugs Decrypt — AI / Music coverage Jun 1
[Arena Rock] White Flags Rising r/SunoAI Jun 1
June 2026 Song Feedback Megathread - Leave a review, get a review! r/SunoAI Jun 1
New AI song promotion post r/SunoAI Jun 1
Binance Opens Access to 7,000 US Stocks, Prepares Tokenized 'bStocks' Rollout Decrypt — AI / Music coverage Jun 1
Bitcoin Falls to 2-Month Low After Strategy Sells BTC, ETFs Flip Negative for the Year Decrypt — AI / Music coverage Jun 1
Morning Minute: HYPE Soars as CFTC Gives Perps Green Light Decrypt — AI / Music coverage Jun 1
[Ambient - mixed styles] Yunasak - Prayer to God r/SunoAI Jun 1
Michael Saylor's Bitcoin Treasury Firm Strategy Sells 32 BTC for $2.5M Decrypt — AI / Music coverage Jun 1
Bitcoin ETF Losses Near $3B Across 10 Days as YTD Flows Turn Negative Decrypt — AI / Music coverage Jun 1
Coinbase Launches Direct Indian Rupee Deposit and Withdrawal Rails Decrypt — AI / Music coverage Jun 1
What Is BChat? The Decentralized Messaging App Built for Privacy Decrypt — AI / Music coverage Jun 1
[Techno-Opera] Breath Signal by Michelle r/SunoAI Jun 1
Suno won't accept my original songs r/SunoAI Jun 1
[J-POP] 過ぎる刻 📸 Time We Shared r/SunoAI Jun 1
[Pop rock] Bed Rot r/SunoAI Jun 1
I figure out a good way to use your voice on suno ai r/SunoAI Jun 1
Giving direction within lyrics? r/SunoAI Jun 1
[Kpop/Emotional/Techno] AMOR FATI r/SunoAI May 31
UMG and Sony seek to add over 61k recordings to Suno lawsuit after discovery reveals AI trained on ‘millions’ of their copyrighted tracks r/SunoAI May 31
Sharing is caring, 7 prompts to build yourself as a music creator. r/SunoAI May 31
If you want a Vai guitar r/SunoAI May 31
[ElectroRock] Tyson Saner - Shadows of Vengeance (LoadPuller Cranial Implosion Mix) r/SunoAI May 31
10 Best AI Singing Voice Generators for Making Music in 2025 AudioCipher Blog May 31
Best Sample Managers For Organizing Your Audio & MIDI Files AudioCipher Blog May 31
Best AI Sample Finders: WAVS, Splice and Output AudioCipher Blog May 31
6 Best AI MIDI Generator DAW Plugins and Standalone Apps AudioCipher Blog May 31
Types of Ciphers: A Complete Guide to Early and Modern Codes AudioCipher Blog May 31
Final Fantasy 7: Kabbalah and The Official Soundtrack AudioCipher Blog May 31
What is a Musical Cryptogram? AudioCipher Blog May 31
The Complete Guide to Musical Easter Eggs in Popular Culture AudioCipher Blog May 31
How to Write a Leitmotif for Video Games and Movie Scores AudioCipher Blog May 31
12 Best AI Text To Music Apps for People of All Skill Levels AudioCipher Blog May 31
Best AI Music Generator Software in 2026 AudioCipher Blog May 31
Generative Audio Workstations: AI VSTs & The Future of DAWs AudioCipher Blog May 31
Anyone on here have success linking AI music to their OF? r/SunoAI May 31
[Dark Theater/EDM] S6 E6: Rabbit! (Arthur-verse) by Arthur Wegley (ARed) r/SunoAI May 31
[Skatepunk] I don't have a title ;( r/SunoAI May 31
People Said Digital Movies Weren’t "Real" Once Too r/SunoAI May 31
[Reggaeton, Rap] Hit+Hit By Rognar r/SunoAI May 31
Hit me with your absolute best work on Spotify r/SunoAI May 31
[Indie Rock] Son Tebessum by Kendi Kendime: Trying to see if AI can capture traditional Turkish deep melancholy. What do you think? r/SunoAI May 31
[Breakbeat] Higher Feeling - NRVB r/SunoAI May 31
[Pop/Rock] - Tic Tac Toe - Feedback please r/SunoAI May 31
[Metal] Far My Kingdom - Crown of Falling Leaves (one of The Other Ones) r/SunoAI May 31
I love listening to other people's songs. Let's share 3 favorite Suno songs. r/SunoAI May 31
Feature Idea: We desperately need a "Fill Intensity" slider (0% to 100%) r/SunoAI May 31
How President Trump’s Immigration Order Will Feed the Stablecoin Economy, Bitcoin ATMs Decrypt — AI / Music coverage May 31
[DnB] Praeter's Song by Praeter r/SunoAI May 31
[Indie Pop] The Last Reasonable Hour r/SunoAI May 31
Helped Cheer My Daughter Up! r/SunoAI May 31
New AI song promotion post r/SunoAI May 31
Florida Candidate Liquidates $800K in Bitcoin to Bankroll Congressional Bid Decrypt — AI / Music coverage May 30
What Is an AI Prompt Injection Attack? The Hidden Threat Hijacking Your Chatbots Decrypt — AI / Music coverage May 30
'He’s Full of Shit': JP Morgan's Jamie Dimon Takes Aim at Coinbase CEO Over Clarity Act Decrypt — AI / Music coverage May 29
Treasury Secretary Bessent Says US Has 'Grabbed' $1 Billion in Crypto From Iran Decrypt — AI / Music coverage May 29
Celsius Founder Alex Mashinsky Files to Have 12-Year Crypto Fraud Sentence Vacated Decrypt — AI / Music coverage May 29
Coinbase Becomes First US Exchange Allowed to Offer Global Crypto Perps Trading Decrypt — AI / Music coverage May 29
Lenovo Stock Doubles in May on AI Server Boom—Best Month in 27 Years Decrypt — AI / Music coverage May 29
You Can Now Read the US Constitution via the Bitcoin Blockchain Decrypt — AI / Music coverage May 29
NYSE Parent Isn't 'Freaked Out' by Hyperliquid—It's Learning From the Crypto Perps Giant Decrypt — AI / Music coverage May 29
AI Models Can’t Agree on Basic Facts Most of the Time, Study Shows Decrypt — AI / Music coverage May 29
Wintermute Is Providing Liquidity on Kalshi and Polymarket, Linking Two Giants Decrypt — AI / Music coverage May 29
CFTC Approves Bitcoin Perpetual Futures on Prediction Market Kalshi Decrypt — AI / Music coverage May 29
Sui Network Goes Down for Second Straight Day as Weekly Token Slide Hits 20% Decrypt — AI / Music coverage May 29
ElevenLabs Debuts Music v2; Does Music Need a New Royalty for AI Era?; ACE Studio Builds AI Music on DigitalOcean, AMD GPUs AI Music Newsletter May 29
Bitcoin ETFs Shed $2.8B in Record-Breaking Nine-Day Streak Decrypt — AI / Music coverage May 29
Udio’s AI Music App: Starstruck; Spotify, Universal Strike AI Deal; From Headlines to AI Hip-Hop AI Music Newsletter May 26
From Headlines to Hip-Hop: How Slopdog Automates Music Creation AI Music Newsletter May 22
Splice, ElevenLabs to Power New AI Music Tools; Stability AI's New Audio Models Generate 6-minute Music Tracks AI Music Newsletter May 21
Splice, ElevenLabs Partner to Power Next-Generation AI Music Creation AI Music Newsletter May 19
Tamber Debuts AI Music Platform; Is ‘AI Resistance’ Setting the Music Sector Back? AI Music Newsletter May 19
SUNO is having an issue with: "Uploaded audio matches existing work of art." r/SunoAI May 17
Why superfan subscriptions are dying out Water & Music Apr 30
AI Makes The Rolling Stones Young Again for New Video; Do Musicians Have an 'AI Optimism' Blind Spot? AI Music Newsletter May 15
AI Makes Music Fun; Music After the GenAI Creative Big Bang AI Music Newsletter May 12
Spotify’s AI DJ Expands Globally; The Indie Music Toolkit, Reimagined; Listening to 1500 AI songs AI Music Newsletter May 8
The Indie Music Toolkit, Reimagined AI Music Newsletter May 7
A 50-Track Experiment in AI Songwriting AI Music Newsletter May 6
AI Rules on Music Streaming; Influur Debuts AI Music Marketing Agent; Suno Eyes $5B Valuation AI Music Newsletter May 5
May 2026 Song Feedback Megathread - Leave a review, get a review! r/SunoAI May 1
Spotify Adds Verified Artist Badges; AI Breaks Into Studio Workflow; Tamber Raises $5M for AI Music Tool AI Music Newsletter Apr 30
Spotify Lands in Claude; Honolulu's Airport Has AI Theme Songs; AI Music Made Me Cry AI Music Newsletter Apr 27
The Music Industry Crosses an AI Tipping Point; GRAI Believes AI Can Make Music More Social AI Music Newsletter Apr 23

Keyboard

j / k
move between items
Space
expand / collapse
o
open original
s
save / unsave
m
mark read
/
focus search
?
this help