Research Scientist, Speech
Berlin
A bit about Cantina:
Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet.
Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offer a way for creators to share infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.
If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!
A bit about the role:
We are seeking talented Research Scientists to join our team, focused on advancing the capabilities of our AI-driven social platform. As a Research Scientist, you will play a pivotal role in developing state-of-the-art speech models that enable our AI bots to perceive and interact with audio and speech in real-time.
As a Research Scientist, you will:
Conduct research to develop novel and scalable algorithms and models for speech generation, voice cloning, voice conversion, and voice generation.
Focus on optimization and efficiency of novel large-scale model architectures to enable real-time interactions within the product
Collaborate with product, design and engineering teams to develop research prototypes solving particular product problems.
Explore and analyze cutting-edge research,survey, evaluate, and integrate new techniques quickly.
Work on LLMs, GANs, Diffusion and Flow matching models.
Publish papers and open-source research breakthroughs within the community
A bit about you:
Ph.D. or equivalent experience in Computer Science, Electrical Engineering, or related fields with a focus on generative modeling, including large language modeling, speech recognition, text-to-speech or computer vision
Proven track record demonstrated through publications at top-tier conferences, or journals, open-source project contributions, and patents.
Excellent hands-on experience with training large-scale generative LLMs and diffusion models and GANs.
Proficiency in signal processing and speech modeling techniques.
Hands-on experience with large-scale datasets, data augmentation, and self-supervised learning for speech tasks.
Strong programming skills in Python with deep learning frameworks (PyTorch, JAX).
Prior experience in deploying research to real-world applications
Ability to work independently and collaboratively in a fast-paced, dynamic environment, with a strong sense of ownership and drive to deliver impactful results.
Why Join Cantina AI?
Opportunity to work on groundbreaking AI technologies that redefine social-media …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
RestrictionsNo Visa Sponsorship
Benefits/PerksCompetitive compensation Equity options Flexible work arrangements Healthcare Benefits Wellness stipend
Tasks- Collaborate with teams
- Publish research
AI Algorithms Deep Learning Diffusion Models GANS Generative Modeling Jax LLMs Python PyTorch Signal processing
Education Timezones