
**Job Description:**

As the CTO and co-founder, you will play a crucial role in shaping the technological direction of our company. We are seeking a visionary leader with expertise in AI, ML, and related technologies, including audio and speech processing, TTS (Text-to-Speech), and STT (Speech-to-Text). Your primary responsibility will be to lead the development of our AI voice agent platform, using tools such as Hugging Face to experiment with open-source LLMs, train models, and customize them to suit our business needs. Knowledge of embeddings and vector databases is essential, and familiarity with GPTCache, or with caching commonly used answers and audio files to reduce latency, is a significant plus. You must either have hands-on experience with all of these technologies or be willing to get your hands dirty with the required coding and architecture design.

 

**Key Responsibilities:**

- Lead the design, development, and implementation of our AI voice agent platform.

- Apply your expertise in AI, ML, and LLMs to experiment with open-source models and customize them for our business case.

- Integrate audio and speech processing capabilities, including TTS and STT functionalities, into our platform.

- Collaborate with cross-functional teams to define technical requirements, prioritize features, and drive product development.

- Stay updated on the latest advancements in AI technology, incorporating emerging trends and best practices into our product roadmap.

- Use tools such as Hugging Face to experiment with LLMs, train models, and customize them to meet our specific requirements.

- Implement embeddings and vector databases to optimize data storage and retrieval processes.

- Knowledge of GPU-based hosting providers is good to have.

 

**Requirements:**

- Strong background in AI, ML, and related fields, with expertise in audio and speech processing.

- Proficiency in LLMs and experience with tools like Hugging Face for experimenting with open-source models.

- Knowledge of TTS and STT technologies, including hands-on experience with platforms like ElevenLabs (https://elevenlabs.io/), Play.ht (https://play.ht/), and Deepgram (https://www.deepgram.com/).

- Experience with embeddings and vector databases, with the ability to optimize data storage and retrieval processes.

- Familiarity with GPTCache or similar technologies for caching commonly used answers and audio files to reduce latency.

- Ability to use call recordings and transcripts of all recruiting calls made by our AI agents to continuously retrain our models, so that our LLM becomes the smartest AI recruiter, able to handle any objection and respond more creatively.