ClearScale, an AWS Premier Consulting Partner headquartered in San Francisco, California, USA, has offered a full range of professional cloud computing services for over 10 years. These include architecture design, DevOps automation, refactoring and cloud-native application development, integration, migration, security services (from a basic security check to preventing cyber-attacks), and 24/7 technical support.
Our customer list is diverse: from governments (ClearScale is an official cloud partner of the State of California) and educational institutions (University of California, San Francisco) to well-known global brands (IBM, Samsung, GoPro, HP, Conde Nast, Carl Zeiss, etc.). We have served well over 850 satisfied customers, some of whom are featured in the Case Studies section of the company's website.
- We were the third company to gain a new AWS competency: Applied AI and Machine Learning Operations (MLOps). Fewer than 15 partners hold this competency!
- We have obtained status in the Database Freedom program (fewer than 20 companies in total).
- We have proven expertise as a Managed Services Provider (fewer than 16 companies in total). This means that ClearScale can provide full-cycle consultancy and service: from audits and system or software development to 24/7 support. You can read more about us on the Managed Services page of the company website.
About the Role
ClearScale is looking for a Generative AI Engineer. The successful candidate should demonstrate the ability to design, build, and optimize Generative AI frameworks, leveraging cloud-native technologies and AI/ML best practices.
About the Project
ClearScale is running various projects closely related to Data, Machine Learning, Artificial Intelligence, and Generative AI applications. The team works on AI-powered solutions, including Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) systems, and AI-driven automation tools.
One of the key projects focuses on building scalable Generative AI services, optimizing data pipelines, and integrating state-of-the-art AI tools for production-grade applications.
Responsibilities
- Develop and maintain highly scalable AI/ML services.
- Work with Large Language Models (LLMs) to perform prompt engineering, fine-tuning, and optimization.
- Design, build, and manage robust and scalable ETL data pipelines.
- Develop Proof-of-Concepts (PoCs) using the latest AI technologies and experiment with third-party integrations.
- Build and optimize RAG (Retrieval-Augmented Generation) systems using vector storage solutions.
- Deploy and manage AI applications on AWS cloud infrastructure, ensuring security, scalability, and efficiency.
- Implement best practices, including SOLID principles, code reviews, and CI/CD pipelines.
- Collaborate with cross-functional teams, including data scientists, engineers, and business stakeholders, to deliver production-grade AI applications.
- Monitor, debug, and optimize AI applications for performance, reliability, and cost efficiency.
Requirements
Technologies:
- Proficiency in Python, Terraform, SQL, and Shell scripting.
- Experience with Git, Docker, and REST APIs.
Cloud & Infrastructure:
- Expertise in AWS services, including EC2, ELB, ECS, IAM, RDS, S3, Glue, Lambda, Athena, Bedrock, and Step Functions.
- Experience with vector storage solutions (e.g., Pinecone) and NoSQL databases.
AI Frameworks & Tools:
- Experience working with LangChain, CrewAI/AutoGen, and OpenAI APIs.
- Strong understanding of AI concepts such as LLMs, ChatGPT, RAG, and prompt engineering.
Best Practices:
- Strong grasp of SOLID principles and experience conducting code reviews.
- Experience in deploying production-grade AI applications with scalable architectures.
Preferred Qualifications:
- Experience with serverless computing and event-driven architectures.
- Familiarity with MLOps and monitoring AI model performance.
- Knowledge of data engineering best practices for handling structured and unstructured data.
We Offer:
#1 Fair Wage
- 100% remote position
- High hourly rate in USD
- Payments every 2 weeks
- Career development system – possibility to grow in a fast-developing Delivery Center in Central Europe
- Annual rate review
#2 Professional Development
- Work with innovative Silicon Valley companies and traditional American companies at the cutting edge of digital transformation.
- We work with the newest AWS cloud technologies and collaboration tools such as Jira, Confluence, Lucidchart, and Slack.
- We operate in an honest and competitive environment and are among AWS's top 10 key partners.
- The team is willing to share its experience.
- Horizontal and vertical career growth – We keep growing, and people keep growing with us.
If you are passionate about AI, cloud technologies, and building scalable AI-driven applications, we encourage you to apply!