
Reinforcement Learning Engineer

Work from home Full-time role Hiring

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled Reinforcement Learning Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: Reinforcement Learning Engineer
Salary Range: $100k-$150k per annum
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies; there is no third-party client, vendor, or implementation partner involved. We do not engage in C2C, 1099, or third-party arrangements for this role. All our roles are W2 only: strictly no C2C, 1099, or third-party companies, and no third-party brokering. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.

A technical coding assessment is mandatory for every role. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

We are looking for a Reinforcement Learning Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems where supervised learning alone is insufficient. The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale. The ideal candidate has both research depth and engineering pragmatism, with experience taking RL solutions out of the lab and into production where stability, safety, and ongoing improvement are critical.

Key Responsibilities

Design and implement reinforcement learning solutions for sequential decision-making problems in real and simulated environments.
Develop, calibrate, and maintain simulation environments suitable for large-scale agent training.
Implement and evaluate modern RL algorithms including policy gradient, actor-critic, off-policy, and offline RL methods.
Engineer reward functions and shaping strategies that align agent behavior with desired outcomes and safety constraints.
Apply offline RL and imitation learning techniques where exploration is costly or unsafe.
Use RLHF, DPO, and related techniques for fine-tuning large language models when relevant.
Build scalable training infrastructure for distributed RL, including efficient experience collection and replay systems.
Optimize training stability and sample efficiency through algorithmic and engineering improvements.
Design rigorous evaluation protocols, including out-of-distribution and adversarial test cases.
Implement safety mechanisms such as constraint enforcement, conservative policies, and human-in-the-loop oversight.
Collaborate with applied scientists and product teams to identify high-value RL use cases.
Monitor deployed policies and models in production for drift, regression, and unintended behaviors, building the alerting and dashboards that surface issues before they meaningfully affect users.
Document methodology, design decisions, and operational characteristics for internal stakeholders.
Stay current with RL research and translate promising techniques into production-ready solutions.

Required Qualifications

Master’s or PhD in Computer Science, Machine Learning, or a related field; or equivalent applied experience.
Six or more years of combined RL research and engineering experience.
Strong proficiency in Python and modern deep learning frameworks.
Hands-on experience with at least one major RL library or in-house RL stack.
Solid understanding of probability, optimization, and the theoretical foundations of RL.
Experience designing and tuning reward functions in non-trivial environments.
Familiarity with simulation environments and large-scale experience collection.
Experience training neural network policies on GPU clusters.
Strong written and verbal communication skills.
Track record of shipping or publishing impactful RL work.

Preferred Qualifications

Experience with RLHF for large language models.
Familiarity with multi-agent RL or hierarchical RL.
Exposure to robotics, control systems, or autonomous driving.
Publications in RL or related research venues.
Open-source contributions to RL libraries or environments.

How to Apply

Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] or contact us at (908) 650-6699. Learn more about Bright Vision Technologies at www.bvteck.com.

We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.

Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans. Position offered by “No Fee Agency.”

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall. BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
