Entrenaré a un agente de aprendizaje de refuerzo profundo para ti.


Acerca de este Servicio
Traducción automática
Ingeniero de investigación experimentado en visión por computadora en aprendizaje por refuerzo con habilidades en entrenar agentes de aprendizaje por refuerzo.
Trabajos anteriores incluyen:
- Implementación de artículos de investigación.
- Agentes Q learning para jugar juegos individuales y multijugador.
- Entrenamiento en la mayoría de los entornos de OpenAI Gym.
- Entrenamiento de DQN desde cero usando solo NumPy.
- Entrenamiento de múltiples agentes personalizados.
- Entrenar cualquier agente de aprendizaje por refuerzo en entornos personalizados.
Ofrezco implementación de vanguardia de algoritmos de aprendizaje por refuerzo para tus entornos personalizados o entornos de OpenAI gymnasium.
Capaz de manejar tanto entornos simples como complejos.
- Competente en MDPs, TD y Q-learning.
- DQN (Deep Q-Networks)
- PPO (Proximal Policy Optimization)
- TRPO (Trust Region Policy Optimization)
- Actor-Critic Methods
- A2C (Advantage Actor-Critic)
- A3C (Asynchronous Advantage Actor-Critic)
- Métodos Monte Carlo
- DDPG (Deep Deterministic Policy Gradient)
- SAC (Soft Actor-Critic)
- HER (Hindsight Experience Replay)
- ACER (Actor-Critic con experiencia de repetición)
Contacta antes de hacer tu pedido para recibir ayuda rápida.
No te preocupes, obtendrás una respuesta rápida.
Conoce a Hakim Ali
- DePakistán
- Miembro desdeene 2023
- Última entrega2 años
Idiomas
Inglés
Traducción automática
4 comentarios sobre este Servicio
| (4) | ||
| (0) | ||
| (0) | ||
| (0) | ||
| (0) |
Desglose de calificaciones
- Nivel de comunicación del Freelancer
- Recomendar a un amigo
- Servicio según lo descrito
Ordenar por
A ash5355
Cliente recurrente

Reino Unido
This is second time I work with him..Great job..Indeed a smart person who deliver the work in a day or two as quick as posdibke covering all the requirements.
USD50-USD100
Precio
4 días
Tiempo
Útil?N 
nemosu
Cliente recurrente

Emiratos Árabes Unidos
Hakim is very brilliant and talented. He gave me a perfect MLP built from scratch without using any Python libraries as I asked him and implemented TD correctly to the game. He was very patient with me in changing anything I point to him and answer any questions I had. He cares about his clients and...
USD50-USD100
Precio
4 días
Tiempo
Útil?A ash5355
Cliente recurrente

Reino Unido
Best work ever..He delivered within few hours..Perfectionist..would definitely recommend him ..He trained an RL agent for me to race car..
Útil?Z zagato5800

Alemania
Thanks again for your fast and good work, ihkali! I am very satisfied with the results achieved and would also let you solve RL tasks in the future! In addition, a nice contact who also responds to questions very well.
Útil?
4 comentarios sobre este Servicio
| (4) | ||
| (0) | ||
| (0) | ||
| (0) | ||
| (0) |
Desglose de calificaciones
- Nivel de comunicación del Freelancer
- Recomendar a un amigo
- Servicio según lo descrito
Ordenar por
A ash5355
Cliente recurrente

Reino Unido
This is second time I work with him..Great job..Indeed a smart person who deliver the work in a day or two as quick as posdibke covering all the requirements.
USD50-USD100
Precio
4 días
Tiempo
Útil?N 
nemosu
Cliente recurrente

Emiratos Árabes Unidos
Hakim is very brilliant and talented. He gave me a perfect MLP built from scratch without using any Python libraries as I asked him and implemented TD correctly to the game. He was very patient with me in changing anything I point to him and answer any questions I had. He cares about his clients and...
USD50-USD100
Precio
4 días
Tiempo
Útil?A ash5355
Cliente recurrente

Reino Unido
Best work ever..He delivered within few hours..Perfectionist..would definitely recommend him ..He trained an RL agent for me to race car..
Útil?Z zagato5800

Alemania
Thanks again for your fast and good work, ihkali! I am very satisfied with the results achieved and would also let you solve RL tasks in the future! In addition, a nice contact who also responds to questions very well.
Útil?
