a type of machine learning where an agent learns to make decisions by performing actions and receiving feedback
вид машинного обучения, при котором агент учится принимать решения, выполняя действия и получая обратную связь (vid mashinnogo obucheniya, pri kotorom agent uchitsya prinimat' resheniya, vypolnyaya deystviya i poluchaya obratnuyu svyaz')
"In reinforcement learning, the model improves over time as it learns from trial and error."
"В reinforcement learning модель улучшает себя со временем, обучаясь на пробах и ошибках." (V **reinforcement learning** model' uluchshaet sebya so vremenem, obuchayas' na probakh i oshibkakh.)
to sell shares of a company to the public for the first time
to give in or fail due to too much stress or pressure
to make a firm decision or opinion known
to keep possession of something
to take a short break from a situation
to be more successful than competitors