a type of machine learning where an agent learns to make decisions by performing actions and receiving feedback
一种机器学习方式，其中智能体通过执行动作并接受反馈来学习做出决策 (yī zhǒng jīqì xuéxí fāngshì, qízhōng zhìnéngtǐ tōngguò zhíxíng dòngzuò bìng jiēshòu fǎnkuì lái xuéxí zuòchū juécè)
"In reinforcement learning, the model improves over time as it learns from trial and error."
"reinforcement learning 中,模型随着时间的推移逐渐改进,因为它通过试验和错误进行学习。" (**reinforcement learning** zhōng, móxíng suízhe shíjiān de tuīyí zhújiàn gǎijìn, yīnwèi tā tōngguò shìyàn hé cuòwù jìnxíng xuéxí.)
to show agreement by nodding
to become angry suddenly
to spend large amounts of money in a hasty or unwise way
to change direction or strategy in response to market feedba...
to travel or look around to discover new things
to explain something in simple, non-technical language