Vignya Durvasula

Q Learning vs SARSA: Key Differences in Reinforcement Learning

Reinforcement Learning is a branch of machine learning that focuses on decision-making to maximize positive outcomes while successfully reaching a goal state. It involves performing certain actions through a trial-and-error method. Reinforcement Learning is used in many real-world applications like Gaming, Recommendation systems, autonomous vehicles, and many more. The agent is expected to explore the

Q Learning vs SARSA: Key Differences in Reinforcement Learning Read More »

SARSA: A Guide to the On-Policy Reinforcement Learning Algorithm

Reinforcement Learning is a fascinating machine learning technique, which differs from the other techniques of ML. In reinforcement learning, the actor(Agent) has to continuously interact with the environment to make decisions, which results in some sort of feedback. In a way, the agent is self-taught and learns about the environment by exploring. But to explore

SARSA: A Guide to the On-Policy Reinforcement Learning Algorithm Read More »

A Comprehensive Guide to Greek Math Symbols in Machine Learning

Mathematics, a field rich in diversity and applications, uses various notations for different measurements in both mathematics and physics. We cannot use the same notation for every measurement we conduct in mathematics and physics, right? To solve this problem, the Greek Mathematicians have introduced the Greek Math symbols for each measurement or variable representing the

A Comprehensive Guide to Greek Math Symbols in Machine Learning Read More »

What is Kullback-Leibler Divergence in Machine Learning

Information theory is the study of general collection, storage, and transmission of information. It leverages many concepts such as entropy, to efficiently communicate the information. Kullback-Leibler Divergence (KL Divergence) is one such metric used in information theory to quantify or measure the difference between two distributions. KL Divergence also known as relative entropy is closely

What is Kullback-Leibler Divergence in Machine Learning Read More »