Added waste management through RL #1150
Closed
The project aims to develop a reinforcement learning (RL) agent to optimize waste collection in a simulated environment, minimizing overflow events and improving efficiency.
Environment and State Representation:
The state is represented by four features:
- Waste Level: current waste level (0 to 1)
- Time of Day: a random value representing the time (0 to 24 hours)
- Weather Condition: a random value (0 to 1) indicating the weather
- Distance to Collection Point: a random value (0 to 10) representing the distance to the waste collection point
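A minimal sketch of how such a state vector could be sampled is shown below; the function name `sample_state` and the feature ordering are assumptions for illustration and may not match the PR's actual code.

```python
import numpy as np

def sample_state(rng: np.random.Generator) -> np.ndarray:
    """Return one 4-feature state as described above (illustrative sketch)."""
    waste_level = rng.uniform(0.0, 1.0)    # current fill level of the bin
    time_of_day = rng.uniform(0.0, 24.0)   # hour of the day
    weather = rng.uniform(0.0, 1.0)        # 0 = clear, 1 = severe (assumed encoding)
    distance = rng.uniform(0.0, 10.0)      # distance to the collection point
    return np.array([waste_level, time_of_day, weather, distance], dtype=np.float32)
```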
Action Space:
The agent can choose between two actions:
- Wait (0): do not collect waste.
- Collect Waste (1): proceed with waste collection.
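In code, this discrete action space can be represented with two integer constants; the names below are hypothetical:

```python
WAIT = 0      # do not collect waste this step
COLLECT = 1   # dispatch collection now
ACTIONS = (WAIT, COLLECT)
```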
Reward Structure:
The reward system is designed to encourage efficient waste collection:
- +10 for timely collection, when the waste level exceeds the threshold.
- -5 for premature collection, when the waste level is below the threshold.
- -1 for each time step, to penalize waiting.
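A hedged sketch of the reward logic these rules imply, assuming a single `threshold` hyperparameter (the exact constants and threshold value used in the PR may differ):

```python
def compute_reward(waste_level: float, action: int, threshold: float = 0.8) -> float:
    """Reward shaping as described above (illustrative values and threshold)."""
    if action == COLLECT:
        # Timely collection is rewarded; premature collection is penalized.
        return 10.0 if waste_level >= threshold else -5.0
    # Waiting costs a small penalty every step to discourage idling.
    return -1.0
```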
Training Process:
The agent is trained over 100 episodes. Each episode simulates up to 20 time steps, during which the agent makes a decision at every step based on the current state. The agent learns from experience using a replay memory and updates its policy through Q-learning.
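The following is a simplified sketch of such a training loop, reusing the `sample_state`, `compute_reward`, and `ACTIONS` helpers sketched above. It uses a tabular Q-table over a discretized waste level and a uniform replay buffer; the actual PR may instead use a neural-network Q-function and different hyperparameters, so treat all constants here as assumptions.

```python
import random
from collections import deque

import numpy as np

# Hyperparameters (illustrative; the PR's actual values may differ)
EPISODES = 100
MAX_STEPS = 20
GAMMA = 0.95
ALPHA = 0.1
EPSILON_START, EPSILON_MIN, EPSILON_DECAY = 1.0, 0.05, 0.97
BATCH_SIZE = 32
N_BINS = 10  # discretization of the waste level for the tabular Q-table

q_table = np.zeros((N_BINS, len(ACTIONS)))
memory = deque(maxlen=2000)  # replay memory of past transitions
rng = np.random.default_rng(0)

def discretize(state: np.ndarray) -> int:
    # Only the waste level drives the tabular index in this simplified sketch.
    return min(int(state[0] * N_BINS), N_BINS - 1)

epsilon = EPSILON_START
for episode in range(EPISODES):
    state = sample_state(rng)
    for _ in range(MAX_STEPS):
        s = discretize(state)
        # Epsilon-greedy action selection.
        if random.random() < epsilon:
            action = random.choice(ACTIONS)
        else:
            action = int(np.argmax(q_table[s]))
        reward = compute_reward(float(state[0]), action)
        next_state = sample_state(rng)  # placeholder transition; a real env would evolve the waste level
        memory.append((s, action, reward, discretize(next_state)))
        state = next_state

        # Q-learning update from a random minibatch of replayed experience.
        if len(memory) >= BATCH_SIZE:
            for s_, a_, r_, ns_ in random.sample(memory, BATCH_SIZE):
                target = r_ + GAMMA * np.max(q_table[ns_])
                q_table[s_, a_] += ALPHA * (target - q_table[s_, a_])
    # Decay exploration after each episode.
    epsilon = max(EPSILON_MIN, epsilon * EPSILON_DECAY)
```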
Evaluation Metrics:
Performance is evaluated using:
- Average Reward per Episode: measures the effectiveness of the agent's actions.
- Epsilon Decay: tracks the exploration rate, indicating how the agent balances exploration and exploitation.
- Overflow Events: counts occurrences in which the waste level exceeds the maximum capacity.
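These metrics can be accumulated once per episode during training; the variable and function names below are hypothetical bookkeeping, not the PR's actual API:

```python
episode_rewards = []   # total reward collected in each episode
epsilon_history = []   # epsilon value at the end of each episode
overflow_counts = []   # number of overflow events in each episode

def record_episode(total_reward: float, epsilon: float, overflows: int) -> None:
    """Append one episode's metrics; called at the end of every episode."""
    episode_rewards.append(total_reward)
    epsilon_history.append(epsilon)
    overflow_counts.append(overflows)
```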
Visualization:
The results are visualized using Matplotlib to plot:
- Average rewards per episode, showing the agent's learning progression and the reward gained when the collection condition is met.
- Epsilon decay over episodes, illustrating the shift from exploration to exploitation.
- Overflow events per episode, highlighting improvements in waste management.
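A plotting sketch along these lines, assuming the metric lists from the evaluation section above (figure layout and labels are illustrative):

```python
import matplotlib.pyplot as plt

fig, axes = plt.subplots(1, 3, figsize=(15, 4))

axes[0].plot(episode_rewards)
axes[0].set(title="Average reward per episode", xlabel="Episode", ylabel="Reward")

axes[1].plot(epsilon_history)
axes[1].set(title="Epsilon decay", xlabel="Episode", ylabel="Epsilon")

axes[2].plot(overflow_counts)
axes[2].set(title="Overflow events per episode", xlabel="Episode", ylabel="Overflows")

plt.tight_layout()
plt.show()
```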