DRL algorithm with api #2528
Comments
cibot: Thank you for posting issue #2528. The person in charge will reply soon.
Another question.
Hello! Thank you for your question and concern. Here are my answers to your questions:
You can checkpoint and continue the training process, but that resumption is not based on saving gradients.
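A minimal sketch of that checkpoint-and-resume flow, assuming nntrainer's `ml::train` C++ API with `model->save()` / `model->load()` and the `MODEL_FORMAT_BIN` format; the file name and the elided graph-building steps are placeholders, not verified against this exact version:

```cpp
// Sketch: checkpoint a model's weights and resume training from them.
// Assumes nntrainer's ml::train C++ API; format enum and path are assumptions.
#include <model.h>

void checkpoint_and_resume() {
  auto model = ml::train::createModel(ml::train::ModelType::NEURAL_NET);
  // ... add layers, setOptimizer, compile, initialize, train ...

  // Persist weights only; optimizer state (momentum, accumulated gradients)
  // is not part of this checkpoint, which is why resuming is not
  // gradient-based.
  model->save("dqn_checkpoint.bin", ml::train::ModelFormat::MODEL_FORMAT_BIN);

  // Later (or in a new process): rebuild the same graph, restore weights,
  // and continue training from the restored parameters.
  auto resumed = ml::train::createModel(ml::train::ModelType::NEURAL_NET);
  // ... add the same layers, setOptimizer, compile, initialize ...
  resumed->load("dqn_checkpoint.bin", ml::train::ModelFormat::MODEL_FORMAT_BIN);
  // resumed->train(); // continues from the checkpointed weights
}
```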
OK, thanks for your reply.
For model copy: if there is no copy constructor for the model class and the default behavior does not do what you want, you may try "original.save()" followed by "cloned.load()". For the Polyak update, it appears that the DQN application (or the simple "reinforcement learning" app) has its own custom op, but I'm not too sure about this. I guess @jijoongmoon may answer this when he returns from his trip.
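To illustrate the two suggestions above, here is a hedged, self-contained sketch using a toy `Net` struct as a stand-in (this is not nntrainer's model class): cloning via save/load, and a Polyak (soft) update `theta_target <- tau * theta_online + (1 - tau) * theta_target`:

```cpp
#include <cstddef>
#include <fstream>
#include <vector>

// Toy stand-in for a network: a flat parameter vector. Illustrates the two
// techniques only; nntrainer's actual weight-access API may differ.
struct Net {
  std::vector<float> weights;

  void save(const char *path) const {
    std::ofstream out(path, std::ios::binary);
    out.write(reinterpret_cast<const char *>(weights.data()),
              weights.size() * sizeof(float));
  }
  // Assumes `weights` has already been sized to match the saved network.
  void load(const char *path) {
    std::ifstream in(path, std::ios::binary);
    in.read(reinterpret_cast<char *>(weights.data()),
            weights.size() * sizeof(float));
  }
};

// (1) Clone via serialization when no copy constructor is available:
// build the target with the same shape, then fill it from disk.
void clone_via_save_load(const Net &original, Net &cloned) {
  original.save("tmp_weights.bin");
  cloned.load("tmp_weights.bin");
}

// (2) Polyak (soft) target update: blend online weights into the target.
void polyak_update(const Net &online, Net &target, float tau) {
  for (std::size_t i = 0; i < target.weights.size(); ++i)
    target.weights[i] =
      tau * online.weights[i] + (1.0f - tau) * target.weights[i];
}
```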
Hello, I checked the reinforcement learning app.
Hello, Dear Contributors,
I noticed that the DQN application doesn't use the API .h file.
Also, only a predefined loss function exists there, so if I want to develop a DQN method, I would like to ask you to confirm the following.
Or perhaps you have better advice.
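For reference, here is a hedged sketch of how a small Q-network might be assembled through the public `ml::train` API instead of the app's internal graph code. The layer type strings, property names, and the `mse` loss layer are assumptions based on the general shape of nntrainer's C++ API and may not match this exact version:

```cpp
// Sketch: a small fully-connected Q-network built via nntrainer's public
// ml::train API. DQN regresses Q-values, so an MSE-style loss is typical.
#include <memory>
#include <string>

#include <layer.h>
#include <model.h>
#include <optimizer.h>

std::unique_ptr<ml::train::Model> build_q_network(unsigned state_dim,
                                                  unsigned num_actions) {
  auto model = ml::train::createModel(ml::train::ModelType::NEURAL_NET);

  // input_shape uses nntrainer's channel:height:width convention (assumed).
  model->addLayer(ml::train::createLayer(
    "input", {"input_shape=1:1:" + std::to_string(state_dim)}));
  model->addLayer(ml::train::createLayer(
    "fully_connected", {"unit=64", "activation=relu"}));
  model->addLayer(ml::train::createLayer(
    "fully_connected", {"unit=" + std::to_string(num_actions)}));
  // Loss attached as a layer; "mse" is an assumption about the type name.
  model->addLayer(ml::train::createLayer("mse"));

  model->setOptimizer(
    ml::train::createOptimizer("adam", {"learning_rate=1e-3"}));
  model->compile();
  model->initialize();
  return model;
}
```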