backpropagation - How does PyTorch autograd backpropogate successfully through non-tensor elements in the computational graph fo

IT技术

更新时间：2025-03-120

admin管理员组
文章数量:1305166

I am trying to understand the example REINFORCE PyTorch implementation on PyTorch GitHub: .py

One particular point is a sticking point I am unable to understand at line 75.

policy_loss.backward()

There are many non-tensor variables from state input to policy all the way to policy_loss.backward() which would stop autograd from back propagating, based on my understanding.

Eg, policy_loss.backward() is called on policy_loss, derived from policy.saved_log_probs and returns

    for log_prob, R in zip(policy.saved_log_probs, returns):
        policy_loss.append(-log_prob * R)

policy.saved_log_probs is a non-tensor

self.saved_log_probs = []

And so is returns, which in turn is calculated from policy.rewards (which is a non-tensor).

        self.rewards = []

    for r in policy.rewards[::-1]:
        R = r + args.gamma * R
        returns.appendleft(R)

So how would autograd back prop past these all the way to affine1 linear layer’s weights?

class Policy(nn.Module):
    def __init__(self):
        super(Policy, self).__init__()
        self.affine1 = nn.Linear(4, 128)

I am trying to understand the example REINFORCE PyTorch implementation on PyTorch GitHub: https://github/pytorch/examples/blob/main/reinforcement_learning/reinforce.py

One particular point is a sticking point I am unable to understand at line 75.

policy_loss.backward()

There are many non-tensor variables from state input to policy all the way to policy_loss.backward() which would stop autograd from back propagating, based on my understanding.

Eg, policy_loss.backward() is called on policy_loss, derived from policy.saved_log_probs and returns

    for log_prob, R in zip(policy.saved_log_probs, returns):
        policy_loss.append(-log_prob * R)

policy.saved_log_probs is a non-tensor

self.saved_log_probs = []

And so is returns, which in turn is calculated from policy.rewards (which is a non-tensor).

        self.rewards = []

    for r in policy.rewards[::-1]:
        R = r + args.gamma * R
        returns.appendleft(R)

So how would autograd back prop past these all the way to affine1 linear layer’s weights?

class Policy(nn.Module):
    def __init__(self):
        super(Policy, self).__init__()
        self.affine1 = nn.Linear(4, 128)

Share Improve this question asked Feb 4 at 7:07 TalkArtFunDay 111 bronze badge

Add a comment |

1 Answer 1

Sorted by: Reset to default 0

As you suspected, the list is indeed not part of the computational graph. The fact that you hold the input or output tensor of an arithmatic operation in a list, dict or any other data structure is irrelevant. Every time a tensor is involved in a derivable operation (e.g. multiplication, addition, or even concatination), the result has a reference to the location in the computational graph that is built by the operation.

In the examples you provided, note that later the tensors inside the list are used in the arithmatic ops, not the list that contains it.

For background, you may find it interesting to read a bit about how computational graphs are built.

本文标签：

版权声明：本文标题：backpropagation - How does PyTorch autograd backpropogate successfully through non-tensor elements in the computational graph fo 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1741783573a2397450.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

android - How to fully customize notifications - Stack Overflow

IT技术

26分钟前

When I try to create a custom notification, there is always an expand button in the upper right corner,

javascript - Change standard position of tooltip if hoveredactive in angular - Stack Overflow

IT技术

25分钟前

Is it possible to change the standard positioning of a tooltip?In my case, when using "bottom"

javascript - How to remove underscores from json key value - Stack Overflow

IT技术

24分钟前

I have a json file like{"Asian_Cities_Countries":[{"name":"Beijing",&q

javascript - Set cookie to hide div when button is clicked - Stack Overflow

IT技术

22分钟前

I'm trying to display a div (containing terms and conditions) which is shown by default unless a c

javascript - How to add time zone to specific format in momentjs? - Stack Overflow

IT技术

21分钟前

I am trying to get specific format of datetime with time zonei am getting string of time format which i

javascript - Centering Google Maps on responsive resize with marker - Stack Overflow

IT技术

21分钟前

I'm trying to get Google Maps to keep its center when resizing responsively. I've managed to

wp register style - Do not load the css file for a plugin from the header

IT技术

17分钟前

I'm newbie to PHP.I need to remove a css file from home of my siteThis is the html code to remove<link rel=&#

Where to add my PHP codes for AJAX Jquery to work?

IT技术

12分钟前

I am trying to display state's names responsively based on what country is selected from a list (via a library).I

javascript - How to create interface from typeof? - Stack Overflow

IT技术

11分钟前

How do I dynamically go "backwards" from object to interface?const o = {a: 1,b: "hi"

typescript - Type similar to Record<A, B> but saying that not for every A there is a B value (not even an undefine

IT技术

9分钟前

Record is defined as follows:*** Construct a type with a set of properties K of type T*type Record&

javascript - How to cachesave images in react and display them? - Stack Overflow

IT技术

9分钟前

I have a list of URLs to images. Each of these images sometimes load and sometimes 404 (meaning that an

javascript - Calling an async JS function within if condition - Stack Overflow

IT技术

8分钟前

I want to generate a unique token, so I have to check if the token value already exists in the MongoDB

plugins - How to disable autocomplete for inputs in contact form 7?

IT技术

6分钟前

Closed. This question is off-topic. It is not currently accepting answers.Your question should be specific to WordPress.

javascript - typescript : trigger "organizeImports" from command line - Stack Overflow

IT技术

6分钟前

VSCode has an editor feature, which allows to clean and order imports in javascript and typescript file

internet explorer 8 - Reason behind a JavaScript parsing error in MSIE 8 - Stack Overflow

IT技术

4分钟前

Given something likevar obj = {foo: function(){try{doSomething();}catch(ex){@TODO - report error}}}MS

javascript - How do I use @ViewChild with an external ng-template (Angular 11) - Stack Overflow

IT技术

4分钟前

THE PROBLEMSo I have two Angular ponents, a parent and a child. The parent passes a custom template to

Detecting a retina display iPad with javascript - Stack Overflow

IT技术

3分钟前

I'm having a problem detecting a retina iPad (and similar devices) using just screen.availWidth an

javascript - jQuery textarea draggable - Stack Overflow

IT技术

1分钟前

OI have a simple problem with jQuery draggable with a textarea.I have to insert a textarea into a div d

react native - Expo Router 52 not detecting files and directories in app directory - Stack Overflow

IT技术

1分钟前

I am using React Native Expo Router SDK 52 that uses file-based routing to navigate between screens. I

javascript - AsyncStorage always returns {"_U": 0, "_V": 0, "_W": null, &a

IT技术

5秒前

async function getTokenFromAsync() {const userToken = await AsyncStorage.getItem('@User_Token'

发表评论

全部评论 0

暂无评论

编程频道|软件玩家 - 软件改变生活！

backpropagation - How does PyTorch autograd backpropogate successfully through non-tensor elements in the computational graph fo

1 Answer 1

更多相关文章

android - How to fully customize notifications - Stack Overflow

javascript - Change standard position of tooltip if hoveredactive in angular - Stack Overflow

javascript - How to remove underscores from json key value - Stack Overflow

javascript - Set cookie to hide div when button is clicked - Stack Overflow

javascript - How to add time zone to specific format in momentjs? - Stack Overflow

javascript - Centering Google Maps on responsive resize with marker - Stack Overflow

wp register style - Do not load the css file for a plugin from the header

Where to add my PHP codes for AJAX Jquery to work?

javascript - How to create interface from typeof? - Stack Overflow

typescript - Type similar to Record&lt;A, B&gt; but saying that not for every A there is a B value (not even an undefine

javascript - How to cachesave images in react and display them? - Stack Overflow

javascript - Calling an async JS function within if condition - Stack Overflow

plugins - How to disable autocomplete for inputs in contact form 7?

javascript - typescript : trigger &quot;organizeImports&quot; from command line - Stack Overflow

internet explorer 8 - Reason behind a JavaScript parsing error in MSIE 8 - Stack Overflow

javascript - How do I use @ViewChild with an external ng-template (Angular 11) - Stack Overflow

Detecting a retina display iPad with javascript - Stack Overflow

javascript - jQuery textarea draggable - Stack Overflow

react native - Expo Router 52 not detecting files and directories in app directory - Stack Overflow

javascript - AsyncStorage always returns {&quot;_U&quot;: 0, &quot;_V&quot;: 0, &quot;_W&quot;: null, &a

发表评论

推荐文章

Posting to a Custom Post Type from front end - user generated content

javascript - NestJS: Using forRootforChild in custom module - race condition? - Stack Overflow

javascript - How to import into properties using ES6 module syntax (destructing)? - Stack Overflow

jquery - Javascript querySelectorAll, how to match with only top elements? - Stack Overflow

javascript - JQuery apply input mask to field onfocus and remove onblur so to avoid problems with placeholder text - Stack Overf

热门文章

javascript - JSjQuery - animated random name picker - Stack Overflow

node.js - Javascript how to reference an Express-Session on the client - Stack Overflow

javascript - Bootstrap carousel slide start without &quot;active&quot; - Stack Overflow

Single Post function to display all single post images in a carousel

javascript - Why select.setAttribute(&#39;value&#39;,value) produce different results than select.value=value? - Stack O

Asp.Net Core IExceptionHandler loses scope - Stack Overflow

javascript - how to print numberlong value? - Stack Overflow

javascript - Webpack failed to compile React.js project - Stack Overflow

arrays - Javascript- Convert Dictionary into List of Objects - Stack Overflow

javascript - How can I give a plugin&#39;s default settings access to the final settings? - Stack Overflow

最新文章

Win7各正式版下载地址和SHA验证

怎么样把中文版的Windows7改成英文版的Windows7

Win7系统笔记本蓝牙打开指南：详细步骤助你轻松连接

win7开机弹计算机,win7开机弹出Windows Installer窗口的解决方法

windows7虚拟机安装vmtools方法

android - Updating UiState of a screen from a viewModel of another screen Kotlin - Stack Overflow

javascript - AsyncStorage always returns {&quot;_U&quot;: 0, &quot;_V&quot;: 0, &quot;_W&quot;: null, &a

javascript - How to bind an event to Tabbing Off an element? - Stack Overflow

react native - Expo Router 52 not detecting files and directories in app directory - Stack Overflow

javascript - jQuery textarea draggable - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

typescript - Type similar to Record<A, B> but saying that not for every A there is a B value (not even an undefine

javascript - typescript : trigger "organizeImports" from command line - Stack Overflow

javascript - AsyncStorage always returns {"_U": 0, "_V": 0, "_W": null, &a

javascript - Bootstrap carousel slide start without "active" - Stack Overflow

javascript - Why select.setAttribute('value',value) produce different results than select.value=value? - Stack O

javascript - How can I give a plugin's default settings access to the final settings? - Stack Overflow

javascript - AsyncStorage always returns {"_U": 0, "_V": 0, "_W": null, &a