
CE(pseudo hard label, teacher_us) #21

Open
miramirakim227 opened this issue Aug 15, 2021 · 1 comment
miramirakim227 commented Aug 15, 2021

[screenshot: the loss computation around line 218]

I have a small question about the code above.

The aim of cross entropy is to make two different distributions similar to each other.
I understood the first CE loss, but the second CE loss confuses me a lot.
In line 218, the latter argument of the CE should follow the former argument.
In this case, is the purpose of the CE loss to sharpen the raw output of the teacher model?
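For context, here is a minimal sketch of the pattern I understand this to be, a UDA-style consistency term where hard pseudo labels from the weakly-augmented branch supervise the strongly-augmented branch. All variable names here are my own placeholders, not the repo's actual code:

```python
import torch
import torch.nn.functional as F

# Placeholder tensors standing in for the teacher's logits on weakly- and
# strongly-augmented views of the same unlabeled batch (names assumed).
teacher_logits_weak = torch.randn(8, 10)                         # weak branch
teacher_logits_strong = torch.randn(8, 10, requires_grad=True)   # strong branch

# Hard pseudo labels come from the weak branch; detaching stops gradients
# from flowing through the label side.
hard_pseudo_labels = torch.softmax(
    teacher_logits_weak.detach(), dim=-1
).argmax(dim=-1)

# In F.cross_entropy(input, target), gradients flow only through `input`
# (the first argument), so this loss pushes the strong-augmentation output
# toward the pseudo labels, not the other way around.
loss_uda = F.cross_entropy(teacher_logits_strong, hard_pseudo_labels)
loss_uda.backward()
```

So in PyTorch's `F.cross_entropy`, it is the first argument that is trained to follow the second, which is what made the argument order in line 218 confusing to me.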


Xiao0728 commented Dec 30, 2021

Did you figure it out? And how were your implementation results? Does it work? Thanks.
