Skip to content

Commit

Permalink
fix profile
Browse files Browse the repository at this point in the history
  • Loading branch information
ngc92 committed Jun 25, 2024
1 parent 39f976f commit 2413b0a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion profile_gpt2.cu
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 61,7 @@ int main(int argc, char *argv[]) {
// do a training step
gpt2_forward(&model, x, y, B, T);
gpt2_zero_grad(&model);
gpt2_backward_and_reduce(&model, x, true);
gpt2_backward_and_reduce(&model, x, 1, true);
gpt2_update(&model, 1e-4f, 0.9f, 0.999f, 1e-8f, 0.0f, 1.f, 1, &multi_gpu_config);
cudaCheck(cudaDeviceSynchronize()); // finish all CUDA work to get correct precise timings

Expand Down

0 comments on commit 2413b0a

Please sign in to comment.