Multiplication Table via Transformer: NanoTransformer White-Box Lab

Step (Batch=100): 0
Avg Loss: -.----

🧠 Live Neural Network Monitor

Displays the shape, parameter count, mean, standard deviation, and average absolute gradient of every parameter layer in real time.

Parameter Layer | Shape | Params | Mean | Std | Gradient (Avg Abs)
Total Parameters: 0
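
As a rough illustration of how one row of the monitor table could be computed, here is a minimal Python sketch. It is not the page's actual implementation: the function name `layer_stats`, the layer name `demo.weight`, and the sample numbers are hypothetical, and the sketch assumes each parameter tensor has already been flattened to a plain list (so the Shape column is omitted).

```python
import math

def layer_stats(name, values, grads):
    """Summarise one flattened parameter tensor and its gradient."""
    n = len(values)
    mean = sum(values) / n
    # Population standard deviation (divide by n). Assumption: the page
    # does not say whether it uses the population or sample variant.
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    # "Gradient (Avg Abs)": mean of the absolute gradient entries.
    grad_avg_abs = sum(abs(g) for g in grads) / n
    return {
        "layer": name,
        "params": n,
        "mean": mean,
        "std": std,
        "grad_avg_abs": grad_avg_abs,
    }

# Hypothetical layer with made-up values, for illustration only.
row = layer_stats("demo.weight", [0.5, -0.5, 1.5, -1.5], [0.1, -0.3, 0.2, -0.2])

# "Total Parameters" would be the sum of "params" over all rows.
total_params = sum(r["params"] for r in [row])
```

Recomputing these summaries after every optimizer step is cheap for a model this small, which is presumably what makes a live view practical.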

🎯 Free Inference Lab

Edit the input below to see the model's next-token prediction probabilities update in real time.
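
Next-token probabilities are typically obtained by applying a softmax to the model's output logits. The sketch below shows that step in isolation, assuming a ten-token digit vocabulary and made-up logits; neither is taken from the page.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Assumed vocabulary: the ten digit characters.
vocab = list("0123456789")

# Made-up logits in which the token "6" scores highest.
logits = [0.0] * 10
logits[6] = 2.0

probs = softmax(logits)
best = vocab[max(range(len(probs)), key=probs.__getitem__)]
```

Each time the input is edited, the model would produce new logits and this distribution would be recomputed.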

🧊 3D Flow

1st Digit Prediction

2nd Digit Prediction (based on 1st)
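
The two panels above suggest autoregressive decoding: the first digit is predicted from the prompt, then appended to it before the second digit is predicted. The sketch below shows that loop with a toy stand-in for the trained model. The prompt format `7*8=`, the zero-padded two-digit answer, and the function names are all assumptions made for illustration.

```python
def toy_next_digit_probs(prompt):
    """Hypothetical stand-in for the trained model.

    Returns 10 probabilities, one per digit, with most of the mass on
    the correct next digit of the product.
    """
    left, generated = prompt.split("=")
    a, b = left.split("*")
    answer = f"{int(a) * int(b):02d}"  # zero-padded to two digits (assumption)
    target = int(answer[len(generated)])
    return [0.91 if d == target else 0.01 for d in range(10)]

def greedy_two_digits(question):
    """Predict two digits, feeding the first back in before the second."""
    prompt = question
    for _ in range(2):
        probs = toy_next_digit_probs(prompt)
        digit = max(range(10), key=probs.__getitem__)  # greedy choice
        prompt += str(digit)  # the second prediction is conditioned on this digit
    return prompt

result = greedy_two_digits("7*8=")
```

Because the first digit is appended to the prompt, a wrong first digit changes the context for the second prediction, which is why the second panel is labelled as based on the first.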

📖 Project Overview