Sanity Checking: | | 0/? [00:00<?, ?it/s]
Sanity Checking: | | 0/? [00:00<?, ?it/s]
Sanity Checking DataLoader 0: 0%| | 0/1 [00:00<?, ?it/s]
Sanity Checking DataLoader 0: 100%|##########| 1/1 [00:00<00:00, 21.57it/s]
Training: | | 0/? [00:00<?, ?it/s]
Training: | | 0/? [00:00<?, ?it/s]
Epoch 0: 0%| | 0/26 [00:00<?, ?it/s]
Epoch 0: 4%|3 | 1/26 [00:00<00:03, 8.01it/s]
Epoch 0: 4%|3 | 1/26 [00:00<00:03, 7.94it/s, train_loss_step=423.0]
Epoch 0: 8%|7 | 2/26 [00:00<00:02, 8.33it/s, train_loss_step=423.0]
Epoch 0: 8%|7 | 2/26 [00:00<00:02, 8.32it/s, train_loss_step=227.0]
Epoch 0: 12%|#1 | 3/26 [00:00<00:02, 8.84it/s, train_loss_step=227.0]
Epoch 0: 12%|#1 | 3/26 [00:00<00:02, 8.84it/s, train_loss_step=254.0]
Epoch 0: 15%|#5 | 4/26 [00:00<00:02, 8.19it/s, train_loss_step=254.0]
Epoch 0: 15%|#5 | 4/26 [00:00<00:02, 8.19it/s, train_loss_step=226.0]
Epoch 0: 19%|#9 | 5/26 [00:00<00:02, 7.69it/s, train_loss_step=226.0]
Epoch 0: 19%|#9 | 5/26 [00:00<00:02, 7.68it/s, train_loss_step=209.0]
Epoch 0: 23%|##3 | 6/26 [00:00<00:02, 7.37it/s, train_loss_step=209.0]
Epoch 0: 23%|##3 | 6/26 [00:00<00:02, 7.36it/s, train_loss_step=232.0]
Epoch 0: 27%|##6 | 7/26 [00:00<00:02, 7.19it/s, train_loss_step=232.0]
Epoch 0: 27%|##6 | 7/26 [00:00<00:02, 7.19it/s, train_loss_step=237.0]
Epoch 0: 31%|### | 8/26 [00:01<00:02, 7.07it/s, train_loss_step=237.0]
Epoch 0: 31%|### | 8/26 [00:01<00:02, 7.07it/s, train_loss_step=224.0]
Epoch 0: 35%|###4 | 9/26 [00:01<00:02, 6.96it/s, train_loss_step=224.0]
Epoch 0: 35%|###4 | 9/26 [00:01<00:02, 6.96it/s, train_loss_step=250.0]
Epoch 0: 38%|###8 | 10/26 [00:01<00:02, 6.89it/s, train_loss_step=250.0]
Epoch 0: 38%|###8 | 10/26 [00:01<00:02, 6.89it/s, train_loss_step=227.0]
Epoch 0: 42%|####2 | 11/26 [00:01<00:02, 6.82it/s, train_loss_step=227.0]
Epoch 0: 42%|####2 | 11/26 [00:01<00:02, 6.82it/s, train_loss_step=234.0]
Epoch 0: 46%|####6 | 12/26 [00:01<00:02, 6.85it/s, train_loss_step=234.0]
Epoch 0: 46%|####6 | 12/26 [00:01<00:02, 6.85it/s, train_loss_step=214.0]
Epoch 0: 50%|##### | 13/26 [00:01<00:01, 7.05it/s, train_loss_step=214.0]
Epoch 0: 50%|##### | 13/26 [00:01<00:01, 7.05it/s, train_loss_step=215.0]
Epoch 0: 54%|#####3 | 14/26 [00:01<00:01, 7.06it/s, train_loss_step=215.0]
Epoch 0: 54%|#####3 | 14/26 [00:01<00:01, 7.06it/s, train_loss_step=275.0]
Epoch 0: 58%|#####7 | 15/26 [00:02<00:01, 6.98it/s, train_loss_step=275.0]
Epoch 0: 58%|#####7 | 15/26 [00:02<00:01, 6.98it/s, train_loss_step=191.0]
Epoch 0: 62%|######1 | 16/26 [00:02<00:01, 6.92it/s, train_loss_step=191.0]
Epoch 0: 62%|######1 | 16/26 [00:02<00:01, 6.92it/s, train_loss_step=201.0]
Epoch 0: 65%|######5 | 17/26 [00:02<00:01, 6.86it/s, train_loss_step=201.0]
Epoch 0: 65%|######5 | 17/26 [00:02<00:01, 6.86it/s, train_loss_step=222.0]
Epoch 0: 69%|######9 | 18/26 [00:02<00:01, 6.78it/s, train_loss_step=222.0]
Epoch 0: 69%|######9 | 18/26 [00:02<00:01, 6.78it/s, train_loss_step=204.0]
Epoch 0: 73%|#######3 | 19/26 [00:02<00:01, 6.73it/s, train_loss_step=204.0]
Epoch 0: 73%|#######3 | 19/26 [00:02<00:01, 6.73it/s, train_loss_step=230.0]
Epoch 0: 77%|#######6 | 20/26 [00:03<00:00, 6.66it/s, train_loss_step=230.0]
Epoch 0: 77%|#######6 | 20/26 [00:03<00:00, 6.66it/s, train_loss_step=202.0]
Epoch 0: 81%|######## | 21/26 [00:03<00:00, 6.72it/s, train_loss_step=202.0]
Epoch 0: 81%|######## | 21/26 [00:03<00:00, 6.72it/s, train_loss_step=215.0]
Epoch 0: 85%|########4 | 22/26 [00:03<00:00, 6.84it/s, train_loss_step=215.0]
Epoch 0: 85%|########4 | 22/26 [00:03<00:00, 6.84it/s, train_loss_step=232.0]
Epoch 0: 88%|########8 | 23/26 [00:03<00:00, 6.95it/s, train_loss_step=232.0]
Epoch 0: 88%|########8 | 23/26 [00:03<00:00, 6.95it/s, train_loss_step=204.0]
Epoch 0: 92%|#########2| 24/26 [00:03<00:00, 7.07it/s, train_loss_step=204.0]
Epoch 0: 92%|#########2| 24/26 [00:03<00:00, 7.07it/s, train_loss_step=181.0]
Epoch 0: 96%|#########6| 25/26 [00:03<00:00, 7.17it/s, train_loss_step=181.0]
Epoch 0: 96%|#########6| 25/26 [00:03<00:00, 7.17it/s, train_loss_step=216.0]
Epoch 0: 100%|##########| 26/26 [00:03<00:00, 7.18it/s, train_loss_step=216.0]
Epoch 0: 100%|##########| 26/26 [00:03<00:00, 7.18it/s, train_loss_step=207.0]
Validation: | | 0/? [00:00<?, ?it/s][A
Validation: | | 0/? [00:00<?, ?it/s][A
Validation DataLoader 0: 0%| | 0/1 [00:00<?, ?it/s][A
Validation DataLoader 0: 100%|##########| 1/1 [00:00<00:00, 25.59it/s][A
[A
Epoch 0: 100%|##########| 26/26 [00:03<00:00, 7.08it/s, train_loss_step=207.0, val_loss=346.0]
Epoch 0: 100%|##########| 26/26 [00:03<00:00, 7.08it/s, train_loss_step=207.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 0: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=207.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=207.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 4%|3 | 1/26 [00:00<00:04, 6.13it/s, train_loss_step=207.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 4%|3 | 1/26 [00:00<00:04, 6.13it/s, train_loss_step=205.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 8%|7 | 2/26 [00:00<00:03, 6.14it/s, train_loss_step=205.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 8%|7 | 2/26 [00:00<00:03, 6.13it/s, train_loss_step=226.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 12%|#1 | 3/26 [00:00<00:03, 6.12it/s, train_loss_step=226.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 12%|#1 | 3/26 [00:00<00:03, 6.12it/s, train_loss_step=181.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 15%|#5 | 4/26 [00:00<00:03, 6.14it/s, train_loss_step=181.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 15%|#5 | 4/26 [00:00<00:03, 6.14it/s, train_loss_step=229.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 19%|#9 | 5/26 [00:00<00:03, 6.22it/s, train_loss_step=229.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 19%|#9 | 5/26 [00:00<00:03, 6.22it/s, train_loss_step=193.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 23%|##3 | 6/26 [00:00<00:03, 6.32it/s, train_loss_step=193.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 23%|##3 | 6/26 [00:00<00:03, 6.32it/s, train_loss_step=197.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 27%|##6 | 7/26 [00:01<00:02, 6.65it/s, train_loss_step=197.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 27%|##6 | 7/26 [00:01<00:02, 6.65it/s, train_loss_step=219.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 31%|### | 8/26 [00:01<00:02, 7.00it/s, train_loss_step=219.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 31%|### | 8/26 [00:01<00:02, 7.00it/s, train_loss_step=203.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 35%|###4 | 9/26 [00:01<00:02, 7.22it/s, train_loss_step=203.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 35%|###4 | 9/26 [00:01<00:02, 7.22it/s, train_loss_step=203.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 38%|###8 | 10/26 [00:01<00:02, 7.29it/s, train_loss_step=203.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 38%|###8 | 10/26 [00:01<00:02, 7.29it/s, train_loss_step=190.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 42%|####2 | 11/26 [00:01<00:02, 7.26it/s, train_loss_step=190.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 42%|####2 | 11/26 [00:01<00:02, 7.26it/s, train_loss_step=208.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 46%|####6 | 12/26 [00:01<00:01, 7.32it/s, train_loss_step=208.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 46%|####6 | 12/26 [00:01<00:01, 7.32it/s, train_loss_step=187.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 50%|##### | 13/26 [00:01<00:01, 7.42it/s, train_loss_step=187.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 50%|##### | 13/26 [00:01<00:01, 7.42it/s, train_loss_step=206.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 54%|#####3 | 14/26 [00:01<00:01, 7.36it/s, train_loss_step=206.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 54%|#####3 | 14/26 [00:01<00:01, 7.36it/s, train_loss_step=214.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 58%|#####7 | 15/26 [00:01<00:01, 7.54it/s, train_loss_step=214.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 58%|#####7 | 15/26 [00:01<00:01, 7.54it/s, train_loss_step=219.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 62%|######1 | 16/26 [00:02<00:01, 7.61it/s, train_loss_step=219.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 62%|######1 | 16/26 [00:02<00:01, 7.61it/s, train_loss_step=196.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 65%|######5 | 17/26 [00:02<00:01, 7.65it/s, train_loss_step=196.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 65%|######5 | 17/26 [00:02<00:01, 7.64it/s, train_loss_step=227.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 69%|######9 | 18/26 [00:02<00:01, 7.58it/s, train_loss_step=227.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 69%|######9 | 18/26 [00:02<00:01, 7.58it/s, train_loss_step=192.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 73%|#######3 | 19/26 [00:02<00:00, 7.52it/s, train_loss_step=192.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 73%|#######3 | 19/26 [00:02<00:00, 7.52it/s, train_loss_step=168.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 77%|#######6 | 20/26 [00:02<00:00, 7.48it/s, train_loss_step=168.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 77%|#######6 | 20/26 [00:02<00:00, 7.48it/s, train_loss_step=208.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 81%|######## | 21/26 [00:02<00:00, 7.43it/s, train_loss_step=208.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 81%|######## | 21/26 [00:02<00:00, 7.43it/s, train_loss_step=183.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 85%|########4 | 22/26 [00:02<00:00, 7.37it/s, train_loss_step=183.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 85%|########4 | 22/26 [00:02<00:00, 7.37it/s, train_loss_step=180.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 88%|########8 | 23/26 [00:03<00:00, 7.32it/s, train_loss_step=180.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 88%|########8 | 23/26 [00:03<00:00, 7.32it/s, train_loss_step=171.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 92%|#########2| 24/26 [00:03<00:00, 7.28it/s, train_loss_step=171.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 92%|#########2| 24/26 [00:03<00:00, 7.28it/s, train_loss_step=224.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 96%|#########6| 25/26 [00:03<00:00, 7.23it/s, train_loss_step=224.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 96%|#########6| 25/26 [00:03<00:00, 7.23it/s, train_loss_step=190.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 100%|##########| 26/26 [00:03<00:00, 7.19it/s, train_loss_step=190.0, val_loss=346.0, train_loss_epoch=229.0]
Epoch 1: 100%|##########| 26/26 [00:03<00:00, 7.19it/s, train_loss_step=208.0, val_loss=346.0, train_loss_epoch=229.0]
Validation: | | 0/? [00:00<?, ?it/s][A
Validation: | | 0/? [00:00<?, ?it/s][A
Validation DataLoader 0: 0%| | 0/1 [00:00<?, ?it/s][A
Validation DataLoader 0: 100%|##########| 1/1 [00:00<00:00, 32.12it/s][A
[A
Epoch 1: 100%|##########| 26/26 [00:03<00:00, 7.11it/s, train_loss_step=208.0, val_loss=289.0, train_loss_epoch=229.0]
Epoch 1: 100%|##########| 26/26 [00:03<00:00, 7.10it/s, train_loss_step=208.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 1: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=208.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=208.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 4%|3 | 1/26 [00:00<00:04, 5.62it/s, train_loss_step=208.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 4%|3 | 1/26 [00:00<00:04, 5.62it/s, train_loss_step=183.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 8%|7 | 2/26 [00:00<00:03, 6.29it/s, train_loss_step=183.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 8%|7 | 2/26 [00:00<00:03, 6.27it/s, train_loss_step=179.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 12%|#1 | 3/26 [00:00<00:03, 6.05it/s, train_loss_step=179.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 12%|#1 | 3/26 [00:00<00:03, 6.05it/s, train_loss_step=192.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 15%|#5 | 4/26 [00:00<00:03, 5.95it/s, train_loss_step=192.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 15%|#5 | 4/26 [00:00<00:03, 5.95it/s, train_loss_step=228.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 19%|#9 | 5/26 [00:00<00:03, 5.90it/s, train_loss_step=228.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 19%|#9 | 5/26 [00:00<00:03, 5.90it/s, train_loss_step=192.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 23%|##3 | 6/26 [00:01<00:03, 5.80it/s, train_loss_step=192.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 23%|##3 | 6/26 [00:01<00:03, 5.80it/s, train_loss_step=186.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 27%|##6 | 7/26 [00:01<00:03, 5.74it/s, train_loss_step=186.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 27%|##6 | 7/26 [00:01<00:03, 5.74it/s, train_loss_step=198.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 31%|### | 8/26 [00:01<00:03, 5.70it/s, train_loss_step=198.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 31%|### | 8/26 [00:01<00:03, 5.70it/s, train_loss_step=233.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 35%|###4 | 9/26 [00:01<00:03, 5.67it/s, train_loss_step=233.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 35%|###4 | 9/26 [00:01<00:03, 5.67it/s, train_loss_step=187.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 38%|###8 | 10/26 [00:01<00:02, 5.66it/s, train_loss_step=187.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 38%|###8 | 10/26 [00:01<00:02, 5.66it/s, train_loss_step=196.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 42%|####2 | 11/26 [00:01<00:02, 5.66it/s, train_loss_step=196.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 42%|####2 | 11/26 [00:01<00:02, 5.66it/s, train_loss_step=186.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 46%|####6 | 12/26 [00:02<00:02, 5.63it/s, train_loss_step=186.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 46%|####6 | 12/26 [00:02<00:02, 5.63it/s, train_loss_step=186.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 50%|##### | 13/26 [00:02<00:02, 5.63it/s, train_loss_step=186.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 50%|##### | 13/26 [00:02<00:02, 5.63it/s, train_loss_step=260.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 54%|#####3 | 14/26 [00:02<00:02, 5.62it/s, train_loss_step=260.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 54%|#####3 | 14/26 [00:02<00:02, 5.62it/s, train_loss_step=209.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 58%|#####7 | 15/26 [00:02<00:01, 5.62it/s, train_loss_step=209.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 58%|#####7 | 15/26 [00:02<00:01, 5.62it/s, train_loss_step=189.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 62%|######1 | 16/26 [00:02<00:01, 5.62it/s, train_loss_step=189.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 62%|######1 | 16/26 [00:02<00:01, 5.62it/s, train_loss_step=215.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 65%|######5 | 17/26 [00:03<00:01, 5.63it/s, train_loss_step=215.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 65%|######5 | 17/26 [00:03<00:01, 5.63it/s, train_loss_step=165.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 69%|######9 | 18/26 [00:03<00:01, 5.64it/s, train_loss_step=165.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 69%|######9 | 18/26 [00:03<00:01, 5.64it/s, train_loss_step=196.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 73%|#######3 | 19/26 [00:03<00:01, 5.65it/s, train_loss_step=196.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 73%|#######3 | 19/26 [00:03<00:01, 5.65it/s, train_loss_step=165.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 77%|#######6 | 20/26 [00:03<00:01, 5.66it/s, train_loss_step=165.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 77%|#######6 | 20/26 [00:03<00:01, 5.66it/s, train_loss_step=191.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 81%|######## | 21/26 [00:03<00:00, 5.69it/s, train_loss_step=191.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 81%|######## | 21/26 [00:03<00:00, 5.69it/s, train_loss_step=234.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 85%|########4 | 22/26 [00:03<00:00, 5.72it/s, train_loss_step=234.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 85%|########4 | 22/26 [00:03<00:00, 5.72it/s, train_loss_step=202.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 88%|########8 | 23/26 [00:04<00:00, 5.74it/s, train_loss_step=202.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 88%|########8 | 23/26 [00:04<00:00, 5.74it/s, train_loss_step=189.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 92%|#########2| 24/26 [00:04<00:00, 5.76it/s, train_loss_step=189.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 92%|#########2| 24/26 [00:04<00:00, 5.76it/s, train_loss_step=204.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 96%|#########6| 25/26 [00:04<00:00, 5.80it/s, train_loss_step=204.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 96%|#########6| 25/26 [00:04<00:00, 5.80it/s, train_loss_step=181.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 100%|##########| 26/26 [00:04<00:00, 5.81it/s, train_loss_step=181.0, val_loss=289.0, train_loss_epoch=201.0]
Epoch 2: 100%|##########| 26/26 [00:04<00:00, 5.81it/s, train_loss_step=200.0, val_loss=289.0, train_loss_epoch=201.0]
Validation: | | 0/? [00:00<?, ?it/s][A
Validation: | | 0/? [00:00<?, ?it/s][A
Validation DataLoader 0: 0%| | 0/1 [00:00<?, ?it/s][A
Validation DataLoader 0: 100%|##########| 1/1 [00:00<00:00, 29.24it/s][A
[A
Epoch 2: 100%|##########| 26/26 [00:04<00:00, 5.76it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=201.0]
Epoch 2: 100%|##########| 26/26 [00:04<00:00, 5.75it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 2: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 4%|3 | 1/26 [00:00<00:04, 6.22it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 4%|3 | 1/26 [00:00<00:04, 6.22it/s, train_loss_step=222.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 8%|7 | 2/26 [00:00<00:03, 7.10it/s, train_loss_step=222.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 8%|7 | 2/26 [00:00<00:03, 7.10it/s, train_loss_step=196.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 12%|#1 | 3/26 [00:00<00:02, 7.69it/s, train_loss_step=196.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 12%|#1 | 3/26 [00:00<00:02, 7.69it/s, train_loss_step=193.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 15%|#5 | 4/26 [00:00<00:02, 8.03it/s, train_loss_step=193.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 15%|#5 | 4/26 [00:00<00:02, 8.03it/s, train_loss_step=199.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 19%|#9 | 5/26 [00:00<00:02, 7.65it/s, train_loss_step=199.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 19%|#9 | 5/26 [00:00<00:02, 7.65it/s, train_loss_step=225.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 23%|##3 | 6/26 [00:00<00:02, 7.50it/s, train_loss_step=225.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 23%|##3 | 6/26 [00:00<00:02, 7.50it/s, train_loss_step=179.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 27%|##6 | 7/26 [00:00<00:02, 7.42it/s, train_loss_step=179.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 27%|##6 | 7/26 [00:00<00:02, 7.41it/s, train_loss_step=241.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 31%|### | 8/26 [00:01<00:02, 7.30it/s, train_loss_step=241.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 31%|### | 8/26 [00:01<00:02, 7.29it/s, train_loss_step=189.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 35%|###4 | 9/26 [00:01<00:02, 7.25it/s, train_loss_step=189.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 35%|###4 | 9/26 [00:01<00:02, 7.25it/s, train_loss_step=185.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 38%|###8 | 10/26 [00:01<00:02, 7.32it/s, train_loss_step=185.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 38%|###8 | 10/26 [00:01<00:02, 7.32it/s, train_loss_step=216.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 42%|####2 | 11/26 [00:01<00:02, 7.26it/s, train_loss_step=216.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 42%|####2 | 11/26 [00:01<00:02, 7.26it/s, train_loss_step=201.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 46%|####6 | 12/26 [00:01<00:01, 7.19it/s, train_loss_step=201.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 46%|####6 | 12/26 [00:01<00:01, 7.18it/s, train_loss_step=186.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 50%|##### | 13/26 [00:01<00:01, 7.09it/s, train_loss_step=186.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 50%|##### | 13/26 [00:01<00:01, 7.09it/s, train_loss_step=198.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 54%|#####3 | 14/26 [00:02<00:01, 7.00it/s, train_loss_step=198.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 54%|#####3 | 14/26 [00:02<00:01, 7.00it/s, train_loss_step=204.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 58%|#####7 | 15/26 [00:02<00:01, 6.99it/s, train_loss_step=204.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 58%|#####7 | 15/26 [00:02<00:01, 6.99it/s, train_loss_step=194.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 62%|######1 | 16/26 [00:02<00:01, 6.98it/s, train_loss_step=194.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 62%|######1 | 16/26 [00:02<00:01, 6.98it/s, train_loss_step=197.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 65%|######5 | 17/26 [00:02<00:01, 7.00it/s, train_loss_step=197.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 65%|######5 | 17/26 [00:02<00:01, 7.00it/s, train_loss_step=211.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 69%|######9 | 18/26 [00:02<00:01, 7.02it/s, train_loss_step=211.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 69%|######9 | 18/26 [00:02<00:01, 7.02it/s, train_loss_step=194.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 73%|#######3 | 19/26 [00:02<00:00, 7.04it/s, train_loss_step=194.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 73%|#######3 | 19/26 [00:02<00:00, 7.04it/s, train_loss_step=183.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 77%|#######6 | 20/26 [00:02<00:00, 7.11it/s, train_loss_step=183.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 77%|#######6 | 20/26 [00:02<00:00, 7.11it/s, train_loss_step=174.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 81%|######## | 21/26 [00:02<00:00, 7.24it/s, train_loss_step=174.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 81%|######## | 21/26 [00:02<00:00, 7.24it/s, train_loss_step=183.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 85%|########4 | 22/26 [00:03<00:00, 7.33it/s, train_loss_step=183.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 85%|########4 | 22/26 [00:03<00:00, 7.32it/s, train_loss_step=188.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 88%|########8 | 23/26 [00:03<00:00, 7.39it/s, train_loss_step=188.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 88%|########8 | 23/26 [00:03<00:00, 7.39it/s, train_loss_step=173.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 92%|#########2| 24/26 [00:03<00:00, 7.47it/s, train_loss_step=173.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 92%|#########2| 24/26 [00:03<00:00, 7.47it/s, train_loss_step=236.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 96%|#########6| 25/26 [00:03<00:00, 7.52it/s, train_loss_step=236.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 96%|#########6| 25/26 [00:03<00:00, 7.52it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 100%|##########| 26/26 [00:03<00:00, 7.61it/s, train_loss_step=200.0, val_loss=333.0, train_loss_epoch=198.0]
Epoch 3: 100%|##########| 26/26 [00:03<00:00, 7.61it/s, train_loss_step=177.0, val_loss=333.0, train_loss_epoch=198.0]
Validation: | | 0/? [00:00<?, ?it/s][A
Validation: | | 0/? [00:00<?, ?it/s][A
Validation DataLoader 0: 0%| | 0/1 [00:00<?, ?it/s][A
Validation DataLoader 0: 100%|##########| 1/1 [00:00<00:00, 40.54it/s][A
[A
Epoch 3: 100%|##########| 26/26 [00:03<00:00, 7.54it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 3: 100%|##########| 26/26 [00:03<00:00, 7.53it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 3: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 0%| | 0/26 [00:00<?, ?it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 4%|3 | 1/26 [00:00<00:02, 11.07it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 4%|3 | 1/26 [00:00<00:02, 11.07it/s, train_loss_step=239.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 8%|7 | 2/26 [00:00<00:02, 8.90it/s, train_loss_step=239.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 8%|7 | 2/26 [00:00<00:02, 8.85it/s, train_loss_step=193.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 12%|#1 | 3/26 [00:00<00:02, 8.24it/s, train_loss_step=193.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 12%|#1 | 3/26 [00:00<00:02, 8.24it/s, train_loss_step=181.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 15%|#5 | 4/26 [00:00<00:02, 7.89it/s, train_loss_step=181.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 15%|#5 | 4/26 [00:00<00:02, 7.89it/s, train_loss_step=188.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 19%|#9 | 5/26 [00:00<00:02, 7.69it/s, train_loss_step=188.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 19%|#9 | 5/26 [00:00<00:02, 7.68it/s, train_loss_step=179.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 23%|##3 | 6/26 [00:00<00:02, 7.60it/s, train_loss_step=179.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 23%|##3 | 6/26 [00:00<00:02, 7.59it/s, train_loss_step=220.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 27%|##6 | 7/26 [00:00<00:02, 7.50it/s, train_loss_step=220.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 27%|##6 | 7/26 [00:00<00:02, 7.49it/s, train_loss_step=187.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 31%|### | 8/26 [00:01<00:02, 7.42it/s, train_loss_step=187.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 31%|### | 8/26 [00:01<00:02, 7.42it/s, train_loss_step=222.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 35%|###4 | 9/26 [00:01<00:02, 7.35it/s, train_loss_step=222.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 35%|###4 | 9/26 [00:01<00:02, 7.35it/s, train_loss_step=215.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 38%|###8 | 10/26 [00:01<00:02, 7.32it/s, train_loss_step=215.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 38%|###8 | 10/26 [00:01<00:02, 7.32it/s, train_loss_step=178.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 42%|####2 | 11/26 [00:01<00:02, 7.31it/s, train_loss_step=178.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 42%|####2 | 11/26 [00:01<00:02, 7.31it/s, train_loss_step=178.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 46%|####6 | 12/26 [00:01<00:01, 7.29it/s, train_loss_step=178.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 46%|####6 | 12/26 [00:01<00:01, 7.29it/s, train_loss_step=174.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 50%|##### | 13/26 [00:01<00:01, 7.31it/s, train_loss_step=174.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 50%|##### | 13/26 [00:01<00:01, 7.31it/s, train_loss_step=182.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 54%|#####3 | 14/26 [00:01<00:01, 7.32it/s, train_loss_step=182.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 54%|#####3 | 14/26 [00:01<00:01, 7.32it/s, train_loss_step=207.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 58%|#####7 | 15/26 [00:02<00:01, 7.34it/s, train_loss_step=207.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 58%|#####7 | 15/26 [00:02<00:01, 7.34it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 62%|######1 | 16/26 [00:02<00:01, 7.37it/s, train_loss_step=177.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 62%|######1 | 16/26 [00:02<00:01, 7.36it/s, train_loss_step=203.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 65%|######5 | 17/26 [00:02<00:01, 7.40it/s, train_loss_step=203.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 65%|######5 | 17/26 [00:02<00:01, 7.40it/s, train_loss_step=218.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 69%|######9 | 18/26 [00:02<00:01, 7.50it/s, train_loss_step=218.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 69%|######9 | 18/26 [00:02<00:01, 7.50it/s, train_loss_step=232.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 73%|#######3 | 19/26 [00:02<00:00, 7.54it/s, train_loss_step=232.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 73%|#######3 | 19/26 [00:02<00:00, 7.54it/s, train_loss_step=198.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 77%|#######6 | 20/26 [00:02<00:00, 7.58it/s, train_loss_step=198.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 77%|#######6 | 20/26 [00:02<00:00, 7.57it/s, train_loss_step=193.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 81%|######## | 21/26 [00:02<00:00, 7.61it/s, train_loss_step=193.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 81%|######## | 21/26 [00:02<00:00, 7.60it/s, train_loss_step=180.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 85%|########4 | 22/26 [00:02<00:00, 7.62it/s, train_loss_step=180.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 85%|########4 | 22/26 [00:02<00:00, 7.62it/s, train_loss_step=187.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 88%|########8 | 23/26 [00:02<00:00, 7.68it/s, train_loss_step=187.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 88%|########8 | 23/26 [00:02<00:00, 7.67it/s, train_loss_step=187.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 92%|#########2| 24/26 [00:03<00:00, 7.73it/s, train_loss_step=187.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 92%|#########2| 24/26 [00:03<00:00, 7.73it/s, train_loss_step=196.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 96%|#########6| 25/26 [00:03<00:00, 7.75it/s, train_loss_step=196.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 96%|#########6| 25/26 [00:03<00:00, 7.75it/s, train_loss_step=226.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 100%|##########| 26/26 [00:03<00:00, 7.82it/s, train_loss_step=226.0, val_loss=323.0, train_loss_epoch=198.0]
Epoch 4: 100%|##########| 26/26 [00:03<00:00, 7.82it/s, train_loss_step=198.0, val_loss=323.0, train_loss_epoch=198.0]
Validation: | | 0/? [00:00<?, ?it/s][A
Validation: | | 0/? [00:00<?, ?it/s][A
Validation DataLoader 0: 0%| | 0/1 [00:00<?, ?it/s][A
Validation DataLoader 0: 100%|##########| 1/1 [00:00<00:00, 45.35it/s][A
[A
Epoch 4: 100%|##########| 26/26 [00:03<00:00, 7.75it/s, train_loss_step=198.0, val_loss=310.0, train_loss_epoch=198.0]
Epoch 4: 100%|##########| 26/26 [00:03<00:00, 7.74it/s, train_loss_step=198.0, val_loss=310.0, train_loss_epoch=198.0]
Epoch 4: 100%|##########| 26/26 [00:03<00:00, 7.74it/s, train_loss_step=198.0, val_loss=310.0, train_loss_epoch=198.0]