So it shows very high energy and also very low energy values around 1000Hz band.
I would not expect these “very low energy values”.
I would expect a single band around 1000Hz with constant high energy value.
FYI: MEL spectogram for same audio fragment in audacity: