I took your advice and loop from different video outputs. It seems to work with 3 of them: vdpau, xv and X11:
~ mpv --vo=vdpau 1.mp4 ✔
(+) Video --vid=1 (*) (h264 1280x720 23.976fps)
(+) Audio --aid=1 --alang=jpn (*) (aac 2ch 44100Hz)
[vo/vdpau] Warning: this compatibility VO is low quality and may have issues with OSD, scaling, screenshots and more.
[vo/vdpau] vo=gpu is the preferred choice in any case and includes VDPAU support via hwdec=vdpau or vdpau-copy.
AO: [pulse] 44100Hz stereo 2ch float
VO: [vdpau] 1280x720 yuv420p
AV: 00:00:04 / 00:46:51 (0%) A-V: 0.496 Dropped: 3
Audio/Video desynchronisation detected! Possible reasons include too slow
hardware, temporary CPU spikes, broken drivers, and broken files. Audio
position will not match to the video (see A-V status field).
AV: 00:00:46 / 00:46:51 (2%) A-V: 1.286 Dropped: 466
Exiting... (Quit)
~ mpv --vo=vaapi 1.mp4 4 ✘ 1m 0s
(+) Video --vid=1 (*) (h264 1280x720 23.976fps)
(+) Audio --aid=1 --alang=jpn (*) (aac 2ch 44100Hz)
[vaapi] Failed to initialize VAAPI: unknown libva error
Error opening/initializing the selected video_out (--vo) device.
Video: no video
Exiting... (Errors when loading file)
~ mpv --vo=sdl 1.mp4 2 ✘
(+) Video --vid=1 (*) (h264 1280x720 23.976fps)
(+) Audio --aid=1 --alang=jpn (*) (aac 2ch 44100Hz)
[vo] Video output sdl not found!
Error opening/initializing the selected video_out (--vo) device.
Video: no video
Exiting... (Errors when loading file)
~ mpv --vo=xv 1.mp4 2 ✘
(+) Video --vid=1 (*) (h264 1280x720 23.976fps)
(+) Audio --aid=1 --alang=jpn (*) (aac 2ch 44100Hz)
[vo/xv] Warning: this legacy VO has bad quality and performance, and will in particular result in blurry OSD and subtitles. You should fix your graphics drivers, or not force the xv VO.
AO: [pulse] 44100Hz stereo 2ch float
VO: [xv] 1280x720 yuv420p
AV: 00:01:34 / 00:46:51 (3%) A-V: 0.000 Dropped: 2
Exiting... (Quit)
~ mpv --vo=x11 1.mp4 4 ✘ 1m 36s
(+) Video --vid=1 (*) (h264 1280x720 23.976fps)
(+) Audio --aid=1 --alang=jpn (*) (aac 2ch 44100Hz)
[vo/x11] Warning: this legacy VO has bad performance. Consider fixing your graphics drivers, or not forcing the x11 VO.
AO: [pulse] 44100Hz stereo 2ch float
VO: [x11] 1280x720 yuv420p
AV: 00:01:00 / 00:46:51 (2%) A-V: 0.479 Dropped: 488
Audio/Video desynchronisation detected! Possible reasons include too slow
hardware, temporary CPU spikes, broken drivers, and broken files. Audio
position will not match to the video (see A-V status field).
AV: 00:01:18 / 00:46:51 (3%) A-V: 0.000
[vo/x11] can't keep up! Waiting for XShm completion events...
AV: 00:08:10 / 00:46:51 (17%) A-V: 0.000 Dropped: 119
Exiting... (Quit)