The MIDV datasets solve this by using "mock" documents that mimic the layout and security features of real ones but contain artificial data. While earlier versions like focused on basic recognition, newer iterations like
Ground truth data including document boundary quadrangles (IoU metrics), text field positions, and facial ovals. midv250