Skip to content

The model hallucinates blank images and creates output out of thin air #152

@SleepEarlyLiveLong

Description

@SleepEarlyLiveLong

When using the ParseQ model for inference, if a blank image is input, the output content is out of thin air.
image

To solve this problem, I generated 10,000 blank images of varying lengths and set their labels to a single space ‘ ’, but these data were ignored when finetuning the model:

# We filter out samples which don't contain any supported characters

image

How can I solve this problem?
Should I simply set remove_whitespace to false?
For blank images of varying lengths, should I set their labels to a single space or to different numbers of spaces depending on the length of the image?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions