Skip to content

Compression Ration when data file use a few alphabet is bad #3203

@shulib

Description

@shulib

Describe the bug
A clear and concise description of what the bug is.
To Reproduce
Steps to reproduce the behavior:

  1. Downloads data 'file which contains only 104 size two characters which its ascii 0x00 and 0x01 '
  2. Run '...' with flags 'without flags'
  3. Scroll up on the log to '....'
  4. See error
    the error is:
    Frames Skips Compressed Uncompressed Ratio Check Filename
    1 0 64 B 104 B 1.625 None try78__1_2.zst
    literal section compressed to raw
    Expected behavior
    I expect that the literal section to be compressed by Huffman or FSE.
    if I do that handily - I got after decompress and run with log
    Frames Skips Compressed Uncompressed Ratio Check Filename
    1 0 29 B 104 B 3.586 None try78__1.zst
    the file decompressed successfully
    I expect to larger compression ration in that case!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions