🚀 BigBear Unicode Scanner 2.x Series Major Updates (incl. 2.1.1 release!)

I just shipped version 2.1.1 of the BigBear Unicode Security Scanner, bringing a major fix for binary file false positives and a bunch of new safety and usability improvements. Thanks to the BigBearCommunity for ongoing feedback and for helping make this tool stronger and more secure!


:new_button: 2.1.1 – 2025-10-30: Binary File False Positive Fix

What’s new:

  • :prohibited: Automatic Binary File Skipping
    The scanner now skips binary files by default—no more false positive alerts from archives, images, videos, executables, or docs!
    • Files now auto-skipped:
      Archives (.jar, .war, .ear, .zip, .tar, .gz, etc.)
      Images (.jpg, .png, .svg, .webp, etc.)
      Video/Audio (.mp4, .avi, .mp3, .flac, etc.)
      Executables (.exe, .dll, .so, etc.)
      Fonts (.ttf, .woff, .otf, etc.)
      Binary docs (.pdf, .docx, .xls, etc.)
  • :wrench: New --include-binary Flag
    Want to scan binaries anyway? You can opt-in for deep scans.
  • :test_tube: Better Test Coverage
    Added tests to ensure binary files are skipped correctly (unless you enable --include-binary).
  • :bug: Other Fixes:
    • No more Unicode false positives in .jar/.zip/.png/.pdf, etc.
    • Shell scripts are now accurately detected as non-binary when appropriate.
    • Improved binary detection: uses both file extension & MIME type for checks.
  • :magnifying_glass_tilted_left: Internals:
    • Upgraded detection logic—new is_binary_file() for comprehensive checks.
  • :books: Docs & Help:
    • Expanded README and CLI help for binary scanning.
    • Test suite expanded from 9 to 11 tests.
  • :locked: Security Notes:
    • Text-based Unicode threats and AI attacks are still detected—smart defaults keep you safe!
    • Full control: Force binary scanning any time, if needed.

:new_button: 2.1.0 – 2025-10-23: False Positive Fixes for Emoji, Documentation & UI

  • :bullseye: Context-aware emoji detection & exclusion
  • :memo: Unicode skipping options for docs (smart quotes, dashes, ellipsis, etc.)
  • :clipboard: New allowlist template for legit Unicode usages (UI, docs, i18n, math, etc.)
  • :test_tube: Expanded test suite for emoji and typography edge cases

:new_button: 2.0.0 – 2024: Massive AI+ Unicode Security Overhaul

  • :robot: Detects 150+ risky Unicode patterns and AI injection exploits
  • :globe_with_meridians: Homograph attack coverage (Cyrillic, Greek, Armenian, Thai lookalikes)
  • :magnifying_glass_tilted_left: Enhanced CLI w/ JSON output, severity filters, allowlists, and easy CI/CD integration
  • :books: Documentation & usage examples for everything

Full changelog and doc updates here:


:shield: I appreciate everyone in the BigBearCommunity for helping make this project more secure, accurate, and developer-friendly! If you run into edge cases or have feedback for the next release, please reach out in the forum or GitHub.

:hot_beverage: If you find this project helpful, please consider donating to support my work: https://ko-fi.com/bigbeartechworld