Test coverage gaps accumulate in every codebase. New code ships with insufficient tests because writing thorough tests takes time and slows delivery in the short term. The deferred cost is paid later, in bugs that reach production and in deploys that become slow and risky.
The specific challenge with test generation is that it requires understanding the intent of the code, not just its structure. A test that merely achieves line coverage but does not test meaningful behavior provides false confidence. Good tests verify behavior under edge cases, with unexpected inputs, and across error paths.
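To make the distinction concrete, here is a minimal sketch in Python. The function `parse_percentage` and both tests are hypothetical, invented purely for illustration. The first test executes every line of the function yet asserts nothing, so it would still pass if the function returned the wrong value. The second checks the intended behavior at the boundaries and on the error paths.

```python
def parse_percentage(text: str) -> float:
    """Convert a string such as '42%' into a fraction between 0.0 and 1.0.

    Hypothetical function used only to illustrate the two testing styles.
    """
    cleaned = text.strip()
    if not cleaned.endswith("%"):
        raise ValueError(f"missing percent sign: {text!r}")
    value = float(cleaned[:-1])  # float() raises ValueError on non-numeric input
    if not 0 <= value <= 100:
        raise ValueError(f"out of range: {text!r}")
    return value / 100


def test_coverage_only():
    # Touches every line of parse_percentage but verifies nothing:
    # the return value is discarded and the exceptions are swallowed.
    parse_percentage("50%")
    for bad in ("50", "150%"):
        try:
            parse_percentage(bad)
        except ValueError:
            pass


def test_behavior():
    # Typical input plus both boundaries of the valid range.
    assert parse_percentage("50%") == 0.5
    assert parse_percentage("0%") == 0.0
    assert parse_percentage("100%") == 1.0
    # Surrounding whitespace is tolerated.
    assert parse_percentage("  25% ") == 0.25
    # Error paths: each malformed or out-of-range input must be rejected.
    for bad in ("50", "abc%", "-1%", "100.5%", "", "%"):
        try:
            parse_percentage(bad)
        except ValueError:
            continue
        raise AssertionError(f"expected ValueError for {bad!r}")


if __name__ == "__main__":
    test_coverage_only()
    test_behavior()
```

A line coverage tool would report the same percentage for either test, which is why the coverage figure alone says little about whether behavior is actually verified.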