
Strategic Overview: Enterprise technology narratives claim that the proliferation of generative AI will trigger an exponential spike in corporate data volumes and subsequent backup infrastructure costs. However, empirical telemetry from the Keepit Annual Data Report reveals a far more controlled reality. Backup volumes are not exploding—and the data exposes exactly why.
1. The 2% Equilibrium: Predictable Linear Growth
When an enterprise initially onboards its SaaS estate (such as Microsoft 365, Google Workspace, or Salesforce) to Keepit, the baseline ingestion represents 100% of the active data footprint. Following this initial ingestion, the daily change rate settles down immediately. Across all global enterprise tenants, subsequent daily incremental backups average a stable change rate of approximately 2% relative to the original baseline.
This linear progression is maintained due to two key structural mechanisms:
- Intelligent SaaS Infrastructure: Native cloud ecosystems handle high-frequency file compression and intelligent version control internally, suppressing the raw storage overhead before it transits the network.
- Always-Incremental Ingestion: Keepit’s architecture isolates changed blocks or objects natively. This allows enterprises to bypass traditional consumption-based storage billing and optimize predictability.
2. Deconstructing the Delta: File Count vs. Ingested Bytes
Analyzing what actually populates that daily 2% delta reveals a counter-intuitive split between file volume and actual bit weight. The metrics expose a highly specific behavioral pattern in corporate document storage:
| Metric Classification | File Count Percentage | Actual Ingested Bytes Weight |
|---|---|---|
| Entirely New Assets | 42% of daily file additions | ~68% of total ingested data weight |
| Modified Existing Assets | 58% of daily file modifications | ~32% of total ingested data weight |
The operational takeaway is clear: the vast majority of active user modifications occur within lightweight, text-centric files (e.g., transactional documents, spreadsheets, and collaborative team communication text). Conversely, large, data-heavy assets—such as media renders, raw exports, and massive archive structures—are typically written once, stored permanently, and rarely altered.
3. The AI Air-Gap: Reality vs. Speculation
Why has generative AI failed to break standard backup cost models? The answer lies in the current topology of enterprise AI workflows:
Today, the overwhelming majority of generative data—including raw user prompts, high-dimensional vector embeddings, context states, and intermediate model artifacts—resides natively within the isolated infrastructure of individual AI cloud providers. This data does not land automatically inside tenant storage structures like SharePoint Online or OneDrive for Business unless a user intentionally exports it.
Even when AI-generated content enters the corporate tenant ecosystem, it typically acts as a substitute for traditional document creation rather than an exponential multiplier. An AI-authored summary simply takes the place of a manual brief; it does not double the baseline environment footprint. While generative workflows favor creation over modification, the shift is minor and easily accommodated by modern incremental engines.
4. Pragmatic Implications for Business Resiliency Strategy
A stable 2% change rate does not imply that data protection should be treated casually. Rather, it allows security teams to swap anxiety for architectural discipline:
By avoiding the re-ingestion of un-modified assets, backup windows remain exceptionally short, preventing network congestion and API throttling.
Because the underlying database maps small, iterative modifications cleanly, administrators gain fine-grained recovery control—allowing single-file restores to handle daily operational needs natively.
Understanding real data deltas eliminates fear-driven over-provisioning and complex pricing structures, keeping infrastructure spend completely predictable.
Ultimately, a highly resilient security posture values predictable telemetry over speculative chaos. In backup infrastructure, predictable numbers are the true benchmark of an elegant, reliable architecture.
About Keepit
At Keepit, we believe in a digital future where all software is delivered as a service. Keepit’s mission is to protect data in the cloud Keepit is a software company specializing in Cloud-to-Cloud data backup and recovery. Deriving from +20 year experience in building best-in-class data protection and hosting services, Keepit is pioneering the way to secure and protect cloud data at scale.
About Version 2 Limited
Version 2 Digital is one of the most dynamic IT companies in Asia. The company distributes a wide range of IT products across various areas including cyber security, cloud, data protection, end points, infrastructures, system monitoring, storage, networking, business productivity and communication products.
Through an extensive network of channels, point of sales, resellers, and partnership companies, Version 2 offers quality products and services which are highly acclaimed in the market. Its customers cover a wide spectrum which include Global 1000 enterprises, regional listed companies, different vertical industries, public utilities, Government, a vast number of successful SMEs, and consumers in various Asian cities.








