Tel Aviv-based startup Factify emerged from stealth today with a $73 million seed round for an ambitious, if quixotic, mission: to bring digital documents beyond the standard formats most businesses use (.PDF, .docx, and collaborative cloud files like Google Docs) and into the intelligence era.
For Matan Gavish, Factify's founder and CEO, this is not just a software upgrade; it's an inevitability he has been obsessed with for years.
"The PDF was developed when I was in elementary school," Gavish told VentureBeat. "The bedrock of the software ecosystem hasn't really evolved… someone has to redesign the digital document itself."
Gavish, a tenured professor of computer science and Stanford PhD, admits that his fixation on administrative file formats is an anomaly for someone with his credentials.
"It's a very uncool problem to be obsessed with," he says. "Given the fact that my academic background is AI and machine learning, my mom wanted me to start an AI company because it's cool. I'm not sure why I'm obsessed and then possessed by documents."
But that obsession has now attracted a sizeable seed round led by Valley Capital Partners and backed by AI heavyweights like former Google AI chief John Giannandrea.
The bet is simple: the static rigidity of most digital files has limited their utility, and a better, more intelligent document, one that actually shares its edit history and ownership with users as intended, is not only possible but a multi-billion-dollar opportunity.
The history of digital documents
To understand why a seed round would balloon to $73 million, you have to understand the scale of the trap businesses are in. There are currently an estimated three trillion PDFs in circulation. "Some people see the PDF more than they see their kids," Gavish jokes.
The history of the digital document is not a linear progression in which one format replaces another. Instead, it's a story of "speciation," where different formats evolved to fill distinct ecological niches: creation, distribution, and collaboration.
The era of files: Microsoft Word (1980s–1990s)
Digital documents began as isolated artifacts. In the 1980s, the "document" was inextricably linked to the hardware that created it. A file created in WordPerfect on a DOS machine was effectively gibberish to a Macintosh user.
Microsoft Word, which traces its lineage to the pioneering WYSIWYG editors at Xerox PARC, changed this by leveraging the dominance of the Windows operating system. By the 1990s, the binary .doc format had become the default container for editable professional documents. However, these files were structurally complex "memory dumps" designed for the limited hardware of the time, often leading to corruption or to privacy leaks in which deleted text remained hidden in the file's binary data.
The era of digital 'stone': the PDF (1990s-2006)
The PDF did not originate as a tool for writing; it was a tool for viewing. In 1991, Adobe co-founder John Warnock penned the "Camelot Project" white paper, envisioning a "digital envelope" that would look identical on any display or printer.
Unlike Word files, which were malleable, PDFs were designed to be immutable. They used the PostScript imaging model to place characters at precise coordinates, ensuring visual fidelity. While adoption was initially slow, Adobe's 1994 decision to release the Acrobat Reader for free established PDF as the global standard for "digital concrete": the format of finality used for contracts, government forms, and archives.
The collaborative cloud docs era (2006-present)
In 2006, Google disrupted the model again by moving the document from the hard drive to the browser. Using "Operational Transformation" algorithms, Google Docs allowed multiple users to edit the same stream of text simultaneously.
This shifted the paradigm from "sending a file" to "sharing a link." Google Workspace now claims over 3 billion users (largely consumers and education), and it fundamentally changed how we work, turning documents into living, collaborative processes rather than static artifacts.
The status quo: fragmentation
Despite these advances, the business world remains fragmented. We draft in Google Docs (the "Digital Stream"), format in Word (the "Digital Clay"), and sign in PDF (the "Digital Stone").
But this fragmentation has a cost. "The problem is not the document. It's everything around it," the company notes. "Once a PDF leaves your system, control is gone. Versions drift. Access is unclear. Nothing is visible."
Turning digital documents into intelligent infrastructure
Factify's wager is that in the age of AI, this fragmentation is no longer just annoying; it's a critical failure. AI models need structured, verifiable data to function.
When an AI "reads" a PDF, it is essentially guessing, using optical character recognition to scrape text from what is effectively a digital photo.
"What we're dealing with here is a megalomaniac vision, but it's at the same time probably something that's inevitable," Gavish says.
Factify's solution is to treat documents not as static files, but as intelligent infrastructure. In the "Factified" standard, a document carries its own brain. It possesses a unique identity, a live permission system, and an immutable audit log that travels with it.
"We wrote a new document format that supplants the PostScript," Gavish explains. "We created a new data layer that supports the document as a first-class citizen… and it's always available inside the organization and potentially outside."
This distinction, between a File and an API, is the core of the company's pitch:
- Files are liabilities: They accumulate, get lost, and can be stolen. "It goes back to a brick standing," Gavish says. "Files are liabilities, if anything, because they just accumulate there, you have to guard them."
- APIs are assets: A Factify document is an active object. You can ask it questions: "Who has seen you? When do you expire? Are you the most up-to-date version?"
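To make the "document as an API" idea concrete, here is a minimal Python sketch of an object that can answer those three questions. It is purely illustrative: the class `SmartDocument`, its fields, and its methods are hypothetical names invented for this example, and nothing here reflects Factify's actual format or API, which the company has not described at this level of detail. The audit log is modeled as a simple hash chain, one common way to make tampering detectable.

```python
import hashlib
import json
import uuid
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class SmartDocument:
    """Hypothetical 'active' document: identity, permissions, audit log."""
    content: str
    version: int = 1
    expires_at: Optional[datetime] = None
    doc_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    allowed_readers: set = field(default_factory=set)
    _log: list = field(default_factory=list)

    def _append(self, actor: str, action: str) -> None:
        # Each entry stores the previous entry's hash, forming a chain.
        entry = {
            "actor": actor,
            "action": action,
            "at": datetime.now(timezone.utc).isoformat(),
            "prev": self._log[-1]["hash"] if self._log else "",
        }
        payload = json.dumps(entry, sort_keys=True).encode()
        entry["hash"] = hashlib.sha256(payload).hexdigest()
        self._log.append(entry)

    def read(self, actor: str) -> str:
        # Permission is checked on every access, and every attempt is logged.
        if actor not in self.allowed_readers:
            self._append(actor, "denied")
            raise PermissionError(f"{actor} may not read {self.doc_id}")
        self._append(actor, "read")
        return self.content

    def who_has_seen(self) -> list:
        """'Who has seen you?'"""
        return sorted({e["actor"] for e in self._log if e["action"] == "read"})

    def is_expired(self, now: Optional[datetime] = None) -> bool:
        """'When do you expire?' (answered here as a yes/no check)."""
        now = now or datetime.now(timezone.utc)
        return self.expires_at is not None and now >= self.expires_at

    def is_latest(self, latest_version: int) -> bool:
        """'Are you the most up-to-date version?'"""
        return self.version == latest_version

    def log_is_intact(self) -> bool:
        # Recompute every hash; any edited or reordered entry breaks the chain.
        prev = ""
        for e in self._log:
            body = {k: v for k, v in e.items() if k != "hash"}
            digest = hashlib.sha256(
                json.dumps(body, sort_keys=True).encode()
            ).hexdigest()
            if e["prev"] != prev or digest != e["hash"]:
                return False
            prev = e["hash"]
        return True
```

In this sketch, a caller would create a document with a set of permitted readers, call `read()` on behalf of a user, and later ask `who_has_seen()` or `log_is_intact()`. A static file, by contrast, has no way to record or refuse an access once it has been copied.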
'People don't change', but formats do
History is littered with formats that tried to replace the PDF (like Microsoft's XPS). They failed because they demanded too much behavioral change from users. Gavish is keenly aware of this trap.
"When I talk to enterprise software entrepreneurs, I tell them the two laws to know about starting a company in enterprise software is that people don't care, and no one changes," he says.
To skirt this, Factify has built deep backwards compatibility. A Factified document can look exactly like a PDF, complete with page breaks and margins. Users don't need to learn a new interface to get value; they just need to solve a specific pain point, like an executive who wants to ensure an investment memo can't be forwarded.
"All they have to tell their team is, 'Dear Chief of Staff, employment agreements and investment memoranda… are going to be Factified. The rest carry on,'" Gavish says. "They see immediate benefit… but then they discover that they've crossed the Rubicon."
What's next for Factify?
The capital from this round will be used to deepen the platform's core engineering, which Gavish describes as a "heavy engineering lift" requiring the company to rebuild the document format, data layer, and application layer from scratch. The company is also establishing a major operational hub in Pittsburgh to support its U.S. expansion.
Ultimately, Factify isn't trying to build another collaboration tool like Google Docs. It is trying to build the immutable record of the future: the standard for "truth" in a digital world.
"The PDF… became a standard which means I cannot file my taxes using any other format. This is how victory looks like," Gavish says. "We are creating a document standard that isn't specific for health care or for insurance, but is just document as such."
For the three trillion static files currently sitting in cloud storage, the writing may finally be on the wall.

