Legal AI Opportunities and Challenges Whitepaper
Legal AI Opportunities and Challenges Whitepaper
1 Introduction
With a global market size of $1 Trillion, the legal industry has received significant attention
from new startup founders as an area to build in. At face value, this interest makes sense.
Law is largely grounded in text-based transactions, which appear ideal for automation and
processing using large language models (LLMs). These LLMs offer the opportunity to rapidly
accelerate the rate at which humans can understand and process large documents – similar
to those that lawyers work with--offering avenues for different startups to interject and
innovate at the cusp of the generative AI (gen AI) wave. Gen AI seems particularly useful for
making legal work more efficient, since, despite being some of the highest-paid
professionals, lawyers spend a disproportionate amount of time on repetitive tasks such as
basic research, document review, and case preparation. This insight is affirmed by the
amount of funding directed towards AI companies in 2023 and 2024. For example, in The
Generative AI Legal Landscape 2024, Ma et al. found that “funding for LegalTech startups
totaled ~700M between Jan 2023 - Feb 2024.”
This interest in legal AI is further bolstered by the rapid reduction in per token LLM cost,
which has roughly trended along a 10x annual reduction per year as we can see in the figure
below, and expanding context window sizes, which have seen GPT, Claude, and Gemini
windows rise to 128K, 200K, and 1M token sizes, respectively. These changes are enabling
lawyers to process hundred-page briefs, the typical length of legal documents, at a fraction
of the costs of actual lawyers doing the work, resulting in new levels of savings.
The scale of the legal workload further underscores the urgency of innovation: 65 million
cases processed annually and 1.6 million patent filings handled by 450,000 U.S. law
firms. As the complexity of cases grows and client expectations rise, AI offers an
unprecedented opportunity to reshape legal workflows by improving efficiency and reducing
costs in most areas of the legal stack. This raises an important question: given the seemingly
obvious synergies between law and LLMs, why have there not been many breakthroughs in
the legal AI space, with most companies still relying on antiquated workflows to achieve
their daily tasks?
1
Figure 1: Guido Appenzeller, “Welcome to LLMflation – LLM inference cost is going down fast”, Andreessen
Horowitz, available at: https://a16z.com/llmflation-llm-inference-cost/
In this white paper, we hope to provide more context to this fundamental question. First, we
cover some market headwinds that make legal a difficult area for incoming startup founders
to enter. Next, we discuss traps of false traction that founders should be made aware of.
Finally, we end with some potentially interesting directions for future exploration.
2
Legal Tech companies view data as their highest demonstrable value. While publicly
available datasets, such as patent data and/or other court data made available via Free Law
Project, provide a starting point, proprietary data is essential for building tools with real
competitive value. This is because unlike data for standard writing like/coding tasks, for
which a great deal of public data exists, legal data is tightly controlled under client-attorney
privilege and within companies as their secret sauce, preventing LLMs from being geared
towards legal writing. Moreover, the data that is released and published often only contains
the final product, missing the intermediary stages in the pipeline, which are crucial for
building reasoning capabilities within these models.
The best way founders can attempt to break this data wall is via well-structured pilots and
design partners. However, even data from generous and willing firms can be hard to directly
utilize. For instance, at CodeX, we partner with law firms to experiment and build legal AI
tools, such as an automated redlining training platform. However, to protect client
confidentiality, all shared data must necessarily be passed through redaction software,
which generates PDFs with blacked out regions. Unfortunately, this requires leveraging
complicated PDF parsers. There are also modality differences between taking these multi-
page PDFs, mapping them into usable insights, and then applying these insights to MS Word
documents. This often means that the best teams suited for these tasks have strong parsing
and UI/UX backgrounds, rather than AI ones. However, these UI/UX backgrounds are quite
different from the predominantly machine learning focused backgrounds of founders who
are looking to penetrate legal AI.
In the late 19th century, the notion of a “large law firm” was merely defined as a firm
comprised of four or more lawyers. These firms responded to the demands of rapidly
growing businesses that required increasingly bespoke, specialized transaction work. While
there were only 15 firms recognized as a “large law firm” in 1872, these numbers
exponentially soared such that, by 1924, there were over 1000 firms classified under this
definition.
Nevertheless, what would come to be interpreted as the modern law firm, otherwise known
as the “Cravath System,” emerged in the early twentieth century. The Cravath System was
3
a pyramidic organizational structure built with the intention of profit maximization. There
were fundamentally only two “classes” of professionals: partners and associates. The goal
was to incentivize associates to eventually be admitted as partners to the firm on a
meritocratic basis. With law firms structured as partnerships, there is an implicit signal of
belonging; that the duration of your tenure as an associate is conceivably a probationary
period until you are joined in the ranks of your mentors. Furthermore, as partners are largely
chosen within the firm, apprenticeship and culture hail from a rather insular heritage. The
industry became a mirror to the fundamental “priesthood” of the legal profession.
Accordingly, following the footsteps of rainmakers and renowned attorneys was primordial
to success in the business. Entering the legal sector thereby required traineeship, directly
integrated into a Darwinian model of operation. In effect, work product creation in the
profession and skill development and training are intimately interrelated.
In the 1960s, law firms began to develop “deep and enduring relationships with corporate
clients”, pivoting and expanding the role of the legal professional from purely adversarial to
advisory. These close institutional relationships became foundational to a law firm’s growth
and development. Trust became the secret ingredient to the success of the business,
enabling information asymmetry between corporate clients and lawyers. Put differently,
reputation became a key driver behind business and operational strategy of a practice. In
effect, these factors encourage a business model centered on the individual. The law firm
became perceivably structured as a collective, rather than a corporation.
These structures persisted in the years that followed. Even amidst changes, including the
rise of in-house legal departments and alternative legal service providers, the law firm
remained focused on the individual practitioner and their proteges.
More importantly, unlike other domain specific industries, relational and reputational
qualities of the legal practice, coupled with information asymmetry, replaced the need for
the development of explicit quantifiable metrics to evaluate the work of legal professionals.
Therefore, time and billing on time spent became a proxy for value, as value generated from
the work could not be measured concretely.
The billable hour model, where revenue for firms is directly correlated with the number of
hours spent on the task, is a cornerstone of the legal industry. This creates a structural
disincentive for law firms to adopt efficiency-enhancing technologies like AI, as reducing
hours worked translates to reduced revenue. While some firms are exploring alternatives,
such as fixed-fee pricing, the transition is gradual and fraught with challenges.
The legal profession has historically emphasized long hours as a marker of diligence and
expertise. As such, many clients, particularly large corporations, have accepted billable
hours as the standard pricing model. While some small-medium businesses and/or more
agile clients have begun to push back against the inefficiencies in the pricing model, this
implicit acceptance reduces the urgency for firms to innovate. As a result, while firms are
4
willing to entertain AI integration as conceivably important, it remains largely theoretically
rather than an immediate demand for revenue generation via time and billing.
That said, this headwind is undergoing a renewed cycle of tension, potentially enabling an
opportunity for founders who are able to build distributional moats. Often to reduce friction
from the industry, recommendations for founders often dwell in the monotonous, low-effort
tasks done by junior associates in the firm. We have observed from certain future-forward
firms Ashurst, Baker McKenzie, and Wilson Sonsini that these areas of higher inefficiency
have made the relative lift gained from using AI more significant to clients.
Additionally, LLMs are bad pointwise editors. While lawyers can be surgical with their edits
on large documents, requiring on the removal of minimal information to achieve their goals,
LLMs regenerate the entire document. This becomes problematic for large 100+ page files,
since this generation process enables hallucinations, as token probabilities diverge. Our
work explored workarounds to this using an agentic approach, where we first identify
relevant sections and only have the LLM generate new content for the section to be
inserted/removed/edited. This tends to improve results; however, it reduces global
reasoning accuracy. Further research on pointwise edits could be helpful for improving the
efficacy of legal AI tools.
5
3.1 Fragmented Market:
To demonstrate deep ROI, founders often have to focus on a specific legal workflow.
However, some back of the envelope math shows that these contract sizes are small. Even
generously assuming 100 seats added per firm and $100/month seats, yearly revenue per
contract would only be around $120K. Even expanding to 1000 seats would restrict revenue
to $1.2M/yr, making the upfront sales costs hard to justify. Further complicating the matter,
the legal industry is extremely fragmented. While there are exceptions, generally large firms
tend to handle complex, high-value cases requiring specialized solutions, while smaller
firms often perform broader, resource-constrained work. Targeting the larger firms,
therefore, pushes founders to develop more bespoke products that do not easily scale from
firm to firm. These firm-to-firm differences, in turn, slow fast distribution as multiple product
lines need to be maintained – positioning a tech startup as pseudo consultancy.
Furthermore, data privacy issues mean that it is very difficult to have true network effects.
The products must be returned to every partner using their internal proprietary data.
6
4 Interesting Directions in Legal AI
Several promising areas of innovation have emerged to address the industry’s challenges
and unlock AI’s potential:
It is perhaps easier to see the utility of such personas within specific contexts. For example,
consider running simulations of personas of different stakeholders responding to changes
in policies or firm structure. These simulations can help us identify adverse effects of
changes in advance, saving wasted time and effort. Alternatively, these personas can be
used to educate junior attorneys faster by injecting additional context into redlines from
their partners using partner personas. These partner personas would draw on how partners
have historically redlined, drawing on similarities to previous examples to better equip
associates to learn from each redline.
7
4.3 Patent Automation
Another promising area for AI is within patent law, where the data is typically well-structured
and publicly available. This makes the data challenges that make building in other areas of
law so difficult much less of an issue, reducing entry barriers for newcomers to law. We
anticipate AI tools like semantic retrieval, graph-based retrieval, and agentic retrieval will
disrupt traditional workflows like prior art searches, invalidity analyses, and patentability
assessments. This is especially useful when traditional keyword searches fail due to minor
differences in the framing of search queries. With current alternatives involving humans
explicitly sifting through these documents to identify relevant infringements/invalidation
claims, automated search tools to do this faster and cheaper enable patent attorneys to
focus on the more interesting aspects of drafting patents or invalidating other patents,
instead of sifting through prior art. Additionally, given that many large firms explicitly ask
external search firms to find prior art on their behalf, selling these products doesn’t result in
explicit reduction in billable hours for law firms, increasing their openness to adopting them.
However, entering the patent space has its challenges as well. The market for patent-related
tools is becoming increasingly saturated, making differentiation difficult. This exposes
founders in the patent space to get commoditized unless they are able to find data or
distributional moats. Furthermore, while embeddings-based searches are optimized for
detecting macro level similarities, they tend to perform poorly when two patents are only
weakly related on the basis of a small feature. However, these similarities are important to
identify as they can be basis for an infringement. Moreover, default embeddings are trained
on a concept space that is quite different from the language that patents follow. This means
startups have to make significant upfront investments in training their embedding modules.
Additionally, there are some UI/UX processes to consider. To stand out, AI solutions must
not only improve efficiency but also offer clean visualizations of claim dependencies,
seamless integration with patent office databases, and help attorneys craft arguments
around prior art matches.
5 Conclusion
Synergies between Gen AI and law can drastically increase efficiencies within the legal
market. This insight coupled with law’s primarily text-based modality has attracted many
young founders to experiment with building companies in the space. However, the path
forward requires addressing gaps like proprietary data, the billable hour model, cultural
resistance, and integration hassles with current tool suites. Moreover, the ease of acquiring
design partners/pilots can be a red herring for founders unless they realize that pilots do not
necessarily equate to revenue and that selling to additional customers doesn’t necessarily
scale easily. There are some promising areas of legal AI that are still being developed and
that I’m personally passionate about, such as creating legal personas, building a framework
for computational law, and streamlining patent prosecution/litigation. Carefully navigating
these opportunities and challenges could help make the legal industry faster, more efficient,
and more accessible for all.