I've had this idea of an ideal research agent for at least 4 months now, but a ton has gotten in the way: (a) a new baby, (b) Transpose Health work, and (c) that in the world of computer science, I (half-jokingly) sit in the "valley of despair" of the Dunning-Kruger Effect: I am competent enough to write my own pretty-good code, aware enough to know that I am slower and less efficient than full-time developers, but stubbornly proud enough to not have adopted AI-driven coding.
Seeing my ideas flounder for almost half a year, undeveloped, made me rethink this misplaced pride. (That, and my core competencies are thinking of new ideas to try out (not necessarily writing the code for it!); hearing Andrew Ng say that any good coder is using AI; wanting to test out "vibe-coding" to build out apps; real-life, time-saving use cases in my time on the AI/Q Fund at Connecticut Innovations; and Google releasing their Antigravity IDE.)
I include a journal of my progress below, some pre-coding thoughts, and a summary of the Antigravity pilot.
Some side thoughts:
- Why release an IDE now? Software development is one of the few areas that has shown very clear acceptance of AI agents (see: Andrew Ng on the 20VC podcast above), and with a great IDE, Google can both (a) win market share quickly, (b) collect more data on software code, and (c) create lock-in for its Gemini LLM (in a market where I believe LLMs will become a commodity). The future of software development is slowly becoming clearer: human developers, enhanced by AI. If we compare software development to building a house, AI agents today are high-quality pre-fab cabinets. It saves the builder time without sacrificing quality, but doesn't replace the need to have a builder to begin with.
- AI is lowering the barrier to entry for creating high-quality apps. I guess this shouldn't be surprising. In the internet age, drag-and-drop sites like Wix and Squarespace let you build no-code websites, and so no-code automation and AI-driven app development shouldn't be a surprise.
- There are so many frameworks out there for everything. Part of the challenge of building anything today is that there are so, so many libraries (in Python) that do variants of the same thing, and it's hard to know which to use. Each library requires its own small learning curve, and each never works exactly like I want it to. The house-building comparison: the aisles and aisles of hammers, table saws, and lumber you find at Home Depot.
- Compliance/security still remains a very high barrier to entry. I've been investigating building a health records SaaS platform, which would require HIPAA-compliance and SOC-2 Type II compliance. I'd imagine Antigravity can build the software to meet these requirements ... but would it be able to help direct me with the non-software requirements for these certs? That'd be incredible.
- Overloaded servers. I had to keep restarting the agent to keep coding. Everyone must be piloting this at the same time!
- Great product/market fit for someone like me. I have enough coding background to be picky about how I like things coded/abstracted, but not enough to be efficient at coding it myself. I can be a picky a
- The value of investment memos is thinking and unearthing risks. This is at the crux of what I'm trying to build here: not an investment memo generator (a passive tool), but ideally a research co-pilot, automating the dull stuff and pushing you to think more.
- Communication is key. AI-assisted coding shifts the art of coding to the art of clearly communicating specs and business needs -- which definitely requires a different skill set. (Oftentimes, when I code, I'll "wander" and see what's possible, reshaping the specs as I go.)
- AI coding (like Antigravity) foretells the future of work elsewhere. I was surprised that in some ways, AI coding was more like a dialogue between me and the AI coder: me telling it what to do, and it responding back with its thought process. (A more experienced AI coder may be better equipped to tel lthe AI agent exactly what to do.) It almost feels like the type of product that I'm trying to build: the core workflow is the dialogue between human and AI; the memo is just a happy by-product.
- A few distinct tasks ("agents") are needed for a good investment memo. In my initial app, I broadly lumped all tasks together. However, I think there's a few distinct task types that should be abstracted out:
- Research analyst - the analyst "read" a few different articles and websites, pick out pertinent quotes, presents them in the report, and synthesize them together. A virtual hunter/gatherer.
- Quantitative analyst - an agent that can work through a data room spreadsheet and create projections, calculate IRR, etc. Would also be great to do things like calculate betas for public-market comps, P/E ratios for comps, etc.
- Guesstimator analyst - a consultant-y analyst who can build up TAMs from the bottom-up and top-down, based on clearly stated assumptions.
- Devils advocate analyst - comes up with counterfactuals (e.g. the bear case in an industry analysis), helping to uncover risks and good questions to ask companies/founders.
- Legal analyst? - at some point, you probably want an agent who can see how things compare to a standard corpus (e.g. how do the deal terms presented match against the terms we normally agree to)
- Would be nice to separate out AI and personal/human thought. In a perfect next iteration, it'd be nice to clearly separate what's AI-driven content from my human personal thoughts and opinions. I think this will be critical to show where a single person is still adding value.