How to Create Good Research-driven Software Products

By David Magerman, PhD

Good research requires well-tested software that validates the accuracy of the research results. Good products require a much higher level of testing, one that not only ensures the accuracy of the software, but also maximizes uptime, eliminates failures, maintains efficiency and performance, and protects customers from experiencing any negative impact from software changes. This seems to imply that there should be different software development environments, and different software testing protocols, for research software and production software. The reality is that good research-driven products are best created by imposing production-quality software development practices on all levels of software development, including research, data engineering, tool building, and IT infrastructure.

Bad Research

I have a painful admission to make. When I was an academic, I was a really bad researcher. The experiments I performed used poorly tested software developed without any regression testing protocols. Even the software that evaluated the test results was unverified. I produced influential papers that had an impact on my academic field of study, and some of those ideas have been validated by subsequent research. But I have no way of knowing, at least from the research results I based my papers on, that my results proved the efficacy of the scientific ideas I was promoting.

When I left academia and went to work at Renaissance Technologies, where my software and research results were actually deployed in real-world products, with real money wagered on their accuracy, I learned valuable lessons about how to develop research-driven products. The most significant lesson was that unless your research code is actually used in live production, you will struggle to maintain the integrity of that research in production. And the best way to write production-ready research code is to test every change to research code as though it were going to be used in production. When you do that, you not only increase the efficiency of getting new research ideas into production, but also increase the likelihood that the research process will yield the best results.

When Research Programming and Production Programming are Different

When you are writing research code for the purposes of writing papers, the research results ARE the product. Once you are done writing the paper, the software gets thrown away, at least until it is used again for follow-up research. Aside from intellectual integrity and sincerely advancing the cause of scientific research, there’s no real incentive to ensure the validity and accuracy of your software.

When your research is intended to be used to deliver products to customers, in some form or another, the incentive system is completely different. (It shouldn’t be, of course, but it is.) Good research results that don’t actually lead to desired performance are worse than worthless. They are actually damaging, because they mislead management, misdirect resources, and ultimately lead to worsening the product, not improving it.

So, what can go wrong with applying research-style software development methodologies in a research-driven product environment? Obviously, inaccurate software can produce misleading results, which can lead to deploying invalid research ideas in production environments. But even good research can harm production when the research software needs to be modified, upgraded, or even completely rewritten (which is typical) in order to get it into production. If researchers don't write code up to production quality standards, inevitably software developers who don't deeply understand the complexities of the research ideas will be tasked with reimplementing the research software, and the substance of the research can be lost in translation. And even if the content of the research is translated faithfully to production code, the delays introduced by the lengthy process of reimplementing that software can leave the research ideas with diminished value to the customer, if competitors can add similar features more efficiently.

When Research Programming and Production Programming are the Same

A great way to avoid these pitfalls is to treat research code as though it is being used in production.

If you have coding standards for production software, impose those standards on researchers from day one of research software development. If you want production code to be written, documented, and structured in specific ways, those protocols are just as valid for research, for all of the reasons mentioned above. Unless you are a programming team of one, all of the reasons to have these standards in production apply equally, if not more acutely, in a research environment.
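To make this concrete, here is one sketch of what "the same standards for research and production" can look like in practice: a single CI quality gate that runs over both trees. (The tool names, directory layout, and CI syntax below are illustrative assumptions, not anything prescribed in this article.)

```yaml
# Hypothetical CI job: one lint standard and one regression protocol,
# applied identically to research and production code.
jobs:
  quality-gate:
    steps:
      - run: ruff check research/ production/   # same style rules everywhere
      - run: pytest research/ production/       # same test suite everywhere
```

The point is not the specific tools but the symmetry: researchers and production developers pass through one gate, so code never has to be "cleaned up" on its way to production.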

If you have regression testing protocols to verify the integrity of a new batch of source code in the end-to-end production environment, use those regression testing protocols on research code. If you wait for production testing to detect inaccurate or error-prone code, you are likely to miss out on valuable research ideas and possibly promote ineffective ones. If you produce inefficient code that wouldn't be tolerated in production, you will slow down research in ways that limit research progress. And, just as bug fixes in production environments are meaningful for explaining customer experiences, recognizing the impact of bug fixes on research results, past, present, and future, is important for researchers. Communicating how code changes impact results is all the more important in a research environment.
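A minimal sketch of what regression testing research code can look like, assuming a baseline-file convention (the function names, metrics, and tolerance here are hypothetical, invented purely for illustration): pin the current results to a file under version control, and fail loudly whenever a code change shifts them.

```python
import json
import math
from pathlib import Path

# Hypothetical research function under test; stands in for any model or metric code.
def compute_metrics(data):
    n = len(data)
    mean = sum(data) / n
    variance = sum((x - mean) ** 2 for x in data) / n
    return {"mean": mean, "variance": variance}

def check_against_baseline(data, baseline_path, tol=1e-9):
    """Compare current results to a stored baseline; fail loudly on drift.

    Creates the baseline on first run. Any later change to compute_metrics
    that moves results beyond `tol` raises immediately, just as a production
    regression suite would.
    """
    results = compute_metrics(data)
    baseline = Path(baseline_path)
    if not baseline.exists():
        baseline.write_text(json.dumps(results, indent=2))
        return "baseline created"
    expected = json.loads(baseline.read_text())
    for key, value in expected.items():
        if not math.isclose(results[key], value, rel_tol=tol, abs_tol=tol):
            raise AssertionError(f"{key}: expected {value}, got {results[key]}")
    return "ok"
```

A deliberate failure here is informative rather than annoying: when a bug fix changes the baseline, the diff to the baseline file documents exactly how past results were affected, which is the communication this paragraph argues for.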

One Process to Rule Them All

The best way to avoid all of the pitfalls discussed above is to stop differentiating between research software and production software.

If researchers are coached to understand that their software will be used in production WITHOUT MODIFICATION, then they will be forced to up their programming game to live up to that standard. This may be difficult culturally, at least initially, but the value to the overall product once it is achieved is worth the effort and pain. If software developers at all levels understand that their code needs to remain compatible with research, there will be little need for translating changes made in production or in Q/A testing back to the research environment.
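Structurally, "production without modification" usually means the research logic lives in one importable module that both sides call. Here is a toy sketch of that shape; every name, weight, and function below is hypothetical, chosen only to illustrate the single-implementation pattern.

```python
# A single shared module (all names hypothetical) holding the research logic.
WEIGHTS = [0.5, 0.3, 0.2]

def score(features):
    """Toy scoring rule; the one implementation both sides import."""
    return sum(w * x for w, x in zip(WEIGHTS, features))

# Research side: backtests exercise exactly the code the product ships.
def backtest(history):
    return [score(f) for f in history]

# Production side: the service entry point calls the same function,
# so there is no reimplementation and nothing lost in translation.
def handle_request(features):
    return {"score": score(features)}
```

Because `backtest` and `handle_request` share one `score`, a bug fix or improvement made in either environment is immediately visible in the other, with no translation step to drift out of sync.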

If there is one set of code maintained by one software development project management tool, then team members in every group have complete observability into all aspects of research, development, and production, from the earliest stages of research, through production deployment, and back again as production bug fixes are pushed back to research.

Over my ten years of managing production (trading) at Renaissance Technologies, as well as my twenty-year career there doing a combination of research programming, production programming, and tool building, achieving this goal of having One Process To Rule Them All was the holy grail (to mix mythologies) of software development management. Once we learned this important lesson, and consolidated our development process under one process with one set of standards, we could finally believe with near certainty that our software worked as we expected it to, that our research results were valid and could be achieved in production, and that any programmer could introduce code into the system without risking the stability of the system.

Databand ™ as a Software Development Observability Solution

Having achieved this goal with internally developed software at Renaissance, when I started investing in data-science oriented software startups, I was on the lookout for a product that implemented these ideas in a consumer-quality product. When I met the founding team at Databand, I realized that they understood the lesson I had learned and were building a software solution that could implement these capabilities for the masses. Their software system allows all levels of employees, from team members up to senior managers, to have visibility (as appropriate) into all phases of the software development and project management process. Having this level of observability in one coherent system gives managers the ability to enforce project management, software development, and team interaction practices throughout an enterprise. They can choose to relax standards or heighten constraints depending on the cost-benefit trade-off of different practices. But having all of these processes under the umbrella of one tool gives maximum visibility into all aspects of the operations of a company. As data-science research becomes more mission critical to products and customer experience, this observability is becoming indispensable for management teams. Those who are slow to recognize this necessity risk falling behind competitors who use this awareness to deliver research-driven products more efficiently, more effectively, and more consistently.

Databand is currently focused on applying these principles to data engineering and data pipeline management, which is an area where this observability is most obviously and directly impactful to an enterprise’s deployment of data science in a hybrid research/production environment. Databand’s software solution is capable of addressing the larger problem of managing all aspects of project management for data science-driven products. When the management world is ready to embrace the principles espoused in this article, Databand will be there with a product ready to meet that demand.

