Improve your drug candidates’ success rate by productizing and harmonizing your clinical data.

Harmonize your clinical data using data mesh
Optimize the success of your drug candidates. Tag.bio’s data mesh harmonizes your clinical data sources (e.g. SDTM, molecular assays) to streamline your exploratory analysis and reporting process.


What is a data mesh? And how does it fit in pharma? A data mesh presents a paradigm shift to consider data as a product. When you turn your clinical trials and omics data from each stage of the drug development process into readily analyzable data products, you can extract more value at a significantly faster rate.
Clinical trials data products can connect and enrich one another using a standard communication mechanism, forming a data mesh of information that you can instantly call upon as you move your drug candidates through the pipeline. Set your drug candidates up for success from day one by productizing your data.
Here are some examples of how you can use data mesh throughout your drug development lifecycle:
- Seamlessly integrate both historical and emerging data, such as biomarkers and RWD, to optimize current and future clinical trials
- Instantly compare your proprietary data against public datasets so that you can identify scientific trends and correlations across multiple studies
- Quickly perform exploratory and confirmatory analysis to reduce trial failures and improve clinical development and operations
How Tag.bio is used
Tag.bio helps you overcome data challenges across your drug development process.

Basic Research
To identify novel targets, pathways, biomarkers, and signatures.
A Cell Therapy Biotech is deploying an array of proprietary & public-domain data products — enabling users to investigate & discover gene expression markers with respect to cell types.

Preclinical
To evaluate potential success of a treatment, using data from cell lines and model organisms.
A Top 30 Nonprofit Biomedical Research Institution is analyzing 1000s of flow cytometry samples from various immunocompromised mouse strains with xenografts from human donors.

Clinical Trials
To stratify heterogeneous patient populations and develop diagnostics for stratification, toxicity, and response to treatment.
A Top 5 Global Pharma is working towards comprehensive harmonization of all past & future trials — starting with 10 clinical trials in translational oncology.

Regulatory Review
To produce quality controlled reports for reviews.
A Top 5 Global Pharma is leveraging data traceability and analysis reproducibility capabilities to produce quality control outputs for internal reports and external publications.

Real World evidence
To connect clinical endpoint biomarkers with real world data.
A Digital Health Company is generating evidence to show that their health app produces better outcomes and lower claim costs for their corporate customers.

Data mesh in a box
Tag.bio offers an out-of-the-box solution to help you accelerate your data mesh implementation. Talk to our experts to learn more about our offerings!
Data products for a streamlined analysis process
When data is treated as a product, the analysis process can be streamlined across the organization.
Before data products

With the status quo, an analysis process could easily take up 2 months. Here’s an example of a scenario:
There’s a researcher, or an executive, who wants to ask a question, so they approach a data scientist for help. The data scientist then requests help from data engineers to find the correct datasets and piece them together. After that work is done, the data scientist then performs the ad-hoc analyses to answer the original question. Finally, all this work results in one-off projects that cannot be reused in the future.
After data products

Now, if you productize your data, you can dramatically reduce the analysis turnaround time – from 2 months to 2 minutes.
This is possible because Tag.bio data products are built to be useful – enabling researchers to instantly ask and answer their own questions, and enabling data scientists to integrate controlled, high quality data into their algorithms. The data products are owned and built by cross-functional data teams which ensures the quality and reliability of the product. With this approach, researchers can get answers to complex questions in minutes and data scientists can support multiple analysis projects at the same time.
A solid foundation to drive clinical success
The data product is the fundamental building block of a data mesh architecture. Tag.bio applies product thinking and domain driven design to support useful clinical trials data products that are FAIR (findable, accessible, interoperable, reusable).
Cross-Functional Data Team
Each data product is owned by a cross-functional data team. This approach ensures the quality and reliability of the data products. The team typically includes three roles: data engineer, data scientist, and researcher (i.e. domain expert).
Data Engineers
The data engineers map and model data within each data product to make the process of data access and app building streamlined and flexible for data scientists.
Let’s go through a simple scenario: You want to standardize the use of the concept “Immune Subtype”. There’s a file with a table named “mmc2.txt.gz” that represents a table of data. Within this table, there’s a column named “Immune Subtype” with values of “C1, C2, C3” and so on. But many people don’t always know what those values mean. With Tag.bio’s data product mapping process, however, the data engineers can standardize the process and help other users the following ways:
- For data scientists, they’ll get an easy way to reference the column as “Immune Subtype” instead of constantly typing “select [Immune Subtype] from mmc2“. Additionally, they’ll be able to use the readable reference name everywhere as they build their analysis apps
- For researchers, they’ll get a human-readable version, such as “C2 – Interferon Gamma” instead of merely “C2“
And when there’s an updated version of the data, the data engineer doesn’t need to re-map the data because they had already set a good mapping foundation from the beginning.
Data Scientists
Using Tag.bio’s SDK, data scientists and bioinformaticians can build point-and-click analysis apps for data products, integrating their R, Python, ML/AI code as business logic for algorithms and visualization.
For example, one data scientist can build an app which performs unsupervised clustering and another data scientist can build an app producing an R markdown report using the same data that has been mapped into the data product. Even though these are two different apps, they still use the same data model and controlled vocabularies thanks to the robust data mapping.
This approach enables the data scientists to build apps quickly without wasting time repetitively accessing, loading, cleaning, and modeling data. Additionally, researchers can use different apps to quickly analyze and cross-compare their findings because all the apps within a data product share consistent data and terminologies.
Researchers
The researchers within a data team help the data engineers with the data mapping process by aiding them in identifying and standardizing data vocabularies that will be understood by other researchers. They also assist the data scientists in designing apps to answer a wide range of business and clinical questions.
The researchers outside the data team would use the no-code, self-guided analysis apps within the data products to ask and answer their own questions. For example, a researcher can perform an unsupervised clustering analysis with confidence because the app was built with guidance from other researchers who are either data owners and/or subject matter experts.

Get A Demo
Leverage Tag.bio’s platform to streamline your data analysis process. Talk to our experts to see what productizing your clinical data could look like.
Promote cross-organizational collaboration
Access your data mesh of clinical data products and collaborate efficiently using Tag.bio’s two-sided analysis environment.

Analysis Platform
Tag.bio offers a customizable Analysis Platform to support your unique business and research needs. Leverage the enterprise features to drive collaboration and innovation.
For researchers
- Ask and answer your research questions with confidence – by using self-guided, no-code analysis apps within each data products to perform advanced analytics, such as clustering analysis and generating R reports
- Promptly identify, store, and share your useful data artifacts (UDATs), such as biomarker and target candidates, with your fellow researchers and collaborators
For data scientists and bioinformaticians
- Demonstrate the value of your data science in clinical research – by making your work accessible to researchers and other domain experts across the organization
- Save hours from performing one-off analyses – by repurposing your work as reusable analysis apps across multiple data products
For leadership
- Promote a consistent view of reliable, harmonized research data – by providing a platform that presents a single source of truth
- Boost efficiency and drive innovation – by streamlining the data analysis process and making data science accessible across your organization
For strategic IT
- Simplify your data governance process as your research data grows – by streamlining your data management and access
- Stay compliant with regulatory and legal requirements without compromising data use
Developer Studio
To help you build high quality data products, Tag.bio offers a Developer Studio that uses a familiar, Jupyter notebook-based setting. Leverage the pre-built templates to streamline your data product creation.
For data scientists
- Access reliable and harmonized research data from data products for your R & Python algorithms and ML/AI models
- Collaborate more efficiently when your fellow data scientists and data engineers use the same tool to access data and build data products
For data engineers
- Deliver quality and harmonized research data to data scientists and promote data reusability – by standardizing how nomenclatures get mapped into the data products
- Advance data literacy when you use terminologies that researchers understand – by partnering with domain experts to standardize the terminologies being mapped into the data products
For quality controlled analysts
- Rapidly reproduce any reports and analyses to support regulatory submissions and publications – with instant access to all the versioned data, software packages, and analysis methods
- Streamline the auditing procedures when you gain transparency into the data mapping process and source code
For leadership
- Set a lasting foundation that promotes consistent quality controlled outputs as your research data grow and new data types emerge
- Significantly free up your resources time so that they can focus on what they do best – by streamlining and automating manual repetitive work
“There is no piece of software that does what you do. If there was we would be using it.”
– Data Science Department Director
@Top 5 Global Pharmaceuticals Company

Data Security in Clinical Research Environment
Tag.bio is hosted entirely within your secure network and/or your secure cloud. Source data storage and access is tightly controlled within your network, and the platform is compliant with standards such as GxP, GDPR and HIPAA.
Explore Resources For Life Sciences

Data Harmonization for Translational Research
A practical guide to help you resolve clinical trial data challenges using the data mesh approach.

Promote Transparency In The Data Analysis Process
