Skip to main content
search

Chemaxon Structure Preparation Toolkit

Reliable chemical structure representation through correction and canonicalization

Structure Preparation Toolkit

Chemical databases can get messy, making it difficult to use the data you have already gathered in chemistry research.

The Structure Preparation Toolkit allows you to validate, curate, and normalize molecules according to your business rules. This helps you prepare compounds for registration or docking, allowing you to build a unified database, and generate the relevant microspecies or isomers for your experiments more accurately based on high-quality data.

Structure Preparation Tools

Standardizer

Standardizer’s main purpose is to transform chemical structures into representations that obey your business rules to create a consistent chemical database. You can use it when registering new compounds, or when virtually screening structures. Here are a few highlights of 40 pre-defined standardizer actions:

  • Adding or removing explicit Hydrogen atoms
  • Neutralizing charged fragments or functional groups
  • Recognizing and converting legacy representations of functional groups (like aliases)
  • Removing certain fragments (like water and salt counterions)
  • 2D cleaning and expanding abbreviated groups
  • Unified representation of aromatic rings, tautomers and mesomers

Structure Checker

Structure Checker verifies the correctness of molecules. In case it finds an issue, structure checker can fix it automatically or flag it for manual correction. This tool allows you to filter out input errors and incorrect features when registering a new compound. Here are some highlights of 40 checkers:

  • Invalid bond length
  • Overlapping bonds or atoms
  • Molecule charges
  • Incorrect chiral flags
  • Invalid valences
  • OCR errors
  • Substructure checker that can be configured to transform a given substructure defined by a SMARTS string

Isomers

The ability to deal with molecular structures that have identical atomic composition, but different arrangement of bonds or spatial orientation is crucial in the identification, search and property calculation of compounds. Tautomerism, stereoisomerism and resonance are examples of isomerism.

Tautomerization can affect identification, searching and physico-chemical properties of molecules. Stereoisomers have the same physico-chemical properties (e.g.: melting point, solubility), but they differ in pharmacokinetic and pharmacodynamic behavior. Resonance describes the rearrangement of delocalized electrons in the molecule, which results in a set of contributing structures of the original structure.

The Isomer calculator can:

  • Generate different tautomer sets useful for searching and tautomer handling
  • Calculate tautomer distribution and major tautomer form in water
  • Generate tetrahedral and double-bond stereoisomers starting either from a 2D or 3D structure
  • Generate resonance structures

Protonation

Ionization has an essential impact on compound behavior in nearly all protic environments. On top of its application in general and analytical chemistry, relationships to drug pharmacokinetics and pharmacodynamics are well-established. Acidic and basic dissociation constants (pKa) and the microspecies distribution at different pH values are important to quantitatively describe ionization and comprehend physical and biological processes.

The Protonation calculator includes:

  • Highly-accurate calculation of pKa values along with pH dependent distribution plots of relevant microspecies in water – pKa documentation
  • Calculation of the major microspecies form at a given pH value – Major microspecies documentation
  • Isoelectric point calculation

Availability

The Toolkit bundles are built to easily integrate with most systems using a range of different APIs including Java, Python, Microservices and .NET.

Toolkit Offerings

Tools Discovery Toolkit Descriptor Generation Toolkit Structure Preparation Toolkit Structure Enumeration Toolkit Naming Toolkit
hERG Inhibition

×

×

×

Elemental Analysis

×

×

×

Partitioning

×

×

×

Solubility

×

×

×

NMR Predictor

×

×

×

Structural Calculations

×

×

×

Isomers

×

×

Protonation

×

×

Standardizer

×

×

×

Structure Checker

×

×

×

Reactor

×

×

×

Markush Enumeration

×

×

×

Chemical name and structure conversion

×

×

×

Workflow-Specific Toolkits

Our workflow-specific offerings are designed to serve your needs with a targeted subset of tools bundled together.

Discovery Toolkit

Read more

Descriptor Generation Toolkit

Read more

Structure Enumeration Toolkit

Read more

Naming Toolkit

Read more

With Discovery Toolkit, your data is secure

Certara holds ISO 27001 certification for Certara’s Information Security Management System (ISMS). We have implemented robust security controls, undergone rigorous risk assessments, and continuously strive for improvement. Discovery Toolkit ensures full compliance with global data protection standards, offering peace of mind for sensitive analysis.

Learn more in our Trust Center

Why Certara

Certara delivers leading-edge solutions that empower scientists to make data-driven decisions, accelerating drug discovery timelines and improving outcomes.

Early drug discovery is a core area of expertise at Certara, supported by a team of highly experienced and renowned experts in the field.
The dynamic community of users from drug discovery empowers us to continuously refine our solutions to meet the ever-evolving demands of innovative research.

Using Chemaxon’s suite of tools as core components of our chemistry workflows, they save us time while supporting strong, science-driven decisions. What sets them apart is that their team is professional, approachable, and genuinely invested in helping us succeed.

Xin Zhang
Sr. Director, Cheminformatics and AI/ML
Cellarity

Book a Demo

Discovery Toolkit contains every cheminformatics toolkit we have to offer.

For more specific needs, we offer workflow-specific bundles:

Descriptor Generation Toolkit
Structure Preparation Toolkit
Structure Enumeration Toolkit
Naming Toolkit

Reach out to us

– Leave your contact information
– Summarize what you are looking for
– Our colleagues will get back to you soon