Table of Contents

OpenAD

See commands

General

openad Display the openad splash screen.

get status Display the currently selected workspace and toolkit.

display history Display the last 30 commands run in your current workspace.

clear sessions Clear any other sessions that may be running.


Workspaces

set workspace <workspace_name> Change the current workspace.

get workspace [ <workspace_name> ] Display details a workspace. When no workspace name is passed, details of your current workspace are displayed.

create workspace <workspace_name> [ description('<description>') on path '<path>' ] Create a new workspace with an optional description and path.

remove workspace <workspace_name> Remove a workspace from your registry. Note that this doesn’t remove the workspace’s directory.

list workspaces Lists all your workspaces.


Molecules Working Set

add mol|molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> [ as '<name>' ] [ basic ] [ force ] This command is how you add a molecule to a current working list of molecules in memory. When adding a molecule by name, this name will become the molecule’s identifying string.

It will take any molecules identifier from the following categories:
-smiles
-name or synonym
-smiles
-inchi
-inchikey
-cid

Options :
- as <name> : if the ` as ‘' ` not used the molecule the molecule identfier will be used for the molecules name. if the ` as '' ` not used the molecule the molecule identfier will be used for the molecules name.
You can set or override an name later for any molecule with the `rename molecule` command.
- ` basic ` Creates a molecule that does not have its properties and synonyms populated with pubchem data, this feature is only valid with a SMILES molecule identifier
- `force`: The `force` option allows you to ovveride the confirmation that you wish to add a molecule.

You can use the ‘mol’ shorthand instead of ‘molecule’.

You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string

If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.

Examples of how to add a molecule to your working list:

  • Add a molecule by name:
    add molecule aspirin
    or with single quotes
    ` display molecule ‘Aspirin 325 mg’ `

  • Add a molecule by name and force through the acknowledgement to add it:
    add molecule aspirin force

  • Add a molecule by SMILES:
    add molecule CC(=O)OC1=CC=CC=C1C(=O)O

  • Add a molecule by SMILES without populated pubchem properties:
    add molecule CC(=O)OC1=CC=CC=C1C(=O)O basic

  • Add a molecule by CID:
    add mol 2244

  • Add a molecule by InChI:
    add mol InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)
    or with single quotes
    add mol 'InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)'

  • Add a molecule by InChIKey:
    add mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N

  • Add a molecule by InChIKey nd set its name to “mymol”:
    add mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N as 'mymol'

  • Add a molecule by SMILES nd set its name to “mymol” and not prepopulate values from pubchem:
    add mol CC(=O)OC1=CC=CC=C1C(=O)O as 'mymol' basic

display sources <name> | <smiles> | <inchi> | <inchikey> | <cid> Display the sources of a molecule’s properties, attributing back to how they were calculated or sourced.

If the requested molecule exists in your current working list, that version will be used.

You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string

If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.

rename molecule <molecule_identifer_string> as <molecule_name> Rename a molecule in the current working list.

{MOL_SHORTHAND}

Example:
Let’s say you’ve added a molecule “CC(=O)OC1=CC=CC=C1C(=O)O” to your current working list of molecules, you can then rename it as such:
rename molecule CC(=O)OC1=CC=CC=C1C(=O)O as Aspirin

export molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> [ as file ] When run inside a jupyter lab notebook, this will return a dictionary of the molecule’s properties. When run from the command line, or when as file is set, the molecule will be saved to your workspace as a JSON file, named after the molecule’s identifying string.
If the molecule is in your current working list it will be retrieved from there, if the molecule is not there pubchem will be called to retrieve the molecule.

You can use the ‘mol’ shorthand instead of ‘molecule’.

If the requested molecule exists in your current working list, that version will be used.

If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.

Examples

  • export molecule aspirin
  • export molecule aspirin as file

remove mol|molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> [ force ] Remove a molecule from the current working list based on a given molecule identifier.

{MOL_SHORTHAND}

Examples:

  • Remove a molecule by name:
    remove molecule Aspirin

  • Remove a molecule by SMILES:
    remove molecule CC(=O)OC1=CC=CC=C1C(=O)O

  • Remove a molecule by InChIKey:
    remove mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N

  • Remove a molecule by InChI:
    remove mol InChI='1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)'

  • Remove a molecule by CID:
    remove mol 2244

list molecules List all molecules in the current working list.

show mols|molecules Display the current working list of molecules in the GUI.

enrich molecules with analysis This command Enriches every molecule in your current working list of molecules with the analysis results. This assumes that molecules in the current working list was the input or result for the analysis.

        This command currently covers results from the following Analysis commands: <br> 
        - RXN Toolkit `predict Reaction` <br> 
        - RXN Toolkit `predict retrosynthesis ` <br> 
        - DS4SD Toolkit `search for patents containing molecule` <br> 
        - DS4SD Toolkit `search for similiar molecules` <br> 

        See the Deep Search toolkit  and RXN toolkit help for further assistance on these commands.  <br><br>

clear analysis cache this command clears the cache of analysis results for your current workspace.

clear molecules This command clears the working list of molecules.

load molecules using file '<csv_or_sdf_filename>' [ merge with pubchem ] [append] This command Loads molecules from a CSV or SDF file into the molecule working list.

        Options: <br> 
         - you can add `merge with pubchem` to the command to fill in missing properties of the molecule. <br> 
         - you can append to the existing working set using the command <cmd> append </append>  <br><br>

load molecules using dataframe <dataframe> [ merge with pubchem ] [append]
This command Load molecules into the molecule working list from a dataframe.

        Options: <br> 
         - you can add `merge with pubchem` to the command to fill in missing properties of the molecule. NOTE:  this will slow the process down <br> 
         - you can append to the existing working set using the command <cmd> append </append>  <br><br>

export molecules [ as <csv_filename> ] This command exports the molecules in the current working list of molecules.

When run inside a Notebook, this will return a dataframe. When run from the command line, the molecules will be saved to your workspace as a CSV file named “result_#.csv”. The rows will be numbered with the highest number representing the latest molecule that was added.


Molecules

display molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> This command will display a molecule’s identifiers, propoerties, synonyms and any Analysis results it has been enriched with.
if the molecule exists in the current molecule workling list in memory the molecule will be retrieved from there if not pubchem will be checked to see if the molecule and its information is avialable there.

You can use the ‘mol’ shorthand instead of ‘molecule’.

If the requested molecule exists in your current working list, that version will be used.

You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string

If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.

Examples:

  • Display a molecule by name:
    display molecule Aspirin

  • Display a molecule by SMILES:
    display molecule CC(=O)OC1=CC=CC=C1C(=O)O

  • Display a molecule by InChI:
    display mol InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)

  • Display a molecule by InChIKey string:
    display mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N

  • Display a molecule by CID:
    display mol 2244

@(<name> | <smiles> | <inchi> | <inchikey> | <cid>)>><molecule_property_name> This command request the given property of a molecule, it will first try and retrieve the provided molecule from your working list of molecules, if it is not there it will will try and retrieve the molecule from pubchem.

The @ symbol should be followed by the molecule’s name, SMILES, InChI, InChIKey or CID, then after the >> include one of the properties mentioned below.

E.g. @aspirin>>xlogp

You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string

If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.

Examples of how to retrieve the value of a molecules property:

  • Obtain the molecular weight of the molecule known as Aspirin.
    @aspirin>>molecular_weight

  • Obtain the canonical smiles string for a molecule known as Aspirin.
    @aspirin>>canonical_smiles

  • Obtain a molecules xlogp value using a SMILES string.
    @CC(=O)OC1=CC=CC=C1C(=O)O>>xlogp

Available properties: atom_stereo_count, bond_stereo_count, canonical_smiles, charge, cid, complexity, conformer_count_3d, conformer_id_3d, conformer_model_rmsd_3d, conformer_rmsd_3d, coordinate_type, covalent_unit_count, defined_atom_stereo_count, defined_bond_stereo_count, effective_rotor_count_3d, exact_mass, feature_acceptor_count_3d, feature_anion_count_3d, feature_cation_count_3d, feature_count_3d, feature_donor_count_3d, feature_hydrophobe_count_3d, feature_ring_count_3d, h_bond_acceptor_count, h_bond_donor_count, heavy_atom_count, inchi, inchikey, isomeric_smiles, isotope_atom_count, iupac_name, mmff94_energy_3d, mmff94_partial_charges_3d, molecular_formula, molecular_weight, molecular_weight_exact, monoisotopic_mass, multipoles_3d, multipoles_3d, pharmacophore_features_3d, pharmacophore_features_3d, rotatable_bond_count, sol, sol_classification, tpsa, undefined_atom_stereo_count, undefined_bond_stereo_count, volume_3d, x_steric_quadrupole_3d, xlogp, y_steric_quadrupole_3d, z_steric_quadrupole_3d

show mol|molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> Inspect a molecule in the browser. If a molecule is not in the current Molecule Working set it will pull the result from Pubchem.

{MOL_SHORTHAND}

When you show a molecule by SMILES or InChI, we can display it immediately. When you show a molecule by name, InChIKey or PubChem CID, we need to first retrieve it from PubChem, which can take a few seconds.

Examples:

  • show mol aspirin
  • show mol CC(=O)OC1=CC=CC=C1C(=O)O
  • show mol InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)
  • show mol 2244


Molecule-sets

save molecule-set as <molecule_set_name> Save the current molecule workking list to a molecule-set in your workspace.

Example:
save molset as my_working_set

load molecule-set|molset <molecule-set_name> Loads a molecule-set from your workspace, and replaces your current list of molecules with the molecules from the given molecule-set.
Example:
load molset my_working_set

merge molecule-set|molset <molecule-set_name> [merge only] [append only] This command merges a molecule-set from your workspace into cour current working list of molecules in memory, and updates properties/Analysis in existing molecules or appends new molecules to the working list.

Options:
- ` merge only Only merges with existing molecules in list <br> - append only Only append molecules not in list <br> merge molset my_working_set`

list molecule-sets List all molecule sets in your workspace.

show molset|molecule-set '<molset_or_sdf_or_smi_path>' | using dataframe <dataframe> Launch the molset viewer to visualize your molecule dataset.

Examples:

  • show molset 'neurotransmitters.mol.json'
  • show molset 'neurotransmitters.sdf'
  • show molset 'neurotransmitters.smi'
  • show molset my_dataframe


Utility

merge molecules data using dataframe <dataframe> [ merge with pubchem ]
This command merges molecules into the molecule working list from a dataframe.

display data '<filename.csv>' Display data from a csv file.

-> result save [as '<filename.csv>'] Save table data to csv file.

-> result open Explore table data in the browser.
if you append -d to the end of the command result open -d display will result to data viewer.

-> result edit Edit table data in the browser.
if you append -d to the end of the command result open -d display will result to data viewer.

-> result copy Copy table data to clipboard, formatted for spreadheet.

-> result display Display the result in the CLI.

    if you append `-d` to the end of the command `result open -d` display will result to data viewer. <br><br>

-> result as dataframe Return the result as dataframe (only for Jupyter Notebook)

edit config '<json_config_file>' [ schema '<schema_file>'] Edit any JSON file in your workspace directly from the CLI. If a schema is specified, it will be used for validation and documentation.


Toolkits

ds4sd Display the splash screen for the DS4SD toolkit.

rxn Display the splash screen for the RXN toolkit.

st4sd Display the splash screen for the ST4SD toolkit.

list toolkits List all installed toolkits. To see all available toolkits, run list all toolkits.

list all toolkits List all available toolkits.

add toolkit <toolkit_name> Install a toolkit.

remove toolkit <toolkit_name> Remove a toolkit from the registry.

Note: This doesn’t delete the toolkit code. If the toolkit is added again, a backup of the previous install is created in the toolkit directory at ~/.openad/toolkits.

update toolkit <toolkit_name> Update a toolkit with the latest version. It is recommended to do this on a regular basis.

update all toolkits Update all installed toolkits with the latest version. Happens automatically whenever OpenAD is updated to a new version.

set context <toolkit_name> [ reset ] Set your context to the chosen toolkit. By setting the context, the selected toolkit functions become available to you. The optional parameter reset can be used to reset your login information.

get context Display the currently selected toolkit.

unset context Exit your toolkit context. You will no longer have access to toolkit-specific functions.


Runs

create run Start recording a run.

remove run <run_name> remove a run.

save run as <run_name> Stop recording a run and save it.

run <run_name> Execute a previously recorded run. This will execute every command and continue regardless of any failures.

list runs List all runs saved in the current workspace.

display run <run_name> Display the commands stored in a certain run.


GUI

install gui Install the OpenAD GUI (graphical user interface).

The graphical user interface allows you to browse your workspace and visualize your datasets and molecules.

launch gui Launch the OpenAD GUI (graphical user interface).

restart gui Terminate and then restart the GUI server.

quit gui Terminate the GUI server.


LLM

tell me <how to do xyz> Ask your AI assistant how to do anything in OpenAD.

set llm <language_model_name> Set the target language model name for the tell me command.

clear llm auth Clear the language model’s authentication file.


File System

list files [ path ] List al directories and files in your current workspace.

import from '<external_source_file>' to '<workspace_file>' Import a file from outside OpenAD into your current workspace.

export from '<workspace_file>' to '<external_file>' Export a file from your current workspace to anywhere on your hard drive.

copy file '<workspace_file>' to '<other_workspace_name>' Export a file from your current workspace to another workspace.

remove '<filename>' Remove a file from your current workspace.

open '<filename>' Open a file or dataframe in an iframe

Examples:

  • open 'base_molecules.sdf'
  • open my_dataframe


Help

intro Display an introduction to the OpenAD CLI.

docs Open the documentation webpage.

? List all available commands.

? ...<soft> --> List all commands containing "..."</soft>

... ?<soft> --> List all commands starting with "..."</soft>


Model

model auth list show authentication group mapping

model auth add group '<auth_group>'|<auth_group> with '<api_key>' add an authentication group for model services to use

model auth remove group '<auth_group>' | <auth_group> remove an authentication group

model auth add service '<service_name>'|,service_name> to group '<auth_group>'|<auth_group> Attach an authentication group to a model service

model auth remove service '<service_name>'|<service_name> Detatch an authentication group from a model service

model service status Get the status of currently cataloged services

model service describe '<service_name>'|<service_name> get the configuration of a service

model catalog list get the list of currently cataloged services

uncatalog model service '<service_name>'|<service_name> uncatalog a model service

Example:
uncatalog model service 'gen'

catalog model service from (remote) '<path> or <github> or <service_url>' as '<service_name>'|<service_name> USING (<parameter>=<value> <parameter>=<value>) catalog a model service from a path or github or remotely from an existing OpenAD service.
(USING) optional headers parameters for communication with service backend.
If you are cataloging a service using a model defined in a directory, provide the absolute ` ` of that directory in quotes.

The following options require the remote option be declared.

If you are cataloging a service using a model defined in github repository, provide the absolute ` ` of that github directory quotes.

If you are cataloging a remote service on a ip address and port provide the remote services ipaddress and port in quoted string e.g. '0.0.0.0:8080'

service_name: this is the name of the service as you will define it for your usage. e.g prop short for properties.

USING Parameters:

If using a hosted service the following parameters must be supplied:
-Inference-Service: this is the name of the inference service that is hosted, it is a required parameter if cataloging a remote service.
An authorization parameter is always required if cataloging a hosted service, either Auhtorisation group (auth_group) or Authorisation bearer_token/api_key (Authorization):
-auth_group: this is the name of an authorization group which contains the api_key linked to the service access. This can only be used if Authorization is not also defined.
OR
-Authorization: this parameter is designed to be used when a auth_group is not defined.

Example:

Skypilot Deployment
-catalog model service from 'git@github.com:acceleratedscience/generation_inference_service.git' as 'gen'

Service using a authentication group
-catalog model service from remote '<service_url>' as molf USING (Inference-Service=molformer )
` model auth add service ‘molf’ to group ‘default’`

Single Authorisation Service
-openad catalog model service from remote '<service_URL>' as 'gen' USING (Inference-Service=generation Authorization='<api_key>')

Catalog a remote service shared with you:
-catalog model service from remote 'http://54.235.3.243:30001' as gen

model service up '<service_name>'|<service_name> [no_gpu]} launches a cataloged model service when it was cataloged as a self managed service from a directory or github repository.
If you do not want to launch a service with GPU you should specify no_gpu at the end of the command.
Examples:

-model service up gen

-model service up 'gen'

-model service up gen no_gpu

model service local up '<service_name>'|<service_name> Launches a model service locally.

        Example: <br> 
          ` model service local up gen` <br><br>

model service down '<service_name>'|<service_name> Bring down a model service
Examples:

model service down gen

model service down 'gen'


DS4SD

See commands

Search Molecules

search for similar molecules to '<smiles>' [ save as '<filename.csv>' ] Search for molecules that are similar to the provided molecule or molecule substructure as provided in the <smiles_string>.

Use the save as clause to save the results as a csv file in your current workspace.

Example:
search for similar molecules to 'C1(C(=C)C([O-])C1C)=O'

search for molecules in patents from list ['<patent1>', '<patent2>', ...] | dataframe <dataframe_name> | file '<filename.csv>' [ save as '<filename.csv>' ] Search for molecules mentioned in a defined list of patents. When sourcing patents from a CSV or DataFrame, there must be column named “PATENT ID” or “patent id”.

Use the save as clause to save the results as a csv file in your current workspace.

Example:
search for molecules in patents from list ['CN108473493B','US20190023713A1']

search for patents containing molecule '<smiles>' | '<inchi>' | '<inchikey>' [ save as '<filename.csv>' ] Search for mentions of a specified molecules in registered patents. The queried molecule can be described as a SMILES string, InChI or InChiKey.

Use the save as clause to save the results as a csv file in your current workspace.

Example:
search for patents containing molecule 'CC(C)(c1ccccn1)C(CC(=O)O)Nc1nc(-c2c[nH]c3ncc(Cl)cc23)c(C#N)cc1F'

search for substructure instances of '<smiles>' [ save as '<filename.csv>' ] Search for molecules by substructure, as defined by the <smiles_string>.

Use the save as clause to save the results as a csv file in your current workspace.

Example:
search for substructure instances of 'C1(C(=C)C([O-])C1C)=O' save as 'my_mol'


Search Collections

search collection '<collection_name_or_key>' for '<search_string>' [ using (page_size=<int> system_id=<system_id> edit_distance=<integer> display_first=<integer>) ] show (data | docs) [ estimate only | return as data | save as '<filename.csv>' ] Performs a document search of the Deep Search repository based on a given collection. The required using clause specifies the collection to search. Use estimate only to return only the potential number of hits.

Parameters:

  • <collection_name_or_key> The name or index key for a collection. Use the command display all collections to list available collections.
  • <search_string> The search string for the search.

The <search_string> supports elastic search string query syntax:

  • + Signifies AND operation.
  • | Signifies OR operation.
  • - Negates a single token.
  • \" Wraps a number of tokens to signify a phrase for searching.
  • * At the end of a term -> signifies a prefix query
  • ( & ) Signifies precedence
  • ~N After a word -> signifies edit distance (fuzziness)
  • ~N After a phrase -> signifies slop amount

Options for the using clause:

Note: The using clause requires all enclosed parameters to be defined in the same order as listed below.

  • page_size=<integer> Result pagination, the default is None.
  • system_id=<system_id> System cluster id, the default is ‘default’.
  • edit_distance=<integer> (0-5) Sets the search word span criteria for key words for document searches, the default is 5. When set to 0, no snippets will be be returned.
  • display_first=<integer> When set, the displayed result set will be truncated at the given number.

Clauses:

  • show (data | docs):
    • data Display structured data from within the documents.
    • docs Display document context and preview snippet.
      Both can be combined in a single command, e.g. show (data docs)
  • estimate only Determine the potential number of hits.
  • return as data For Notebook or API mode. Removes all styling from the Pandas DataFrame, ready for further processing.

Examples:

  • Look for documents that contain discussions on power conversion efficiency:
    search collection 'arxiv-abstract' for 'ide(\"power conversion efficiency\" OR PCE) AND organ*' using ( edit_distance=20 system_id=default) show (docs)

  • Search the PubChem archive for ‘Ibuprofen’ and display related molecules’ data:
    search collection 'pubchem' for 'Ibuprofen' show (data)

  • Search for patents which mention a specific smiles molecule:
    search collection 'patent-uspto' for '\"smiles#ccc(coc(=o)cs)(c(=o)c(=o)cs)c(=o)c(=o)cs\"' show (data)

display collection matches for '<search_string>' [ save as '<filename.csv>' ] Search all collections for documents that contain a given Deep Search <search_string>. This is useful when narrowing down document collection(s) for subsequent search. You can use the <index_key> from the returned table in your next search.

Use the save as clause to save the results as a csv file in your current workspace.

Example:
display collection matches for 'Ibuprofen'


Collections

display collections in domains from list <list_of_domains> [ save as '<filename.csv>' ] Display collections that belong to the listed domains.

Use the save as clause to save the results as a csv file in your current workspace.

Use the command display all collections to find available domains.

Example:
display collections in domains from list ['Scientific Literature']

display all collections [ save as '<filename.csv>' ] Display all available collections in Deep Search.

Use the save as clause to save the results as a csv file in your current workspace.

display collections for domain '<domain_name>' Display the available collections in a given Deep Search domain.

Use the command display all collections to find available domains.

Example:
display collections for domain 'Business Insights'

display collection details '<collection_name_or_key>' Display the details for a specified collection. You can specify a collection by its name or key.

Use the command display all collections to list available collections.

Example:
display collection details 'Patents from USPTO'


RXN

See commands

General

interpret recipe '<recipe_paragraph>' | '<txt_filename>' Build a ordered list of actions interpreted from a provided text-based recipe. The recipe can be provided as a string or as a text file from your current workspace.

Examples:

  • interpret recipe 'my_recipe.txt'
  • interpret recipe 'A solution of ((1S,2S)-1-{[(methoxymethyl-biphenyl-4-yl)-(2-pyridin-2-yl-cyclopropanecarbonyl)-amino]-methyl}-2-methyl-butyl)-carbamic acid tert-butyl ester (25 mg, 0.045 mmol) and dichloromethane (4 mL) was treated with a solution of HCl in dioxane (4 N, 0.5 mL) and the resulting reaction mixture was maintained at room temperature for 12 h. The reaction was then concentrated to dryness to afford (1R,2R)-2-pyridin-2-yl-cyclopropanecarboxylic acid ((2S,3S)-2-amino-3-methylpentyl)-(methoxymethyl-biphenyl-4-yl)-amide (18 mg, 95% yield) as a white solid.'

list rxn models Lists all RXN AI models currently available.


Retrosynthesis

predict retrosynthesis '<smiles>' [ using (option1=<value> option2=<value>) ] Perform a retrosynthesis route prediction on a molecule.

Note: The using clause requires all enclosed parameters to be defined in the same order as listed below.

Optional Parameters that can be specified in the using clause:

  • availability_pricing_threshold=<int> Maximum price in USD per g/ml of compounds. Default: no threshold.
  • available_smiles='<smiles>.<smiles>.<smiles>' List of molecules available as precursors, delimited with a period.
  • exclude_smiles='<smiles>.<smiles>.<smiles>' List of molecules to exlude from the set of precursors, delimited with a period.
  • exclude_substructures='<smiles>.<smiles>.<smiles>' List of substructures to excludefrom the set of precursors, delimited with a period.
  • exclude_target_molecule=<boolean> Excluded target molecule. The default is True
  • fap=<float> Every retrosynthetic step is evaluated with the FAP, and is only retained when forward confidence is greater than the FAP value. The default is 0.6.
  • max_steps=<int> The maximum number steps in the results. The default is 3.
  • nbeams=<int> The maximum number of beams exploring the hypertree. The default is 10.
  • pruning_steps=<int> The number of steps to prune a hypertree. The default is 2.
  • ai_model='<model_name>' What model to use. Use the command list rxn models to list all available models. The default is ‘2020-07-01’.

Example:
predict retrosynthesis 'BrCCc1cccc2c(Br)c3ccccc3cc12' using (max_steps=3)


Prediction

predict reaction in batch from dataframe <dataframe_name> | file '<filename.csv>' | list ['<smiles>.<smiles>','<smiles>.<smiles>'] [ using (ai_model='<ai_model>') ] [ use_saved ] Run a batch of reaction predictions. The provided list of reactions can be specified as a DataFrame, a CSV file from your current workspace or a list of strings. When proving a DataFrame or CSV file, we will look for the “reactions” column.

Reactions are defined by combining two SMILES strings delimited by a period. For example: 'BrBr.c1ccc2cc3ccccc3cc2c1'

Optional Parameters that can be specified in the using clause:

  • ai_model='<model_name>' What model to use. Use the command list rxn models to list all available models. The default is ‘2020-07-01’.

You can reuse previously generated results by appending the optional use_saved clause. This will reuse the results of a previously run command with the same parameters, if available.

Examples:

  • predict reaction in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1']
  • predict reaction in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1'] use_saved

predict reaction '<smiles>.<smiles>' [ using (ai_model='<ai_model>') ] [ use_saved ] Predict the reaction between two molecules.

Reactions are defined by combining two SMILES strings delimited by a period. For example: 'BrBr.c1ccc2cc3ccccc3cc2c1'

Optional Parameters that can be specified in the using clause:

  • ai_model='<model_name>' What model to use. Use the command list rxn models to list all available models. The default is ‘2020-07-01’.

You can reuse previously generated results by appending the optional use_saved clause. This will reuse the results of a previously run command with the same parameters, if available.

Examples:

  • predict reaction 'BrBr.c1ccc2cc3ccccc3cc2c1CCO'
  • predict reaction 'BrBr.c1ccc2cc3ccccc3cc2c1CCO' use_saved

predict reaction topn in batch from dataframe <dataframe_name> | file '<filename.csv>' | list ['<smiles>.<smiles>','<smiles>.<smiles>'] [ using (topn=<integer> ai_model='<ai_model>') ] [ use_saved ] Run a batch of reaction predictions for topn. The provided list of reactions can be specified as a DataFrame, a CSV file from your current workspace or a list of strings. When proving a DataFrame or CSV file, we will look for the “reactions” column.

Reactions are defined by combining two SMILES strings delimited by a period. For example: 'BrBr.c1ccc2cc3ccccc3cc2c1'

Optional Parameters that can be specified in the using clause:

  • ai_model='<model_name>' What model to use. Use the command list rxn models to list all available models. The default is ‘2020-07-01’.
  • topn=<integer> Defined the number of results being returned. The default value is 3.

You can reuse previously generated results by appending the optional use_saved clause. This will reuse the results of a previously run command with the same parameters, if available.

Examples:

  • predict reaction topn in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1']
  • predict reaction topn in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1'] using (topn=6)
  • predict reaction topn in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1'] use_saved


ST4SD

See commands