Table of Contents
OpenAD
See commands
General
openad
Display the openad splash screen.
get status
Display the currently selected workspace and toolkit.
display history
Display the last 30 commands run in your current workspace.
clear sessions
Clear any other sessions that may be running.
Workspaces
set workspace <workspace_name>
Change the current workspace.
get workspace [ <workspace_name> ]
Display details a workspace. When no workspace name is passed, details of your current workspace are displayed.
create workspace <workspace_name> [ description('<description>') on path '<path>' ]
Create a new workspace with an optional description and path.
remove workspace <workspace_name>
Remove a workspace from your registry. Note that this doesn’t remove the workspace’s directory.
list workspaces
Lists all your workspaces.
Molecules Working Set
add mol|molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> [ as '<name>' ] [ basic ] [ force ]
This command is how you add a molecule to a current working list of molecules in memory. When adding a molecule by name, this name will become the molecule’s identifying string.
It will take any molecules identifier from the following categories:
-smiles
-name or synonym
-smiles
-inchi
-inchikey
-cid
Options :
- as <name>
: if the ` as ‘
You can set or override an name later for any molecule with the `rename molecule` command.
- ` basic ` Creates a molecule that does not have its properties and synonyms populated with pubchem data, this feature is only valid with a SMILES molecule identifier
- `force`: The `force` option allows you to ovveride the confirmation that you wish to add a molecule.
You can use the ‘mol’ shorthand instead of ‘molecule’.
You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string
If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.
Examples of how to add a molecule to your working list:
-
Add a molecule by name:
add molecule aspirin
or with single quotes
` display molecule ‘Aspirin 325 mg’ ` -
Add a molecule by name and force through the acknowledgement to add it:
add molecule aspirin force
-
Add a molecule by SMILES:
add molecule CC(=O)OC1=CC=CC=C1C(=O)O
-
Add a molecule by SMILES without populated pubchem properties:
add molecule CC(=O)OC1=CC=CC=C1C(=O)O basic
-
Add a molecule by CID:
add mol 2244
-
Add a molecule by InChI:
add mol InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)
or with single quotes
add mol 'InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)'
-
Add a molecule by InChIKey:
add mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N
-
Add a molecule by InChIKey nd set its name to “mymol”:
add mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N as 'mymol'
-
Add a molecule by SMILES nd set its name to “mymol” and not prepopulate values from pubchem:
add mol CC(=O)OC1=CC=CC=C1C(=O)O as 'mymol' basic
display sources <name> | <smiles> | <inchi> | <inchikey> | <cid>
Display the sources of a molecule’s properties, attributing back to how they were calculated or sourced.
If the requested molecule exists in your current working list, that version will be used.
You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string
If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.
rename molecule <molecule_identifer_string> as <molecule_name>
Rename a molecule in the current working list.
{MOL_SHORTHAND}
Example:
Let’s say you’ve added a molecule “CC(=O)OC1=CC=CC=C1C(=O)O” to your current working list of molecules, you can then rename it as such:
rename molecule CC(=O)OC1=CC=CC=C1C(=O)O as Aspirin
export molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> [ as file ]
When run inside a jupyter lab notebook, this will return a dictionary of the molecule’s properties. When run from the command line, or when as file
is set, the molecule will be saved to your workspace as a JSON file, named after the molecule’s identifying string.
If the molecule is in your current working list it will be retrieved from there, if the molecule is not there pubchem will be called to retrieve the molecule.
You can use the ‘mol’ shorthand instead of ‘molecule’.
If the requested molecule exists in your current working list, that version will be used.
If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.
Examples
export molecule aspirin
export molecule aspirin as file
remove mol|molecule <name> | <smiles> | <inchi> | <inchikey> | <cid> [ force ]
Remove a molecule from the current working list based on a given molecule identifier.
{MOL_SHORTHAND}
Examples:
-
Remove a molecule by name:
remove molecule Aspirin
-
Remove a molecule by SMILES:
remove molecule CC(=O)OC1=CC=CC=C1C(=O)O
-
Remove a molecule by InChIKey:
remove mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N
-
Remove a molecule by InChI:
remove mol InChI='1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)'
-
Remove a molecule by CID:
remove mol 2244
list molecules
List all molecules in the current working list.
show mols|molecules
Display the current working list of molecules in the GUI.
enrich molecules with analysis
This command Enriches every molecule in your current working list of molecules with the analysis results. This assumes that molecules in the current working list was the input or result for the analysis.
This command currently covers results from the following Analysis commands: <br>
- RXN Toolkit `predict Reaction` <br>
- RXN Toolkit `predict retrosynthesis ` <br>
- DS4SD Toolkit `search for patents containing molecule` <br>
- DS4SD Toolkit `search for similiar molecules` <br>
See the Deep Search toolkit and RXN toolkit help for further assistance on these commands. <br><br>
clear analysis cache
this command clears the cache of analysis results for your current workspace.
clear molecules
This command clears the working list of molecules.
load molecules using file '<csv_or_sdf_filename>' [ merge with pubchem ] [append]
This command Loads molecules from a CSV or SDF file into the molecule working list.
Options: <br>
- you can add `merge with pubchem` to the command to fill in missing properties of the molecule. <br>
- you can append to the existing working set using the command <cmd> append </append> <br><br>
load molecules using dataframe <dataframe> [ merge with pubchem ] [append]
“
This command Load molecules into the molecule working list from a dataframe.
Options: <br>
- you can add `merge with pubchem` to the command to fill in missing properties of the molecule. NOTE: this will slow the process down <br>
- you can append to the existing working set using the command <cmd> append </append> <br><br>
export molecules [ as <csv_filename> ]
This command exports the molecules in the current working list of molecules.
When run inside a Notebook, this will return a dataframe. When run from the command line, the molecules will be saved to your workspace as a CSV file named “result_#.csv”. The rows will be numbered with the highest number representing the latest molecule that was added.
Molecules
display molecule <name> | <smiles> | <inchi> | <inchikey> | <cid>
This command will display a molecule’s identifiers, propoerties, synonyms and any Analysis results it has been enriched with.
if the molecule exists in the current molecule workling list in memory the molecule will be retrieved from there if not pubchem will be checked to see if the molecule and its information is avialable there.
You can use the ‘mol’ shorthand instead of ‘molecule’.
If the requested molecule exists in your current working list, that version will be used.
You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string
If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.
Examples:
-
Display a molecule by name:
display molecule Aspirin
-
Display a molecule by SMILES:
display molecule CC(=O)OC1=CC=CC=C1C(=O)O
-
Display a molecule by InChI:
display mol InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)
-
Display a molecule by InChIKey string:
display mol BSYNRYMUTXBXSQ-UHFFFAOYSA-N
-
Display a molecule by CID:
display mol 2244
@(<name> | <smiles> | <inchi> | <inchikey> | <cid>)>><molecule_property_name>
This command request the given property of a molecule, it will first try and retrieve the provided molecule from your working list of molecules, if it is not there it will will try and retrieve the molecule from pubchem.
The @
symbol should be followed by the molecule’s name, SMILES, InChI, InChIKey or CID, then after the >>
include one of the properties mentioned below.
E.g. @aspirin>>xlogp
You can specify any molecule by SMILES or InChI, and PubChem classified molecules also by name, InChIKey or their PubChem CID.
A molecule identifier can be in single quotes or defined with unquoted text. If you have spaces in your molecule identifier e.g. a name, then you must user a single quoted string
If you use the name of a molecule, the tool will do a caseless search of the names and synonyms first in current working list, then on pubchem.
Examples of how to retrieve the value of a molecules property:
-
Obtain the molecular weight of the molecule known as Aspirin.
@aspirin>>molecular_weight
-
Obtain the canonical smiles string for a molecule known as Aspirin.
@aspirin>>canonical_smiles
-
Obtain a molecules xlogp value using a SMILES string.
@CC(=O)OC1=CC=CC=C1C(=O)O>>xlogp
Available properties: atom_stereo_count
, bond_stereo_count
, canonical_smiles
, charge
, cid
, complexity
, conformer_count_3d
, conformer_id_3d
, conformer_model_rmsd_3d
, conformer_rmsd_3d
, coordinate_type
, covalent_unit_count
, defined_atom_stereo_count
, defined_bond_stereo_count
, effective_rotor_count_3d
, exact_mass
, feature_acceptor_count_3d
, feature_anion_count_3d
, feature_cation_count_3d
, feature_count_3d
, feature_donor_count_3d
, feature_hydrophobe_count_3d
, feature_ring_count_3d
, h_bond_acceptor_count
, h_bond_donor_count
, heavy_atom_count
, inchi
, inchikey
, isomeric_smiles
, isotope_atom_count
, iupac_name
, mmff94_energy_3d
, mmff94_partial_charges_3d
, molecular_formula
, molecular_weight
, molecular_weight_exact
, monoisotopic_mass
, multipoles_3d
, multipoles_3d
, pharmacophore_features_3d
, pharmacophore_features_3d
, rotatable_bond_count
, sol
, sol_classification
, tpsa
, undefined_atom_stereo_count
, undefined_bond_stereo_count
, volume_3d
, x_steric_quadrupole_3d
, xlogp
, y_steric_quadrupole_3d
, z_steric_quadrupole_3d
show mol|molecule <name> | <smiles> | <inchi> | <inchikey> | <cid>
Inspect a molecule in the browser. If a molecule is not in the current Molecule Working set it will pull the result from Pubchem.
{MOL_SHORTHAND}
When you show a molecule by SMILES or InChI, we can display it immediately. When you show a molecule by name, InChIKey or PubChem CID, we need to first retrieve it from PubChem, which can take a few seconds.
Examples:
show mol aspirin
show mol CC(=O)OC1=CC=CC=C1C(=O)O
show mol InChI=1S/C9H8O4/c1-6(10)13-8-5-3-2-4-7(8)9(11)12/h2-5H,1H3,(H,11,12)
show mol 2244
Molecule-sets
save molecule-set as <molecule_set_name>
Save the current molecule workking list to a molecule-set in your workspace.
Example:
save molset as my_working_set
load molecule-set|molset <molecule-set_name>
Loads a molecule-set from your workspace, and replaces your current list of molecules with the molecules from the given molecule-set.
Example:
load molset my_working_set
merge molecule-set|molset <molecule-set_name> [merge only] [append only]
This command merges a molecule-set from your workspace into cour current working list of molecules in memory, and updates properties/Analysis in existing molecules or appends new molecules to the working list.
Options:
- ` merge only Only merges with existing molecules in list <br> -
append only Only append molecules not in list <br>
merge molset my_working_set`
list molecule-sets
List all molecule sets in your workspace.
show molset|molecule-set '<molset_or_sdf_or_smi_path>' | using dataframe <dataframe>
Launch the molset viewer to visualize your molecule dataset.
Examples:
show molset 'neurotransmitters.mol.json'
show molset 'neurotransmitters.sdf'
show molset 'neurotransmitters.smi'
show molset my_dataframe
Utility
merge molecules data using dataframe <dataframe> [ merge with pubchem ]
“
This command merges molecules into the molecule working list from a dataframe.
display data '<filename.csv>'
Display data from a csv file.
-> result save [as '<filename.csv>']
Save table data to csv file.
-> result open
Explore table data in the browser.
if you append -d
to the end of the command result open -d
display will result to data viewer.
-> result edit
Edit table data in the browser.
if you append -d
to the end of the command result open -d
display will result to data viewer.
-> result copy
Copy table data to clipboard, formatted for spreadheet.
-> result display
Display the result in the CLI.
if you append `-d` to the end of the command `result open -d` display will result to data viewer. <br><br>
-> result as dataframe
Return the result as dataframe (only for Jupyter Notebook)
edit config '<json_config_file>' [ schema '<schema_file>']
Edit any JSON file in your workspace directly from the CLI. If a schema is specified, it will be used for validation and documentation.
Toolkits
ds4sd
Display the splash screen for the DS4SD toolkit.
rxn
Display the splash screen for the RXN toolkit.
st4sd
Display the splash screen for the ST4SD toolkit.
list toolkits
List all installed toolkits. To see all available toolkits, run list all toolkits
.
list all toolkits
List all available toolkits.
add toolkit <toolkit_name>
Install a toolkit.
remove toolkit <toolkit_name>
Remove a toolkit from the registry.
Note: This doesn’t delete the toolkit code. If the toolkit is added again, a backup of the previous install is created in the toolkit directory at ~/.openad/toolkits
.
update toolkit <toolkit_name>
Update a toolkit with the latest version. It is recommended to do this on a regular basis.
update all toolkits
Update all installed toolkits with the latest version. Happens automatically whenever OpenAD is updated to a new version.
set context <toolkit_name> [ reset ]
Set your context to the chosen toolkit. By setting the context, the selected toolkit functions become available to you. The optional parameter reset
can be used to reset your login information.
get context
Display the currently selected toolkit.
unset context
Exit your toolkit context. You will no longer have access to toolkit-specific functions.
Runs
create run
Start recording a run.
remove run <run_name>
remove a run.
save run as <run_name>
Stop recording a run and save it.
run <run_name>
Execute a previously recorded run. This will execute every command and continue regardless of any failures.
list runs
List all runs saved in the current workspace.
display run <run_name>
Display the commands stored in a certain run.
GUI
install gui
Install the OpenAD GUI (graphical user interface).
The graphical user interface allows you to browse your workspace and visualize your datasets and molecules.
launch gui
Launch the OpenAD GUI (graphical user interface).
restart gui
Terminate and then restart the GUI server.
quit gui
Terminate the GUI server.
LLM
tell me <how to do xyz>
Ask your AI assistant how to do anything in OpenAD.
set llm <language_model_name>
Set the target language model name for the tell me
command.
clear llm auth
Clear the language model’s authentication file.
File System
list files [ path ]
List al directories and files in your current workspace.
import from '<external_source_file>' to '<workspace_file>'
Import a file from outside OpenAD into your current workspace.
export from '<workspace_file>' to '<external_file>'
Export a file from your current workspace to anywhere on your hard drive.
copy file '<workspace_file>' to '<other_workspace_name>'
Export a file from your current workspace to another workspace.
remove '<filename>'
Remove a file from your current workspace.
open '<filename>'
Open a file or dataframe in an iframe
Examples:
open 'base_molecules.sdf'
open my_dataframe
Help
intro
Display an introduction to the OpenAD CLI.
docs
Open the documentation webpage.
?
List all available commands.
? ...<soft> --> List all commands containing "..."</soft>
... ?<soft> --> List all commands starting with "..."</soft>
Model
model auth list
show authentication group mapping
model auth add group '<auth_group>'|<auth_group> with '<api_key>'
add an authentication group for model services to use
model auth remove group '<auth_group>' | <auth_group>
remove an authentication group
model auth add service '<service_name>'|,service_name> to group '<auth_group>'|<auth_group>
Attach an authentication group to a model service
model auth remove service '<service_name>'|<service_name>
Detatch an authentication group from a model service
model service status
Get the status of currently cataloged services
model service describe '<service_name>'|<service_name>
get the configuration of a service
model catalog list
get the list of currently cataloged services
uncatalog model service '<service_name>'|<service_name>
uncatalog a model service
Example:
uncatalog model service 'gen'
catalog model service from (remote) '<path> or <github> or <service_url>' as '<service_name>'|<service_name> USING (<parameter>=<value> <parameter>=<value>)
catalog a model service from a path or github or remotely from an existing OpenAD service.
(USING) optional headers parameters for communication with service backend.
If you are cataloging a service using a model defined in a directory, provide the absolute `
The following options require the remote
option be declared.
If you are cataloging a service using a model defined in github repository, provide the absolute `
If you are cataloging a remote service on a ip address and port provide the remote services ipaddress and port in quoted string e.g. '0.0.0.0:8080'
service_name
: this is the name of the service as you will define it for your usage. e.g prop
short for properties.
USING Parameters:
If using a hosted service the following parameters must be supplied:
-Inference-Service
: this is the name of the inference service that is hosted, it is a required parameter if cataloging a remote service.
An authorization parameter is always required if cataloging a hosted service, either Auhtorisation group (auth_group
) or Authorisation bearer_token/api_key (Authorization
):
-auth_group
: this is the name of an authorization group which contains the api_key linked to the service access. This can only be used if Authorization
is not also defined.
OR
-Authorization
: this parameter is designed to be used when a auth_group
is not defined.
Example:
Skypilot Deployment
-catalog model service from 'git@github.com:acceleratedscience/generation_inference_service.git' as 'gen'
Service using a authentication group
-catalog model service from remote '<service_url>' as molf USING (Inference-Service=molformer )
` model auth add service ‘molf’ to group ‘default’`
Single Authorisation Service
-openad catalog model service from remote '<service_URL>' as 'gen' USING (Inference-Service=generation Authorization='<api_key>')
Catalog a remote service shared with you:
-catalog model service from remote 'http://54.235.3.243:30001' as gen
model service up '<service_name>'|<service_name> [no_gpu]}
launches a cataloged model service when it was cataloged as a self managed service from a directory or github repository.
If you do not want to launch a service with GPU you should specify no_gpu
at the end of the command.
Examples:
-model service up gen
-model service up 'gen'
-model service up gen no_gpu
model service local up '<service_name>'|<service_name>
Launches a model service locally.
Example: <br>
` model service local up gen` <br><br>
model service down '<service_name>'|<service_name>
Bring down a model service
Examples:
model service down gen
model service down 'gen'
DS4SD
See commands
Search Molecules
search for similar molecules to '<smiles>' [ save as '<filename.csv>' ]
Search for molecules that are similar to the provided molecule or molecule substructure as provided in the <smiles_string>
.
Use the save as
clause to save the results as a csv file in your current workspace.
Example:
search for similar molecules to 'C1(C(=C)C([O-])C1C)=O'
search for molecules in patents from list ['<patent1>', '<patent2>', ...] | dataframe <dataframe_name> | file '<filename.csv>' [ save as '<filename.csv>' ]
Search for molecules mentioned in a defined list of patents. When sourcing patents from a CSV or DataFrame, there must be column named “PATENT ID” or “patent id”.
Use the save as
clause to save the results as a csv file in your current workspace.
Example:
search for molecules in patents from list ['CN108473493B','US20190023713A1']
search for patents containing molecule '<smiles>' | '<inchi>' | '<inchikey>' [ save as '<filename.csv>' ]
Search for mentions of a specified molecules in registered patents. The queried molecule can be described as a SMILES string, InChI or InChiKey.
Use the save as
clause to save the results as a csv file in your current workspace.
Example:
search for patents containing molecule 'CC(C)(c1ccccn1)C(CC(=O)O)Nc1nc(-c2c[nH]c3ncc(Cl)cc23)c(C#N)cc1F'
search for substructure instances of '<smiles>' [ save as '<filename.csv>' ]
Search for molecules by substructure, as defined by the <smiles_string>
.
Use the save as
clause to save the results as a csv file in your current workspace.
Example:
search for substructure instances of 'C1(C(=C)C([O-])C1C)=O' save as 'my_mol'
Search Collections
search collection '<collection_name_or_key>' for '<search_string>' [ using (page_size=<int> system_id=<system_id> edit_distance=<integer> display_first=<integer>) ] show (data | docs) [ estimate only | return as data | save as '<filename.csv>' ]
Performs a document search of the Deep Search repository based on a given collection. The required using
clause specifies the collection to search. Use estimate only
to return only the potential number of hits.
Parameters:
<collection_name_or_key>
The name or index key for a collection. Use the commanddisplay all collections
to list available collections.<search_string>
The search string for the search.
The <search_string>
supports elastic search string query syntax:
+
Signifies AND operation.|
Signifies OR operation.-
Negates a single token.\"
Wraps a number of tokens to signify a phrase for searching.*
At the end of a term -> signifies a prefix query(
&)
Signifies precedence~N
After a word -> signifies edit distance (fuzziness)~N
After a phrase -> signifies slop amount
Options for the using
clause:
Note: The
using
clause requires all enclosed parameters to be defined in the same order as listed below.
page_size=<integer>
Result pagination, the default is None.system_id=<system_id>
System cluster id, the default is ‘default’.edit_distance=<integer>
(0-5) Sets the search word span criteria for key words for document searches, the default is 5. When set to 0, no snippets will be be returned.display_first=<integer>
When set, the displayed result set will be truncated at the given number.
Clauses:
show (data | docs)
:
data
Display structured data from within the documents.docs
Display document context and preview snippet.
Both can be combined in a single command, e.g.show (data docs)
estimate only
Determine the potential number of hits.return as data
For Notebook or API mode. Removes all styling from the Pandas DataFrame, ready for further processing.
Examples:
-
Look for documents that contain discussions on power conversion efficiency:
search collection 'arxiv-abstract' for 'ide(\"power conversion efficiency\" OR PCE) AND organ*' using ( edit_distance=20 system_id=default) show (docs)
-
Search the PubChem archive for ‘Ibuprofen’ and display related molecules’ data:
search collection 'pubchem' for 'Ibuprofen' show (data)
-
Search for patents which mention a specific smiles molecule:
search collection 'patent-uspto' for '\"smiles#ccc(coc(=o)cs)(c(=o)c(=o)cs)c(=o)c(=o)cs\"' show (data)
display collection matches for '<search_string>' [ save as '<filename.csv>' ]
Search all collections for documents that contain a given Deep Search <search_string>
. This is useful when narrowing down document collection(s) for subsequent search. You can use the <index_key>
from the returned table in your next search.
Use the save as
clause to save the results as a csv file in your current workspace.
Example:
display collection matches for 'Ibuprofen'
Collections
display collections in domains from list <list_of_domains> [ save as '<filename.csv>' ]
Display collections that belong to the listed domains.
Use the save as
clause to save the results as a csv file in your current workspace.
Use the command display all collections
to find available domains.
Example:
display collections in domains from list ['Scientific Literature']
display all collections [ save as '<filename.csv>' ]
Display all available collections in Deep Search.
Use the save as
clause to save the results as a csv file in your current workspace.
display collections for domain '<domain_name>'
Display the available collections in a given Deep Search domain.
Use the command display all collections
to find available domains.
Example:
display collections for domain 'Business Insights'
display collection details '<collection_name_or_key>'
Display the details for a specified collection. You can specify a collection by its name or key.
Use the command display all collections
to list available collections.
Example:
display collection details 'Patents from USPTO'
RXN
See commands
General
interpret recipe '<recipe_paragraph>' | '<txt_filename>'
Build a ordered list of actions interpreted from a provided text-based recipe. The recipe can be provided as a string or as a text file from your current workspace.
Examples:
interpret recipe 'my_recipe.txt'
interpret recipe 'A solution of ((1S,2S)-1-{[(methoxymethyl-biphenyl-4-yl)-(2-pyridin-2-yl-cyclopropanecarbonyl)-amino]-methyl}-2-methyl-butyl)-carbamic acid tert-butyl ester (25 mg, 0.045 mmol) and dichloromethane (4 mL) was treated with a solution of HCl in dioxane (4 N, 0.5 mL) and the resulting reaction mixture was maintained at room temperature for 12 h. The reaction was then concentrated to dryness to afford (1R,2R)-2-pyridin-2-yl-cyclopropanecarboxylic acid ((2S,3S)-2-amino-3-methylpentyl)-(methoxymethyl-biphenyl-4-yl)-amide (18 mg, 95% yield) as a white solid.'
list rxn models
Lists all RXN AI models currently available.
Retrosynthesis
predict retrosynthesis '<smiles>' [ using (option1=<value> option2=<value>) ]
Perform a retrosynthesis route prediction on a molecule.
Note: The
using
clause requires all enclosed parameters to be defined in the same order as listed below.
Optional Parameters that can be specified in the using
clause:
availability_pricing_threshold=<int>
Maximum price in USD per g/ml of compounds. Default: no threshold.available_smiles='<smiles>.<smiles>.<smiles>'
List of molecules available as precursors, delimited with a period.exclude_smiles='<smiles>.<smiles>.<smiles>'
List of molecules to exlude from the set of precursors, delimited with a period.exclude_substructures='<smiles>.<smiles>.<smiles>'
List of substructures to excludefrom the set of precursors, delimited with a period.exclude_target_molecule=<boolean>
Excluded target molecule. The default is Truefap=<float>
Every retrosynthetic step is evaluated with the FAP, and is only retained when forward confidence is greater than the FAP value. The default is 0.6.max_steps=<int>
The maximum number steps in the results. The default is 3.nbeams=<int>
The maximum number of beams exploring the hypertree. The default is 10.pruning_steps=<int>
The number of steps to prune a hypertree. The default is 2.ai_model='<model_name>'
What model to use. Use the commandlist rxn models
to list all available models. The default is ‘2020-07-01’.
Example:
predict retrosynthesis 'BrCCc1cccc2c(Br)c3ccccc3cc12' using (max_steps=3)
Prediction
predict reaction in batch from dataframe <dataframe_name> | file '<filename.csv>' | list ['<smiles>.<smiles>','<smiles>.<smiles>'] [ using (ai_model='<ai_model>') ] [ use_saved ]
Run a batch of reaction predictions. The provided list of reactions can be specified as a DataFrame, a CSV file from your current workspace or a list of strings. When proving a DataFrame or CSV file, we will look for the “reactions” column.
Reactions are defined by combining two SMILES strings delimited by a period. For example: 'BrBr.c1ccc2cc3ccccc3cc2c1'
Optional Parameters that can be specified in the using
clause:
ai_model='<model_name>'
What model to use. Use the commandlist rxn models
to list all available models. The default is ‘2020-07-01’.
You can reuse previously generated results by appending the optional use_saved
clause. This will reuse the results of a previously run command with the same parameters, if available.
Examples:
predict reaction in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1']
predict reaction in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1'] use_saved
predict reaction '<smiles>.<smiles>' [ using (ai_model='<ai_model>') ] [ use_saved ]
Predict the reaction between two molecules.
Reactions are defined by combining two SMILES strings delimited by a period. For example: 'BrBr.c1ccc2cc3ccccc3cc2c1'
Optional Parameters that can be specified in the using
clause:
ai_model='<model_name>'
What model to use. Use the commandlist rxn models
to list all available models. The default is ‘2020-07-01’.
You can reuse previously generated results by appending the optional use_saved
clause. This will reuse the results of a previously run command with the same parameters, if available.
Examples:
predict reaction 'BrBr.c1ccc2cc3ccccc3cc2c1CCO'
predict reaction 'BrBr.c1ccc2cc3ccccc3cc2c1CCO' use_saved
predict reaction topn in batch from dataframe <dataframe_name> | file '<filename.csv>' | list ['<smiles>.<smiles>','<smiles>.<smiles>'] [ using (topn=<integer> ai_model='<ai_model>') ] [ use_saved ]
Run a batch of reaction predictions for topn. The provided list of reactions can be specified as a DataFrame, a CSV file from your current workspace or a list of strings. When proving a DataFrame or CSV file, we will look for the “reactions” column.
Reactions are defined by combining two SMILES strings delimited by a period. For example: 'BrBr.c1ccc2cc3ccccc3cc2c1'
Optional Parameters that can be specified in the using
clause:
ai_model='<model_name>'
What model to use. Use the commandlist rxn models
to list all available models. The default is ‘2020-07-01’.topn=<integer>
Defined the number of results being returned. The default value is 3.
You can reuse previously generated results by appending the optional use_saved
clause. This will reuse the results of a previously run command with the same parameters, if available.
Examples:
predict reaction topn in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1']
predict reaction topn in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1'] using (topn=6)
predict reaction topn in batch from list ['BrBr.c1ccc2cc3ccccc3cc2c1CCO' , 'BrBr.c1ccc2cc3ccccc3cc2c1'] use_saved