How would one parse the SMILES string
BrC[2]:C[3]:C(:CH:CH:CH:@2):CH:CH:CH:CH:@3
I rely on tools like rdkit and OpenBabel to parse SMILES, but both tools aren't able to parse this string.
More specifically, this SMILES string comes from the supplementary information of a 2007 paper on solubility prediction. After doing some searching, it seems that this paper was received prior to the 2007 standardization of OpenSMILES (this might be why canonical parsers like rdkit have trouble with it).
After searching a bit more, I found an older PubChem sketcher that was able to convert the SMILES in question into a more recognizable format
(BrC1=C(C(=C(C2=C1C(=C(C(=C2[H])[H])[H])[H])[H])[H])[H])
(parsing this was successful via rdkit, see molecule image below) but still need a way to convert similarly-formatted SMILES strings into the OpenSMILES standard/canonical format en mass.
![image for BrC1=C(C(=C(C2=C1C(=C(C(=C2[H])[H])[H])[H])[H])[H])[H]](../../images/3800501320.webp)