SEQ2FUN

BioLiP

PDB CCD ID: L6M
Number of entries in BioLiP: 4
Chemical formula: C30 H37 N5 O10
InChI: InChI=1S/C30H37N5O10/c31-23-10-11-35(30(44)34-23)26-14-21(36)22(45-26)15-32-24(37)9-8-20(29(42)43)33-25(38)13-18-2-1-3-19(18)28(41)17-6-4-16(5-7-17)12-27(39)40/h4-7,10-11,18-22,26,36H,1-3,8-9,12-15H2,(H,32,37)(H,33,38)(H,39,40)(H,42,43)(H2,31,34,44)/t18-,19+,20-,21-,22+,26+/m0/s1
InChIKey: JVRVDICAIPFWFZ-BONAQRLYSA-N
SMILES:
SoftwareSMILES
OpenEye OEToolkits 2.0.7c1cc(ccc1CC(=O)O)C(=O)C2CCCC2CC(=O)NC(CCC(=O)NCC3C(CC(O3)N4C=CC(=NC4=O)N)O)C(=O)O
CACTVS 3.385NC1=NC(=O)N(C=C1)[C@H]2C[C@H](O)[C@@H](CNC(=O)CC[C@H](NC(=O)C[C@@H]3CCC[C@H]3C(=O)c4ccc(CC(O)=O)cc4)C(O)=O)O2
ACDLabs 12.01C(CC(C(=O)O)NC(CC2C(C(c1ccc(CC(=O)O)cc1)=O)CCC2)=O)C(NCC4C(CC(N3C=CC(=NC3=O)N)O4)O)=O
OpenEye OEToolkits 2.0.7c1cc(ccc1CC(=O)O)C(=O)C2CCCC2CC(=O)N[C@@H](CCC(=O)NC[C@@H]3[C@H](C[C@@H](O3)N4C=CC(=NC4=O)N)O)C(=O)O
CACTVS 3.385NC1=NC(=O)N(C=C1)[CH]2C[CH](O)[CH](CNC(=O)CC[CH](NC(=O)C[CH]3CCC[CH]3C(=O)c4ccc(CC(O)=O)cc4)C(O)=O)O2
Name:5'-{[N-({(1S,2R)-2-[4-(carboxymethyl)benzene-1-carbonyl]cyclopentyl}acetyl)-L-gamma-glutamyl]amino}-2',5'-dideoxycytidine

Reference:
  • Chengxin Zhang, Xi Zhang, Lydia Freddolino, and Yang Zhang. BioLiP2: an updated structure database for biologically relevent ligand-protein interactions, Nucleic Acids Research, gkad630 (2023).
  • Jianyi Yang, Ambrish Roy, and Yang Zhang. BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions, Nucleic Acids Research, 41: D1096-D1103 (2013) (download the PDF file).