PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.008G014801.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family C2H2
Protein Properties Length: 866aa    MW: 97478.9 Da    PI: 7.841
Description C2H2 family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.008G014801.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  11.1   0.0012   557    579  1          23
                           EEETTTTEEESSHHHHHHHHHHT CS
               zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                           ++C+ Cg +F  + +Lk+H++ H
  Sobic.008G014801.1.p 557 HRCQECGSCFQKPAHLKQHMQGH 579
                           79******************987 PP

2    zf-C2H2  22.2   3.6e-07  585    609  1          23
                           EEET..TTTEEESSHHHHHHHHHHT CS
               zf-C2H2   1 ykCp..dCgksFsrksnLkrHirtH 23 
                           + Cp  dC+ s++rk++L+rH+ +H
  Sobic.008G014801.1.p 585 FMCPieDCPFSYKRKDHLNRHMLKH 609
                           79*********************99 PP

3    zf-C2H2  15.8   4.1e-05  614    639  1          23
                           EEET..TTTEEESSHHHHHHHHHH.T CS
               zf-C2H2   1 ykCp..dCgksFsrksnLkrHirt.H 23 
                           ++C+   C+++Fs k n++rH +  H
  Sobic.008G014801.1.p 614 FSCTvdGCDRRFSMKANMQRHVKEiH 639
                           899999****************9988 PP

4    zf-C2H2  15.3   5.6e-05  651    675  1          23
                           EEET..TTTEEESSHHHHHHHHHHT CS
               zf-C2H2   1 ykCp..dCgksFsrksnLkrHirtH 23 
                           + C    C+k+F+ +s Lk+H  +H
  Sobic.008G014801.1.p 651 FICReeGCNKVFRYSSKLKKHEESH 675
                           789999***************9988 PP

5    zf-C2H2  13.2   0.00027  742    766  2          23
                           EET..TTTEEESSHHHHHHHHHH.T CS
               zf-C2H2   2 kCp..dCgksFsrksnLkrHirt.H 23 
                           kC+   C  sFs+ksnL++H +  H
  Sobic.008G014801.1.p 742 KCTfeGCEHSFSNKSNLTKHVKAcH 766
                           799999***************9955 PP
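
The five zinc-finger hits above can be cross-checked against a simple sequence pattern. The sketch below is illustrative only: it assumes a regular expression modelled on the PROSITE PS00028 pattern (C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H), and the names `C2H2`, `DOMAINS` and `motif_span` are hypothetical helpers, not part of PlantTFDB. The segments are the query lines of the alignments, upper-cased.

```python
import re

# Assumed C2H2 motif, modelled on the PROSITE PS00028 pattern:
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
C2H2 = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# (start position, query segment) copied from the five alignments above.
DOMAINS = [
    (557, "HRCQECGSCFQKPAHLKQHMQGH"),
    (585, "FMCPIEDCPFSYKRKDHLNRHMLKH"),
    (614, "FSCTVDGCDRRFSMKANMQRHVKEIH"),
    (651, "FICREEGCNKVFRYSSKLKKHEESH"),
    (742, "KCTFEGCEHSFSNKSNLTKHVKACH"),
]


def motif_span(start, segment):
    """Return the 1-based (first, last) protein positions of the motif, or None."""
    match = C2H2.search(segment)
    if match is None:
        return None
    # match.end() is exclusive, so subtract 1 for an inclusive end position.
    return (start + match.start(), start + match.end() - 1)


for start, segment in DOMAINS:
    print(start, motif_span(start, segment))
```

Because each segment is searched in isolation, the first finger resolves to 559-579 here; the 559-581 span reported for the PROSITE pattern under Protein Features is consistent with the final x(3,5) gap reaching the next histidine in the full-length sequence.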

Protein Features
Database         Entry ID           E-value  Start  End  InterPro ID  Description
PROSITE profile  PS51375            5.305    35     65   IPR002885    Pentatricopeptide repeat
Gene3D           G3DSA:1.25.40.10   2.5E-11  44     163  IPR011990    Tetratricopeptide-like helical domain
PROSITE profile  PS51375            8.67     66     100  IPR002885    Pentatricopeptide repeat
PROSITE profile  PS51375            8.802    136    170  IPR002885    Pentatricopeptide repeat
Pfam             PF01535            7.2E-5   141    164  IPR002885    Pentatricopeptide repeat
TIGRFAMs         TIGR00756          3.2E-4   141    164  IPR002885    Pentatricopeptide repeat
SuperFamily      SSF48452           3.21E-5  147    174  IPR011990    Tetratricopeptide-like helical domain
Pfam             PF01535            0.15     169    193  IPR002885    Pentatricopeptide repeat
PROSITE profile  PS51375            10.03    198    232  IPR002885    Pentatricopeptide repeat
Pfam             PF01535            3.2E-4   200    229  IPR002885    Pentatricopeptide repeat
TIGRFAMs         TIGR00756          5.6E-5   200    233  IPR002885    Pentatricopeptide repeat
Gene3D           G3DSA:1.25.40.10   2.5E-11  250    335  IPR011990    Tetratricopeptide-like helical domain
PROSITE profile  PS51375            8.681    268    298  IPR002885    Pentatricopeptide repeat
TIGRFAMs         TIGR00756          0.0012   273    298  IPR002885    Pentatricopeptide repeat
SuperFamily      SSF48452           3.21E-5  289    329  IPR011990    Tetratricopeptide-like helical domain
Pfam             PF13041            6.7E-12  298    346  IPR002885    Pentatricopeptide repeat
PROSITE profile  PS51375            12.617   299    333  IPR002885    Pentatricopeptide repeat
TIGRFAMs         TIGR00756          1.0E-9   301    335  IPR002885    Pentatricopeptide repeat
PROSITE profile  PS51375            9.591    334    364  IPR002885    Pentatricopeptide repeat
TIGRFAMs         TIGR00756          5.7E-4   336    363  IPR002885    Pentatricopeptide repeat
PROSITE profile  PS51375            6.38     370    400  IPR002885    Pentatricopeptide repeat
Pfam             PF01535            0.017    373    397  IPR002885    Pentatricopeptide repeat
SuperFamily      SSF48452           3.21E-5  400    465  IPR011990    Tetratricopeptide-like helical domain
PROSITE profile  PS51375            5.338    402    432  IPR002885    Pentatricopeptide repeat
Gene3D           G3DSA:1.25.40.10   2.5E-11  403    466  IPR011990    Tetratricopeptide-like helical domain
PROSITE profile  PS51375            7.311    436    470  IPR002885    Pentatricopeptide repeat
SMART            SM00355            22       511    533  IPR015880    Zinc finger, C2H2-like
Gene3D           G3DSA:3.30.160.60  1.1E-4   556    578  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SMART            SM00355            0.1      557    579  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            12.258   557    584  IPR007087    Zinc finger, C2H2
PROSITE pattern  PS00028            0        559    581  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           2.54E-9  570    611  No hit       No description
Gene3D           G3DSA:3.30.160.60  4.9E-10  579    611  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            9.182    585    614  IPR007087    Zinc finger, C2H2
SMART            SM00355            1.6E-4   585    609  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        587    609  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           4.63E-8  595    644  No hit       No description
Gene3D           G3DSA:3.30.160.60  2.3E-8   612    641  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            11.759   614    644  IPR007087    Zinc finger, C2H2
SMART            SM00355            3.0E-4   614    639  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        616    639  IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.30.160.60  2.2E-7   649    675  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            10.284   651    675  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0085   651    675  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        653    675  IPR007087    Zinc finger, C2H2
SMART            SM00355            5.3      683    708  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        685    708  IPR007087    Zinc finger, C2H2
SMART            SM00355            14       711    732  IPR015880    Zinc finger, C2H2-like
SuperFamily      SSF57667           2.49E-8  726    783  No hit       No description
SMART            SM00355            0.047    741    766  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            11.489   741    771  IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.30.160.60  3.7E-6   742    766  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE pattern  PS00028            0        743    766  IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.30.160.60  2.2E-5   767    794  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            9.515    772    798  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.91     772    798  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        774    798  IPR007087    Zinc finger, C2H2
Gene Ontology
GO Term     GO Category         GO Description
GO:0005730  Cellular Component  nucleolus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0005515  Molecular Function  protein binding
GO:0008097  Molecular Function  5S rRNA binding
GO:0046872  Molecular Function  metal ion binding
GO:0080084  Molecular Function  5S rDNA binding
Sequence
Protein Sequence    Length: 866 aa
MNRSVCHHLL AQCKKLRELQ RIHALAIAHG LHPHQQSVSC KIFRCYADFG RVADARKLFD  60
EIPNPDLISF TSLMSLHLQL DNQREAISLF ARVVAAGHRP DGFAVVGALS ASSGAGDQVV  120
GRAVHGLIFR LGLDGEVVVG NALIDMYSQC GKFESAVKVF DRMSLKDEVT WGSMLHGYIK  180
CAGVDSALSF FDQVPVRSVV AWTALITGHV QGRQPVRALE LFGRMVLEGH RPTHVTIVGV  240
LSACADIGAL DLGRVIHGYG SKCNASLNII VSNALMDMYA KSGHIEMAFS VFQEVQSKDS  300
FTWTTMISCC TVQGDGKKAL ELFQDMLRAG VVPNSVTFVS VLSACSHSGL IEEGIELFDR  360
MRQLYKIDPL LEHYGCMIDL LGRGGLLEEA EALIADMNVE PDIVIWRSLL SACLVRGNNR  420
LAEIAGKEIV KREPGDDGVY VLLWNMYASS NKWREAREMR QQMLSLKIFK KPGCSWIEID  480
GAVHEFLMCS VVDVDGDASV GQKGYRDIRR YKCEFCTVVR SKKCLIQAHM VAHHKDELDK  540
SEIYNSNGEK IVHEEEHRCQ ECGSCFQKPA HLKQHMQGHS HERLFMCPIE DCPFSYKRKD  600
HLNRHMLKHE GKLFSCTVDG CDRRFSMKAN MQRHVKEIHE DENASKSNQQ FICREEGCNK  660
VFRYSSKLKK HEESHVKLDY VEVLCGEPGC MKMFTNVEYL RAHSQSCHQY VQCEICGEKH  720
LKKNIKRHLQ SHDKAPSGER MKCTFEGCEH SFSNKSNLTK HVKACHDQLK PFKCQIAGCG  780
KAFTYKHVRD NHEKSGAHVY IEGDFEEIDE QLRARPRGGR KRKALTVETL TRKRVTIPGE  840
ASSLDDGEEY LRWLLSGGDS SRETQ*
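
For downstream tools the numbered block above usually has to be collapsed into a plain residue string. A minimal sketch, assuming the layout shown here (ten-residue groups, a running position counter at the end of each full line, and a trailing `*` marking the stop); `clean_sequence` is a hypothetical helper name, not a PlantTFDB function.

```python
import re


def clean_sequence(block: str) -> str:
    """Collapse a numbered, space-grouped sequence block into one residue string."""
    # Keep only letters and '*'; this drops position counters and whitespace.
    residues = re.sub(r"[^A-Za-z*]", "", block)
    # Remove the trailing stop marker, if present.
    return residues.rstrip("*").upper()


# Last two lines of the record above, used as a small example input.
tail = """
KAFTYKHVRD NHEKSGAHVY IEGDFEEIDE QLRARPRGGR KRKALTVETL TRKRVTIPGE  840
ASSLDDGEEY LRWLLSGGDS SRETQ*
"""

seq = clean_sequence(tail)
print(len(seq), seq[-5:])
```

Passing the full fifteen-line block instead of `tail` yields the complete protein sequence in the same way.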
3D Structure
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5iww_D6e-54941315331PLS9-PPR
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    814    822  RPRGGRKRK
Binding Motif
Motif ID  Method  Source                   Motif file
MP00229   DAP     Transfer from AT1G72050  Download
Cis-element
Source       Link
PlantRegMap  Sobic.008G014801.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  FN431668  0.0      FN431668.1 Saccharum hybrid cultivar BAC clone Sh253G12, cultivar R570.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002441722.2  0.0      pentatricopeptide repeat-containing protein At2g22410, mitochondrial isoform X1
TrEMBL  A0A1Z5R4A5      0.0      A0A1Z5R4A5_SORBI; Uncharacterized protein
STRING  Si020193m       0.0      (Setaria italica)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP23823882
Representative plantOGRP29811627
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G72050.2  1e-102   transcription factor IIIA