PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4AG055050.10
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family C2H2
Protein Properties Length: 1199aa    MW: 132136 Da    PI: 5.2767
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4AG055050.10genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.30.0002410891114123
                         EEET..TTTEEESSHHHHHHHHHH.T CS
            zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                         ++C+   C+++F ++ +L+ H r+ +
  TRIDC4AG055050.10 1089 FQCEidFCDMTFESRAELRAHERNiC 1114
                         89********************9877 PP

2zf-C2H210.80.001511421166123
                         EEET..TTTEEESSHHHHHHHHHHT CS
            zf-C2H2    1 ykCp..dCgksFsrksnLkrHirtH 23  
                         +kCp   Cg++F+     + Hir+H
  TRIDC4AG055050.10 1142 FKCPwdGCGMTFKWLWAQTEHIRVH 1166
                         89*********************99 PP

3zf-C2H214.10.0001411721198123
                         EEET..TTTEEESSHHHHHHHHHH..T CS
            zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                         y+C+  dCg++F+  s++ rH r+  H
  TRIDC4AG055050.10 1172 YECSvpDCGQTFRYVSDYSRHRRKfnH 1198
                         89999******************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1199 aa     Download sequence    
MFVHAASLAV LGEKTTLMSP EVIVAAGLPC CRLVQHPGEF VVTFPRAYHV GFSHGFNCGE  60
AANFATPQWL KFAKEAAVRR AVMNYLPMLS HQQLLYLLAV SFISRTPREL LYGIRTSRLR  120
DRRKEERELL VKREFLQDMI SENELLCAFL KKKLIENAVL WEPDLLPSST ALHSCSSGPK  180
APLKVDDVHS IESVPKENSS SDDIASRAGI QPKCMSMDSK SSDAMSAAEA QKLDTDTDDD  240
GDLPFDLSID SGSLTCVACG ILGFPFMAIL QPSKKALEDM SLVDIERFKL NCEKENHSNA  300
IPCSPDDSIS GHPVIAKRPS SPVAQSNFSH QNAESDKDGV GLDGPLLPHN NSAHSCNSEN  360
TLNPGINTET TETKIPSARF GIEFSKQTGR GDIDAQATES CGNTVDWNIT SAFVRPRIFC  420
LQHALEIEEL LEGKGGAHAL IICHADYTKL KALAISIAEE IEFQFDCKDV PLANASKSDL  480
HLINISIDDE GYKEDERDWT TQMGLNMKYF AKLRKETPGC QEQPPLSFWK RLDISDKPSP  540
ISVVPNLKWL CRRARTPYRV VGYAASRNAT VGPDVVSPAV TKAEMGTSGN AYENAKEQRT  600
GEQDAPLEPS RLQEADDVAD MHTCSEDIDQ DMHCLIGSKR QRTAEQNAPL QPSRLQEADD  660
VVDMHMCSVD NDQDMHRLIG IPVAAAEYPM THQVCEGTVS VSTCELDDLV SASTSDDPIC  720
SAHSQDSPGV SDDFTTEQQC VQSDELTSSV AMSAQQFLVD GSMTAEDSSN HENLGSYNVT  780
SECKDKQLQV QQEQENIELC NNAGRNLAAA VQVNSGHFGD KAVNLKSAIP TESQHEYPKR  840
DAIVLEGMQA ALTTVVSGEN RNSVNTELDS LGILLGALAE ESILADVPGK DEVDDASLTL  900
MTLASIDQSA GDVAHNEVIE TSSSSVGASI SCKGRTLSNL ASDGSLRIQN AEIQNKQENA  960
EEVGAWNCQG LKNSRGILDS SANSLSDTGK SSGTPKAYQP DILSRSIGSS KRRSIICYVR  1020
RKRKQKRKRE SELSTSNSQS FGSFARAPCE RLRPRRKPAV IEEPAEQIET AKPSAAATKG  1080
KRSKVVELFQ CEIDFCDMTF ESRAELRAHE RNICTDESCG KRFQSHKYLK RHQCVHRDER  1140
PFKCPWDGCG MTFKWLWAQT EHIRVHTGER PYECSVPDCG QTFRYVSDYS RHRRKFNHY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110111027KRRSIICYVRRKRKQKR
210111029KRRSIICYVRRKRKQKRKR
310191026VRRKRKQK
410211030RKRKQKRKRE
510231032RKRKQKRKRE
610531057RPRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.12e-59C2H2 family protein