PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID TRIDC4BG053560.3
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family C2H2
Protein Properties Length: 1332 aa    MW: 147585 Da    PI: 5.3475
Description C2H2 family protein
Gene Model
Gene Model ID     Type    Source
TRIDC4BG053560.3  genome  EnsemblPlants
Signature Domain
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  13.4   0.00022  1222   1247  1          23
                        EEET..TTTEEESSHHHHHHHHHH.T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                        ++C+   C+++F ++ +L+ H r+ +
  TRIDC4BG053560.3 1222 FQCEidFCDMTFESRADLRAHERNiC 1247
                        89********************9877 PP

2    zf-C2H2  10.7   0.0017   1275   1299  1          23
                        EEET..TTTEEESSHHHHHHHHHHT CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirtH 23  
                        +kCp   Cg++F+     + Hir+H
  TRIDC4BG053560.3 1275 FKCPweGCGMTFKWLWAQTEHIRVH 1299
                        89*********************99 PP

3    zf-C2H2  11.1   0.0013   1305   1331  1          23
                        EEET..TTTEEESSHHHHHHHHHH..T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                        y+C    Cg++F+  s++ rH r+  H
  TRIDC4BG053560.3 1305 YECLveGCGQTFRYVSDYSRHRRKfnH 1331
                        889999*****************9777 PP

Sequence
Protein Sequence    Length: 1332 aa
KASADRPIYI EYANDVPGSG FAASAQSHRR PHKKRRREGA PVDEGEKATG WKLSCSPWNL  60
QAIARAPGSL TRFMPDDVPG VTSPMVYIGM LFSWFAWHIE DHELHSLNFL HTGAPKTWYA  120
VPGDRAAELE EVIRVHGYGG NPDRLASLAV LGEKTTLMSP EVIVASGLPC CRLVQHPGEF  180
VVTFPRAYHV GFSHGFNCGE AANFATPQWL KFAKEAAVRR AVMNYLPMLS HQQLLYLLAV  240
SFISRTPREL LYGIRTSRLR DRRKEERELL VKREFLQDMI SENELLCSFL KKKLIDNAVL  300
WEPDLLPSST ALHSCSSGPK APLKVDDVHS IESVPKENCS SDDIASRAGI QPKCMSMDSK  360
SSDAMSTSEA QKLDTDTDDD GDLPFDLSID SGSLTCVACG ILGFPFMAIL QPSKKALEDM  420
SLVDIERFKL NCEKENHSNA IPCSPDDGNS GHPVIAKRPS SPVAESNFSH QNAESDKDGV  480
GLDGPLLPHN NSSHSCSSEN TLNPCINTET TETKIPSARF GIEFSKQTGR GDIDAQATES  540
CGNTVDWNIT SAFVRPRIFC LQHALEIEEL LEGRGGVHAL IICHADYTKL KALAISIAEE  600
IEFQFDCKDV PLVNASKSDL HLINISIDDE GYKEDERDWT TQMGLNMKYF AKLRKETPGC  660
QEQPPLSFWK RLDISDKPLP ISVVPNLKWL CRRARTPYRV VGYAANRNAT VGPDVVSPAV  720
TKAEMGTSGN AYENAKEQRT AEQDALLEPS RLQEADDVAD MHTCSEDIDQ DMHCLIGSKR  780
QRTAEQDAPL QPSRLQEADD VVDMHTCSVD NDQDMHRLIG IPVAVAEYPM VHQVCEGTVS  840
VSTCELDDLV SASTSDDSVC SAYSQDSPGV SDDFTTEQKC VQSDELTSSV AMSVQQFLLD  900
ESMTAEDSSN QEKLGSYNVT SECKDKQLQV QQEQENIELC NNAGRNMATV VQVDSSHFPD  960
KAVNLKSAIP TESQHEYPKR DAIVLEGMQA ALTTVVSGEN RNSVHTELDS LGILLGALAE  1020
ESILADVPGK DEVDDASLTL MTLASIDQSA GDVAHNEVIE TSSSSIGASL SCRGRTLTNL  1080
ASDGSLRIQN AEIQNKQENA EEVDAWNCQG WKSSRGVLDS SANSLSETGK SSGTPNTYQP  1140
DILSRSIGSS KRTSIICYVR RKRKQKRKRE SQSVGSFARA PCERLRPRTK RAVIEEPAEQ  1200
IETAKPSAAA TKGKRSKVVE LFQCEIDFCD MTFESRADLR AHERNICTDE SCGKRFQSHK  1260
YLKRHQCVHR DERPFKCPWE GCGMTFKWLW AQTEHIRVHT GERPYECLVE GCGQTFRYVS  1320
DYSRHRRKFN HY
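
The Start and End columns of the Signature Domain table are 1-based, inclusive positions in this protein sequence. A minimal sketch of how such coordinates can be mapped back onto the blocked listing is given here; the helper names `parse_blocked_sequence` and `slice_1based` are hypothetical and are not part of PlantTFDB. For brevity, only the last four lines of the listing are used, so the first residue in the snippet is assumed to be position 1141.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one string.

    Keeps purely alphabetic tokens (the 10-residue blocks) and drops
    the running position counters at the end of each line.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():
                residues.append(token.upper())
    return "".join(residues)


def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; it is 1 for a full
    sequence and larger when only a tail of the listing is parsed.
    """
    return seq[start - offset : end - offset + 1]


# Last four lines of the listing above (assumed to start at position 1141).
tail = """
DILSRSIGSS KRTSIICYVR RKRKQKRKRE SQSVGSFARA PCERLRPRTK RAVIEEPAEQ  1200
IETAKPSAAA TKGKRSKVVE LFQCEIDFCD MTFESRADLR AHERNICTDE SCGKRFQSHK  1260
YLKRHQCVHR DERPFKCPWE GCGMTFKWLW AQTEHIRVHT GERPYECLVE GCGQTFRYVS  1320
DYSRHRRKFN HY
"""
seq_tail = parse_blocked_sequence(tail)

# Coordinates taken from the Signature Domain table.
zf1 = slice_1based(seq_tail, 1222, 1247, offset=1141)
zf2 = slice_1based(seq_tail, 1275, 1299, offset=1141)
zf3 = slice_1based(seq_tail, 1305, 1331, offset=1141)
print(zf1, zf2, zf3, sep="\n")
```

The three slices can be compared with the query rows of the domain alignments, where lowercase letters mark residues aligned to insert positions of the zf-C2H2 model.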
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1159   1166  VRRKRKQK
2    1161   1170  RKRKQKRKRE
3    1163   1172  RKRKQKRKRE
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G04240.1  1e-126   C2H2 family protein