PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4AG055050.9
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family C2H2
Protein Properties Length: 1181aa    MW: 130608 Da    PI: 5.067
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4AG055050.9genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.40.0002410711096123
                        EEET..TTTEEESSHHHHHHHHHH.T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                        ++C+   C+++F ++ +L+ H r+ +
  TRIDC4AG055050.9 1071 FQCEidFCDMTFESRAELRAHERNiC 1096
                        89********************9877 PP

2zf-C2H210.90.001511241148123
                        EEET..TTTEEESSHHHHHHHHHHT CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirtH 23  
                        +kCp   Cg++F+     + Hir+H
  TRIDC4AG055050.9 1124 FKCPwdGCGMTFKWLWAQTEHIRVH 1148
                        89*********************99 PP

3zf-C2H214.10.0001411541180123
                        EEET..TTTEEESSHHHHHHHHHH..T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                        y+C+  dCg++F+  s++ rH r+  H
  TRIDC4AG055050.9 1154 YECSvpDCGQTFRYVSDYSRHRRKfnH 1180
                        89999******************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1181 aa     Download sequence    
GQFILRDHQI MYSVETEDLH LCILIVLCSS SKWSDQYITG FNCGEAANFA TPQWLKFAKE  60
AAVRRAVMNY LPMLSHQQLL YLLAVSFISR TPRELLYGIR TSRLRDRRKE ERELLVKREF  120
LQDMISENEL LCAFLKKKLI ENAVLWEPDL LPSSTALHSC SSGPKAPLKV DDVHSIESVP  180
KENSSSDDIA SRAGIQPKCM SMDSKSSDAM SAAEAQKLDT DTDDDGDLPF DLSIDSGSLT  240
CVACGILGFP FMAILQPSKK ALEDMSLVDI ERFKLNCEKE NHSNAIPCSP DDSISVIAKR  300
PSSPVAQSNF SHQNAESDKD GVGLDGPLLP HNNSAHSCNS ENTLNPGINT ETTETKIPSA  360
RFGIEFSKQT GRGDIDAQAT ESCGNTVDWN ITSAFVRPRI FCLQHALEIE ELLEGKGGAH  420
ALIICHADYT KLKALAISIA EEIEFQFDCK DVPLANASKS DLHLINISID DEGYKEDERD  480
WTTQMGLNMK YFAKLRKETP GCQEQPPLSF WKRLDISDKP SPISVVPNLK WLCRRARTPY  540
RVVGYAASRN ATVGPDVVSP AVTKAEMGTS GNAYENAKEQ RTGEQDAPLE PSRLQEADDV  600
ADMHTCSEDI DQDMHCLIGS KRQRTAEQNA PLQPSRLQEA DDVVDMHMCS VDNDQDMHRL  660
IGIPVAAAEY PMTHQVCEGT VSVSTCELDD LVSASTSDDP ICSAHSQDSP GVSDDFTTEQ  720
QCVQSDELTS SVAMSAQQFL VDGSMTAEDS SNHENLGSYN VTSECKDKQL QVQQEQENIE  780
LCNNAGRNLA AAVQVNSGHF GDKAVNLKSA IPTESQHEYP KRDAIVLEGM QAALTTVVSG  840
ENRNSVNTEL DSLGILLGAL AEESILADVP GKDEVDDASL TLMTLASIDQ SAGDVAHNEV  900
IETSSSSVGA SISCKGRTLS NLASDGSLRI QNAEIQNKQE NAEEVGAWNC QGLKNSRGIL  960
DSSANSLSDT GKSSGTPKAY QPDILSRSIG SSKRRSIICY VRRKRKQKRK RESELSTSNS  1020
QSFGSFARAP CERLRPRRKP AVIEEPAEQI ETAKPSAAAT KGKRSKVVEL FQCEIDFCDM  1080
TFESRAELRA HERNICTDES CGKRFQSHKY LKRHQCVHRD ERPFKCPWDG CGMTFKWLWA  1140
QTEHIRVHTG ERPYECSVPD CGQTFRYVSD YSRHRRKFNH Y
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
19931009KRRSIICYVRRKRKQKR
29931011KRRSIICYVRRKRKQKRKR
310011008VRRKRKQK
410031012RKRKQKRKRE
510051014RKRKQKRKRE
610351039RPRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.12e-50C2H2 family protein