PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_020686971.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Orchidaceae; Epidendroideae; Malaxideae; Dendrobiinae; Dendrobium
Family C2H2
Protein Properties Length: 376 aa; MW: 41290.9 Da; pI: 9.4707
Description C2H2 family protein
Gene Model
Gene Model ID   Type    Source
XP_020686971.1  genome  NCBI
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  18.2   6.7e-06  56     78   1          23
                    EEETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
                    y+C +C++ F r  nL+ H r+H
  XP_020686971.1 56 YVCDICNQGFQRDQNLQMHRRRH 78
                    9*********************9 PP

2    zf-C2H2  12.1   0.0006   141    162  2          23
                     EETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2   2 kCpdCgksFsrksnLkrHirtH 23 
                      C  C+k +  +s++k H++t+
  XP_020686971.1 141 ACARCSKAYAVHSDYKAHLKTC 162
                     6*******************98 PP

Sequence
Protein Sequence    Length: 376 aa
MFRIATSQLP PEEFAPPSSA ENAVTHKRKR RPAGTPDPDA EVISLSPRTL LESERYVCDI  60
CNQGFQRDQN LQMHRRRHKV PWKLTVKSNL ASSGAAAAAK KRVFVCPEAT CLHHHPCHAL  120
GDLVGIKKHF RRKHSANRQW ACARCSKAYA VHSDYKAHLK TCGTRGHSCD CGRVFSRVET  180
FIEHQDSCTA TKTREPPPPP VAPPSNHSHS TSSPTTSDDD TIPDPPPPPP PLLPFILAPP  240
NHNKEALSLL PFLPHHPSIP VNTADLHLSI APPASPSQQA KTAKAAADKA REEALRDVHL  300
AERELVAARR VRQQAQAELE KACQLRTAAA RQIHSILLQI TCGDCRQRML RNSAPLAGGN  360
SFSGDDESSG HKSLRL
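
The coordinates reported in this record (domain Start/End, NLS Start/End) are 1-based and inclusive, which can be checked against the sequence block itself. The following is a minimal Python sketch, not part of PlantTFDB; the helper names `clean_sequence` and `region` are hypothetical and introduced only for illustration. It strips the spacing and running position counts from the block above, then slices out the regions listed in the record.

```python
import re

# Sequence block copied from the record above, including the spaces and
# running position counts added by the page layout.
RAW = """
MFRIATSQLP PEEFAPPSSA ENAVTHKRKR RPAGTPDPDA EVISLSPRTL LESERYVCDI  60
CNQGFQRDQN LQMHRRRHKV PWKLTVKSNL ASSGAAAAAK KRVFVCPEAT CLHHHPCHAL  120
GDLVGIKKHF RRKHSANRQW ACARCSKAYA VHSDYKAHLK TCGTRGHSCD CGRVFSRVET  180
FIEHQDSCTA TKTREPPPPP VAPPSNHSHS TSSPTTSDDD TIPDPPPPPP PLLPFILAPP  240
NHNKEALSLL PFLPHHPSIP VNTADLHLSI APPASPSQQA KTAKAAADKA REEALRDVHL  300
AERELVAARR VRQQAQAELE KACQLRTAAA RQIHSILLQI TCGDCRQRML RNSAPLAGGN  360
SFSGDDESSG HKSLRL
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counts, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)

print(len(seq))               # total length in residues
print(region(seq, 56, 78))    # zf-C2H2 domain 1
print(region(seq, 141, 162))  # zf-C2H2 domain 2
print(region(seq, 27, 32))    # NLS
```

Run as written, this prints 376, followed by YVCDICNQGFQRDQNLQMHRRRH, ACARCSKAYAVHSDYKAHLKTC and KRKRRP, which agree with the length, the two domain alignments and the NLS entry given in the record.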
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    27     32   KRKRRP
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G01940.1  1e-100   C2H2 family protein