PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_024987011.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family C2H2
Protein Properties Length: 508aa    MW: 56310.8 Da    PI: 9.2016
Description C2H2 family protein
Gene Model
Gene Model ID    Type    Source
XP_024987011.1   genome  NCBI
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  14     0.00014  91     111  3          23
                     ETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2   3 CpdCgksFsrksnLkrHirtH 23 
                     C++C+k F r  nL+ H r H
  XP_024987011.1  91 CEICNKGFQRDQNLQLHRRGH 111
                     *******************88 PP

2    zf-C2H2  12.4   0.00047  166    188  1          23
                     EEETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                     +kC++C+k++  +s+ k H + +
  XP_024987011.1 166 WKCEKCSKRYAVQSDWKAHSKIC 188
                     58*****************9876 PP

Sequence
Protein Sequence    Length: 508 aa
MKGMFLDDSM SNLTSASNEA SLSSSSNKNE VGTMYPPPQQ MQQSFASVPI NSNNQTQTNK  60
KKRNLPGNPD PEAEVVAMSP KSLMATNRFL CEICNKGFQR DQNLQLHRRG HNLPWKLKQR  120
SKQEVVRKKV YVCPEPSCVH HEPSRALGDL TGIKKHFSRK HGEKKWKCEK CSKRYAVQSD  180
WKAHSKICGT REHRCDCGTL FSRRDSFITH RAFCGALAGE NARSSSSSLL SPNHLPLHLP  240
INYPLKSEPH LFQRPQVLNS NIITNTLQNH QLPSWFVHHH QENPSPNNNN NNNNLHLPSP  300
SSPPHMSATA LLQKAAQMGV TMSKPAASTN TAIVMSQQQQ QQHQSIPNGP HQNDHMCAST  360
LLTAHHHDSG LENLSCSDDD HHAMLGLKGS TALQEAFNGM LNSTKGIHNT GFQEAFFRQA  420
SEASLNPTKG GGGGGRGGGG GGNDELTRDF LGLTGFPPPT ADHHHHLLNM AGLDPLNTSP  480
YHHHQQQQQQ QQQQHNLNQN QNQLPWNG
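
The domain coordinates listed under Signature Domain can be cross-checked against the protein sequence above. The following is a minimal sketch, not part of the PlantTFDB record: the helper names clean_sequence and region are hypothetical, and it assumes that Start and End are 1-based, inclusive residue positions, which is consistent with the alignments shown for both zf-C2H2 hits.

```python
# Sequence blocks copied from the record above. The spaces and the trailing
# position counters (60, 120, ...) are layout only and are stripped below.
RAW = """
MKGMFLDDSM SNLTSASNEA SLSSSSNKNE VGTMYPPPQQ MQQSFASVPI NSNNQTQTNK  60
KKRNLPGNPD PEAEVVAMSP KSLMATNRFL CEICNKGFQR DQNLQLHRRG HNLPWKLKQR  120
SKQEVVRKKV YVCPEPSCVH HEPSRALGDL TGIKKHFSRK HGEKKWKCEK CSKRYAVQSD  180
WKAHSKICGT REHRCDCGTL FSRRDSFITH RAFCGALAGE NARSSSSSLL SPNHLPLHLP  240
INYPLKSEPH LFQRPQVLNS NIITNTLQNH QLPSWFVHHH QENPSPNNNN NNNNLHLPSP  300
SSPPHMSATA LLQKAAQMGV TMSKPAASTN TAIVMSQQQQ QQHQSIPNGP HQNDHMCAST  360
LLTAHHHDSG LENLSCSDDD HHAMLGLKGS TALQEAFNGM LNSTKGIHNT GFQEAFFRQA  420
SEASLNPTKG GGGGGRGGGG GGNDELTRDF LGLTGFPPPT ADHHHHLLNM AGLDPLNTSP  480
YHHHQQQQQQ QQQQHNLNQN QNQLPWNG
"""


def clean_sequence(raw: str) -> str:
    """Keep only letters, dropping spaces, newlines, and position counters."""
    return "".join(ch for ch in raw if ch.isalpha())


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

print(len(SEQ))              # 508, the length stated in the record
print(region(SEQ, 91, 111))  # CEICNKGFQRDQNLQLHRRGH (zf-C2H2 hit 1)
print(region(SEQ, 166, 188)) # WKCEKCSKRYAVQSDWKAHSKIC (zf-C2H2 hit 2)
```

Both slices reproduce the query lines of the two alignments, which supports reading the coordinates as 1-based and inclusive.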
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    116    129  KLKQRSKQEVVRKK
2    118    131  KLKQRSKQEVVRKK
3    432    442  GGGGRGGGGGG
4    433    443  GGGGRGGGGGG
5    434    444  GGGGRGGGGGG
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G55110.1  1e-104   C2H2 family protein