PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021295287.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Herrania
Family C2H2
Protein Properties Length: 1584aa    MW: 175762 Da    PI: 6.6345
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021295287.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H212.90.0003314691494123
                      EEET..TTTEEESSHHHHHHHHHH.T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                      ++C    C++sF +k++L+ H r+ +
  XP_021295287.1 1469 HRCDleGCNMSFETKEELRLHKRNrC 1494
                      789999****************9877 PP

2zf-C2H210.90.001514931516223
                      EET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                      +Cp   Cgk+F+++ +   H+r+H
  XP_021295287.1 1493 RCPyeGCGKRFRSHKYAILHQRVH 1516
                      69999*****************99 PP

3zf-C2H211.40.0009815521578123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_021295287.1 1552 YKCKvvGCGLSFRFVSDFSRHRRKtgH 1578
                      9**********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1584 aa     Download sequence    
MGNVEIPNWL KGLPLAPEFR PTDTEFADPI AYISKIEKEA GAYGICKIIP PLPKPSKKYV  60
FNNLNRSLSK SPELGSDMDV SKNVGSISSC RDSGGEEGEG EGRAVFTTRH QELGQSGKKM  120
KAAVSSLQCG VHKQVWQSGE IYTLEQFESK SKTFAKSLLS VLKEVSPLHI EALFWKVASE  180
KPIYVEYAND VPGSGFGEPE GQFRYFHRRR RRRRKRMSYR RENADCKKDE MNTVHNSRID  240
EIKDTCVKSD QNTCFETPKI SPTSSTMASD DNSHSKRKSG DASNDMEGTA GWKLSNSPWN  300
LQVIARSAGS LTRFMPDDIP GVTSPMVYIG MLFSWFAWHV EDHELHSMNF LHTGSSKTWY  360
AVPGDYAYAF EEVIRTEAYG GNIDRLAALS LLGEKTTLLS PELIVASGIP CCRLIQNPGE  420
FVVTFPRAYH VGFSHGFNCG EAANFGTPQW LQVAKEAAVR RAAMNYLPML SHQQLLYLLT  480
MSFVSRVPRS LLPGARSSRL RDRQKEEREL LVKKAFIEDM LTENKLLSLL LKRGSTYRAI  540
IWDPDLLPYA SKDSELPSET AAVSTIPQEN VADIHSKNNT NQNNLLDEMS LYMENLNYLY  600
LNEDDLSCDF QVDSGTLACV ACGILGYPFM SVVQPSEGTL ELLPADHLSV QGPAVLESKN  660
THSCPDLDHP VEGSVSDNVH LVADQSQPPK DATSPSITKF CQGWDTSNIH LRPRIFCLEH  720
AVQVEEILQS KGGAKMLVIC HSDYQKIKAH AIPVAEDIGI TFNYNDVALD AASQEDLNLI  780
NLAIDDEHDE IGEDWTSKLG VNLRYCVKVR KNSPFKQVQH ALPLGGLFSD KYGSPELFNI  840
KWQSRKSRSK GKLNHPSSKP CESIELKVDE LLVEKLDGNI PKTELKIIQY SRRKKRKPDY  900
STGAGGCLEL VKDDLPREDS AATCELPDEH GGSKSKINAE SDSSVLFSSP STRASQTQPE  960
IQTSSVVGVV QKDHGKILQE SNLNGEGCSL AACASSQKHC EIKLMERTTE NNELSLADKC  1020
SKSSVIAACE RYKESTGAIC EVCNPVYEGQ CEELAARHDL INLANSANSL SAQPSAGRFD  1080
PVLEDIIVEK SCMNGGVHSC TTSDNEVQQE IEATSRNNNE DILCDDKLIN KPNLGPEDFS  1140
SGVSLGDEVQ QETNIRGGSQ VEPFFSSPTL TKGPSTVVVG NRSDVPREPC TAADLCDVAI  1200
SKDKAKKQEI QINANKERLL CGSITPVVID QRTSLSVEEY SVISKNPCAE ELHTGVTSDV  1260
EVLQEIQATK GTSGDEVIYC YHLPIKEKQP TPTVMEGCSE VQRECSSEKK SCADATAADD  1320
RHENDMIRNE KVGEESVSCC VTPINQGTPV PIQKYSRTRR ESRATVNVNN GGEVCSFVEN  1380
RDLESVAVNC RSSATDGRKR KREVVETPEK VGGSGFIRSP CEGLRPRARK DASSSFDAGK  1440
TSQEVLPTKE TRKPSVHTQS KKIIKKGSHR CDLEGCNMSF ETKEELRLHK RNRCPYEGCG  1500
KRFRSHKYAI LHQRVHEDDR PLKCPWKGCS MTFKWAWART EHIRVHTGER PYKCKVVGCG  1560
LSFRFVSDFS RHRRKTGHYV DSSA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1204213RYFHRRRRRR
2204214RYFHRRRRRRR
3208213RRRRRR
4208214RRRRRRR
5208216RRRRRRRKR
6209214RRRRRR
7209216RRRRRRKR
8209215RRRRRRR
9209217RRRRRRRKR
10210216RRRRRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein