PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021301160.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Herrania
Family C3H
Protein Properties Length: 993aa    MW: 110228 Da    PI: 8.7415
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021301160.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH26.88.9e-09210231325
                     S---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   3 telCrffartGtCkyGdrCkFaH 25 
                     t +C+ f++ G C++G++C+F H
  XP_021301160.1 210 TQICKEFMA-GRCRRGSQCQFLH 231
                     789******.************* PP

2zf-CCCH22.81.6e-07273292626
                     -SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaHg 26 
                     C+++++ G C++G++C+FaH+
  XP_021301160.1 273 CNDYLK-GNCRRGASCRFAHD 292
                     ******.*************5 PP

3zf-CCCH17.85.8e-06334354425
                     ---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   4 elCrffartGtCkyGdrCkFaH 25 
                     ++C++fa+ G C+ G  C+F+H
  XP_021301160.1 334 VPCKYFAA-GNCRNGKYCRFSH 354
                     59**9999.************* PP

Sequence ? help Back to Top
Protein Sequence    Length: 993 aa     Download sequence    
MQFGWLEQMF TNLTALIRIR ATTFGEMSGS RKRGSKWDSK EERQYSLENV RDAAWPAKAG  60
VSFHDRESEH GYFSPEVGRN GNKWSFVEAS DMMKSKHGLP SRESLTGGRG ARKDENINVD  120
CVKNWKTTTP WDGDETYSMR MSPGLDDWRQ QNRRHSPKSD WTRSQSFTHK SRSRSWSRSR  180
SRSRSRSPVR GIRRQSGFHE RTRSRSGVST QICKEFMAGR CRRGSQCQFL HQDIQSHEDG  240
WDNRQKKAGG SKYCTSNDGK ECLMKSGRSS DCCNDYLKGN CRRGASCRFA HDGASDGFSR  300
GSINEVSRER ESNKRNRVAT PERDGEREAR RSDVPCKYFA AGNCRNGKYC RFSHHGQARA  360
SPERSRGDRG GWGQSSVSVD KLRDGAKFRD ADASYNVEKS RNGLKWSDAD ASNEAEKCWA  420
GPKWSDVDAS NDVDKSWTGS KWGDTGTYAG AANMSKDING KVGASESRFP DWSMDERWQR  480
NYDVSGKSSE TKVHHETVDI DKDETIPRKI ENAGLSTGVS EPRGAEESLG DMEMSPEWNY  540
RIPSSVKKEP SHSSMSQTPI DSSLTAHEKD IVEEASGRVC DGLAASQPIS IQKSNFQHDQ  600
VMRGNSAVAL PCDSNAASRN SAISHIDLNF SSSILQMKSF DQPGPSSSSL PYSNLNVVGQ  660
SQVAIPSDSN EVNVKVMQNN LLFQEEKPSN KMNFGDTNTS NGNSGTQSTQ NMVSNEQLTH  720
LTNLSASLAQ LFGKGQQLPL LHVALNAHDA MQVNSFASSG GPIEPDSIPT VQPGQDVTFP  780
KQYDPISDSI EPVKKQDTNT KPLGFSAHPV AQKNTADGKP ELSANMLLPS SLVGSTNGGD  840
YHNDLSSKRK PDSDSHMPNQ VERVASSEVT KENEGVEETK KAQEENKNGP SENIDADDRT  900
DEGKKSKDGK GIRAFKFALV EFVKDLLKPT WKEGQIGKDA YKNIVKKVVD KVTATMQGAN  960
IPQTPEKIDQ YLSFSKPKLS KLVQAYVEKF QKN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1178186RSRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.16e-19C3H family protein