PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021636637.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family C3H
Protein Properties Length: 2146aa    MW: 234828 Da    PI: 8.5112
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021636637.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH17.38.7e-0620212042627
                      -SGGGGTS--TTTTT-SS-SSS CS
         zf-CCCH    6 CrffartGtCkyGdrCkFaHgp 27  
                      C+++ +tG C+ G++Ck +H++
  XP_021636637.1 2021 CPTYEATGSCPQGSKCKLHHPK 2042
                      *****************99985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2146 aa     Download sequence    
MDPRSPYLHH TRYVPRHPQP PPPPPQSHPH PDHPFPNNPN IYASHHSTLI AAPPPPPLRP  60
QPRPPPTPSY HSLPTPPQFN PHDSQFSYNP NTFNSNHPRS HADVHNFTQS PPLMHHKPFD  120
DDLPRRLPNY IRDSRPDLRD PPRVLPDRWP SRPYPPANVD SDSYRRPLDN QPMSPMKIRR  180
ELEGNSRFIE EHKQREELRL GRGDDNYHRR SQFGSTSDRS SRDFRMVSNQ MNQSSPCENL  240
RGLPYDNRMN ENQRWVHGRE VNGDAHYSFI ERGSNEIGDP SEIRVATGKR EHYRCREVNA  300
QLERHSSKGS REDSYEFSRT SRKPLQKKSV LLRIQKPNYR NREDERVHYL GYLDDNKSSS  360
FRGKDQNLYQ NHEMGEQVRE GSPVELDVSF KSNSLVAKAI ATPSSTGVSD LHLTPKNEKV  420
RKVVGLNKDS SSSSAIKPNE GIVKLENAVL VANNASSSDM DLLQSKVEVT ASVTGNVQVS  480
GSLPGSSGTK TSPGNSKVES STKVSVSNKG GTNVISGKTS SLKVAKKKKI VKRVVKKVIN  540
PLSSSSSQPT TKCDGSVTAY GVAHGLPASS EPEKSSALAS VDIVDSQPHL NETNVVPETE  600
NDRVEGFAKV LESDNDTITD SSGLRFPNIK RKRSHSTSPL GSSSHEESKI NGNLANGNSA  660
NYLHGMSSTD KDFSKLLNEN TSSDMDSVEP ASKQLCLDGG SFLLENNTAS LSPKVLGMET  720
NSAEGNTDFG FLSSEEIKIQ EGPASSYNIT LGCDSDSGLI SDGITVSNIG TTDVSCKEPC  780
TNQGKPLAEN GVVDQCLNAN FSVGSGKIFC ESNSGERTIQ NVDTCASCSN EVRTIYSSNS  840
GHIISGEIDF SSNGTIDDVC GRPSSDKVST LENVPTGGSL NCTISTDGSK EDTPNIKKSN  900
KNVEMPQLHV SKSEVNNSYL KPVNMVNSAT WVDTTLRLSF KDPTPTEFTV SGDVCGNVGL  960
RGCTDGISDF CLRSSPDALE ANASGNSTVN VGPSGTSKQN QKKRKFSGSQ LESTCLIASG  1020
ESEGPLTAGI SVSAVEVPCN SGNGLMQPEP EMTVSAMNPL FTSDFPPLKK QITEPLDNCS  1080
VGGYHGTEDS LIDGFEDSGL RGVHSCSTVW ELAVQKVQSP CPSGSEGKQI AEATLVMAGS  1140
SHQNNSILIE SGEAEKMEVD TGEEQDIADS GTAQCQFPSE LQVPDSDERL PGTDVENDSC  1200
QHIKNDLPSM SSYLSSLGDG KEVSATNSSG AVMGLVSDTL PDMLSTSHIQ LSIEKGGGDD  1260
EILLGKRAIK GGSNISVVTS GSPNTEINFN SDHGVENDHS FSGKTGLLPS QDSINSTQMG  1320
NTMSGEVYGR KNQPNQAVSR IYPGRSSVVF ASSKNTASST HISKPRTWHR ADNSSTFGQP  1380
GNKAFSSTVP TQRKLHKQIT KFENTSYIRK GNSLVRKPTT MAAQSQSSHG LSSSVYRLNS  1440
SGTDEVKKNA GSDIRTGVVD PSNFVRTGAN AAFERPRTPP LASATKLPNH ASSFLGNFTS  1500
SPLAEPLHNC ATETASDHMT STASNDVLNS SENAIIISEN PMTQTGQINN LDCHNELNDG  1560
NALSSNANSV TYVKRKSNQL VATSNPSSLS VYNAHNAPAL PSDGYYKRRK NQLVRTSLES  1620
HVQPAFIMPE ESVNPEGQAP HNITSSRSSS KRRSRKAVTK THKPLKFALV WTQRSAQLLN  1680
DDDDSLHRHK FLPHLFPWKR ATYWRSFITN SAANPSNNSS SAIRKLLLSR KRDTVYTRSK  1740
HGFSLRKSKV LSVGGSSLKW SKSIERRSKK ASEEATLAVA EAERKKREQS GASCVVSGTM  1800
NRNSSSRKSV PSINLHSGER IFQIGSFRYK MDSSRRTLQR ISDDDSSYSA AFQTEKDFKR  1860
SYVPRRLVIG KDEYVRIGNG NQLVRDPKKR TRILASEKVR WSLHTARSRL ARKRKYCQFF  1920
TRFGKCNKDD GKCPYIHDSS KIAVCTKFLN GLCFNSDCKL THKVIPERMP DCSYYLQGLC  1980
TNKNCPYRHV HVNPNAFTCE GFLRGYCADG NECRKKHSYV CPTYEATGSC PQGSKCKLHH  2040
PKNRSKGKKS KQSREKKIDQ GRYFGSTHIN VSEPGTAVSE THSAQDNSKI CFEGSIADYM  2100
ILDVADAVRE NINLADEQTS FSEGDPLDLK LVDPDELIKP IRIMTT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1630635KRKRSH