PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_021636638.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family C3H
Protein Properties Length: 2136aa    MW: 233794 Da    PI: 8.4781
Description C3H family protein
Gene Model
Gene Model ID   Type    Source
XP_021636638.1  genome  NCBI
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-CCCH  17.3   8.6e-06  2011   2032  6          27
                      -SGGGGTS--TTTTT-SS-SSS CS
         zf-CCCH    6 CrffartGtCkyGdrCkFaHgp 27  
                      C+++ +tG C+ G++Ck +H++
  XP_021636638.1 2011 CPTYEATGSCPQGSKCKLHHPK 2032
                      *****************99985 PP
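
The aligned segment above can be checked for the cysteine and histidine spacing that gives the CCCH zinc finger its name. The following is a minimal sketch, not the method PlantTFDB uses (the hit above comes from an HMM profile match). The spacing C-x8-C-x5-C-x3-H is an assumption read off this particular segment, and the function name `find_ccch` is hypothetical.

```python
import re

# Target segment and its start coordinate, copied from the alignment above.
SEGMENT = "CPTYEATGSCPQGSKCKLHHPK"
SEGMENT_START = 2011

# Assumed spacing for this hit only: C-x8-C-x5-C-x3-H.
# Other CCCH fingers use different spacings, so this is not a general detector.
CCCH = re.compile(r"C.{8}C.{5}C.{3}H")


def find_ccch(segment, offset):
    """Return (start, end, match) in 1-based protein coordinates, or None."""
    m = CCCH.search(segment)
    if m is None:
        return None
    # m.end() is exclusive, so subtract 1 to get the last matched residue.
    return offset + m.start(), offset + m.end() - 1, m.group()


print(find_ccch(SEGMENT, SEGMENT_START))
# (2011, 2030, 'CPTYEATGSCPQGSKCKLHH')
```

The match covers residues 2011 to 2030, inside the reported domain range of 2011 to 2032.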

Sequence
Protein Sequence    Length: 2136 aa
MDPRSPYLHH TRYVPRHPQP PPPPPQSHPH PDHPFPNNPN IYASHHSTLI AAPPPPPLRP  60
QPRPPPTPSY HSLPTPPQFN PHDSQFSYNP NTFNSNHPRS HADVHNFTQS PPLMHHKPFD  120
DDLPRRLPNY IRDSRPDLRD PPRVLPDRWP SRPYPPANVD SDSYRRPLDN QPMSPMKIRR  180
ELEGNSRFIE EHKQREELRL GRGDDNYHRR SQFGSTSDRS SRDFRMVSNQ MNQSSPCENL  240
RGLPYDNRMN ENQRWVHGRE VNGDAHYSFI ERGSNEIGDP SEIRVATGKR EHYRCREVNA  300
QLERHSSKGS REDSYEFSRT SRKPLQKKSV LLRIQKPNYR NREDERVHYL GYLDDNKSSS  360
FRGKDQNLYQ NHEMGEQVRE GSPVELDVSF KSNSLVAKAI ATPSSTGVSD LHLTPKNEKV  420
RKVVGLNKDS SSSSAIKPNE GIVKLENAVL VANNASSSDM DLLQSKVEVT ASVTGNVQVS  480
GSLPGSSGTK TSPGNSKVES STKVSVSNKG GTNVISGKTS SLKVAKKKKI VKRVVKKVIN  540
PLSSSSSQPT TKCDGSVTAY GVAHGLPASS EPEKSSALAS VDIVDSQPHL NETNVVPETE  600
NDRVEGFAKV LESDNDTITD SSGLRFPNIK RKRSHSTSPL GSSSHEESKI NGNLANGNSA  660
NYLHGMSSTD KDFSKLLNEN TSSDMDSVEP ASKQLCLDGG SFLLENNTAS LSPKVLGMET  720
NSAEGNTDFG FLSSEEIKIQ EGPASSYNIT LGCDSDSGLI SDGITVSNIG TTDVSCKEPC  780
TNQGKPLAEN GVVDQCLNAN FSVGSGKIFC ESNSGERTIQ NVDTCASCSN EVRTIYSSNS  840
GHIISGEIDF SSNGTIDDVC GRPSSDKVST LENVPTGGSL NCTISTDGSK EDTPNIKKSN  900
KNVEMPQLHV SKSEVNNSYL KPVNMVNSAT WVDTTLRLSF KDPTPTEFTV SGDVCGNVGL  960
RGCTDGISDF CLRSSPDALE ANASGNSTVN VGPSGTSKQN QKKRKFSGSQ LESTCLIASG  1020
ESEGPLTAGI SVSAVEVPCN SGNGLMQPEP EMTVSAMNPL FTSDFPPLKK QITEPLDNCS  1080
VGGYHGTEDS LIDGFEDSGL RGVHSCSTVW ELAVQKVQSP CPSGSEGKQI AEATLVMAGS  1140
SHQNNSILIE SGEAEKMEVD TGEEQDIADS GTAQCQFPSE LQVPDSDERL PGTDVENDSC  1200
QHIKNDLPSM SSYLSSLGDG KEVSATNSSG AVMGLVSDTL PDMLSTSHIQ LSIEKGGGDD  1260
EILLGKRAIK GGSNISVVTS GSPNTEINFN SDHGVENDHS FSGKTGLLPS QDSINSTQMG  1320
NTMSGEVYGR KNQPNQAVSR IYPGRSSVVF ASSKNTASST HISKPRTWHR ADNSSTFGQP  1380
GNKAFSSTVP TQRKLHKQIT KFENTSYIRK GNSLVRKPTT MAAQSQSSHG LSSSVYRLNS  1440
SGTDEVKKNA GSDIRTGVVD PSNFVRTGAN AAFERPRTPP LASATKLPNH ASSFLGNFTS  1500
SPLAEPLHNC ATETASDHMT STASNDVLNS SENAIIISEN PMTQTGQINN LDCHNELNDG  1560
NALSSNANSV TYVKRKSNQL VATSNPSSLS VYNAHNAPAL PSDGYYKRRK NQLVRTSLES  1620
HVQPAFIMPE ESVNPEGQAP HNITSSRSSS KRRSRKAVTK THKPLKFALV WTQRSAQLLN  1680
DDDDSLHRHK FLPHLFPWKR ATYWRSFITN SAANPSNNSS SAISRKLLLS RKRDTVYTRS  1740
KHGFSLRKSK VLSVGGSSLK WSKSIERRSK KASEEATLAV AEAERKKREQ SGASCVVSGT  1800
MNRNSSSRER IFQIGSFRYK MDSSRRTLQR ISDDDSSYSA AFQTEKDFKR SYVPRRLVIG  1860
KDEYVRIGNG NQLVRDPKKR TRILASEKVR WSLHTARSRL ARKRKYCQFF TRFGKCNKDD  1920
GKCPYIHDSS KIAVCTKFLN GLCFNSDCKL THKVIPERMP DCSYYLQGLC TNKNCPYRHV  1980
HVNPNAFTCE GFLRGYCADG NECRKKHSYV CPTYEATGSC PQGSKCKLHH PKNRSKGKKS  2040
KQSREKKIDQ GRYFGSTHIN VSEPGTAVSE THSAQDNSKI CFEGSIADYM ILDVADAVRE  2100
NINLADEQTS FSEGDPLDLK LVDPDELIKP IRIMTT
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    630    635  KRKRSH
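
The NLS entry (KRKRSH, positions 630 to 635) can be cross-checked against the numbered sequence block. The sketch below assumes the layout used in the Sequence section: groups of ten residues separated by spaces, followed by the position of the last residue on the line. The helper names `parse_numbered_line` and `locate` are hypothetical, and only the line ending at position 660 is used as input.

```python
def parse_numbered_line(line):
    """Return (start, residues) for one line of the numbered sequence block.

    Assumes the line ends with the 1-based position of its last residue.
    """
    tokens = line.split()
    if not tokens or not tokens[-1].isdigit():
        raise ValueError("expected a trailing end position")
    end = int(tokens[-1])
    residues = "".join(tokens[:-1])
    return end - len(residues) + 1, residues


def locate(motif, line):
    """Return the 1-based (start, end) of motif within the line, or None."""
    start, residues = parse_numbered_line(line)
    idx = residues.find(motif)
    if idx < 0:
        return None
    first = start + idx
    return first, first + len(motif) - 1


# Line copied from the Sequence section (residues 601 to 660).
LINE = "NDRVEGFAKV LESDNDTITD SSGLRFPNIK RKRSHSTSPL GSSSHEESKI NGNLANGNSA  660"

print(locate("KRKRSH", LINE))  # (630, 635)
```

The final line of the sequence block carries no trailing position, so it would need separate handling (for example, deriving its end from the stated length of 2136 aa).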