PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pav_sc0001275.1_g050.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family C3H
Protein Properties Length: 2090 aa    MW: 230415 Da    pI: 8.3172
Description C3H family protein
Gene Model
Gene Model ID                Type      Source
Pav_sc0001275.1_g050.1.mk    genome    GDR
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-CCCH  21.1   5.6e-07  1958   1979  6          27
                                 -SGGGGTS--TTTTT-SS-SSS CS
                    zf-CCCH    6 CrffartGtCkyGdrCkFaHgp 27  
                                 C+ f +tGtC++G++Ck +H++
  Pav_sc0001275.1_g050.1.mk 1958 CPSFEATGTCPHGPKCKLHHPR 1979
                                 *****************99985 PP
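
Read off the consensus line, the aligned segment follows a C-x8-C-x5-C-x3-H spacing of its cysteine and histidine residues. The sketch below checks that spacing with a regular expression and reports 1-based protein coordinates. It is a simplified illustration only: the record's own match comes from a profile HMM, and the pattern and the helper name `find_ccch` are assumptions introduced here, not part of PlantTFDB.

```python
import re

# Spacing read off the alignment above: C-x8-C-x5-C-x3-H (an assumption
# for illustration; the record's own match comes from a profile HMM).
CCCH_PATTERN = re.compile(r"C.{8}C.{5}C.{3}H")

def find_ccch(segment, offset=1):
    """Return (start, end, text) tuples in 1-based inclusive coordinates.

    `offset` is the protein position of the first residue in `segment`.
    """
    hits = []
    for m in CCCH_PATTERN.finditer(segment):
        hits.append((m.start() + offset, m.end() + offset - 1, m.group()))
    return hits

# Matched segment from the record, residues 1958-1979.
segment = "CPSFEATGTCPHGPKCKLHHPR"
hits = find_ccch(segment, offset=1958)
print(hits)
```

The regular expression stops at the final histidine, so the reported end falls two residues short of the HMM match end (1979), which also covers the trailing `PR`.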

Sequence
Protein Sequence    Length: 2090 aa
MDLPQYLHQH PRYAPSHPYP ADPLNFPNYP HRHHNNHHYN HHHHQQPPLQ IQPPPPPQPP  60
LPPTSSYHPL PPPPPPPPHP YNPSQPQLPF DSEHSRTFHS LHDFPVSSPR VSSRVTLDAE  120
RHRHHRLPQF DLPYLEKNPD SWDPSRAYLD FERDRESHKP QFRLESEGSG SARFRSEYNE  180
QLLRDQRVVD DEYNNRRRAR VEPNSEIAYR ELGFVSNQNS NHLNSNNLDF DSKSGGYDGR  240
YDELMRSGGG RRDEVYENTQ RWVHHDRQAS RELYDPFEVD ETNGGRNVSG NREYYGSESG  300
RGSSNNRRKQ IQKKSALLRL QMAKPSHKNH SAYFDNSGSS SHRGKGHYEY SDREMDEEER  360
GGSPLELDVS FKSNSLVAKT VGNRNFKRDS VFDREFSNSQ LTKLSEDAVH LDSSVVVEDM  420
TSNSDKDLRL LEEEVTTSGV ESRCDIDSQP CSNVTDDSFG KSEVERASKS KVLQRADKSA  480
GSGQMPSLKV SKKKKVAKKV VKKVINPQPL PKNKIDEPGV ADSFICRPSA AFGADRGETS  540
SFADPCSNDV HALPVNKKVD GSSLNMLSDE HGTEANSCSK STGSNSTSKL GSSNHEEFNI  600
DQGPLTVDTS VQGLLTISNF SNNVTDSLRV ASCPETDGVI DVSKQICHSG NSLSLDNVIR  660
KESSEAMLSV EGNANSGFLS SEKIMMHDDI MNANGSGHGT ETTLDIESGR NVLHQEIIVH  720
DIGTVDAINE NVCKYQFPTS LQIGFVEELP KGISSAESSM TVGLSSSGET LAVCSNSGRG  780
TTWDSDKVCT NYDENIIGKQ PSADGASRSF GICATQRSPD ITKSVGDSKG VTHKNKKKRK  840
VRSRLDSSRA SNTCAEPINV SVNKNSVDTT VSSSLKDASH AEVAVFGVGK LDIGSQRVND  900
GVSVMHGKSS VDGLCEAKLS TRSDVNCDPN ETSPKYIKKR KLSASHLVLT TSQTNDGPTD  960
KSTFYTDAPL KSNDVPTQEE DEVAASSTGL LLATANLMPS QEGSTVFLKD NIAGVLSDAV  1020
AAARDAFTND GMKSEHQGVD SCSIYEESVS DTLFLCPSQL RNEQKEAGTQ VMVINNHHLD  1080
IMDIESNREE NFDIVATDEQ VIIHGETALC RVSSEVEPPD LGYKFSCTDM ESDYVSVKDT  1140
LPFASNRLLL CANDNEVSTT NSNDEGVESV PDTLSDTGSP ETSTDVPGVQ MRTCSPSVIK  1200
ISDGKDCGDD QKLDLKSVVE VGCSASAQNS LSECTKSNLT PHPVTEGGQS VTGKTVALPL  1260
QDIKKTAHGL NLVTAESRLK NQLGQATHRI VPGHSYSVFS TSKKTGSSNH MAKPRTWHRN  1320
GNASASSLPA SMPFSSTVPP QRILPPKDGK LQSNSYVRKG NSLVRKPVPV AALPQSSHGF  1380
SSAVYRLNSL GIDGLKKNAG SDSRVDVKNP PSLVRTGEMN APFDRPRPLL PNGAKLSTYD  1440
AISLGVRTSS QLAEPLLSGE TTSDPMNCLE TKDAKIVVND SLVTSETQEN LSGPFNSLEN  1500
QTELHDGNLA PSNTKNIVYV KRKLNQLVAS SSPCDLPVHN ADKIQHSSFD GYYKRRKNQL  1560
IRTSSEGHAK QAVIMSNDNL NSQVQKVPKI VPSRIYGKKR SQKVIAKTSK TGKNSLVWTP  1620
RGTQASNNDG DSFDHQKVLP HLFPWKRARH WRTSMQSQAS NFKYSSASTI SKKLLLSRRR  1680
DTVYTRSTHG FSLRMYKVLS VGGSSLKWSK SIESRSKKAN EEATRAVAAV EKKKREHSGA  1740
ACVGSGSKFR NNISGKRIFR IGCVRYKMDP SRRTLQRISD DGSSSSAVLN PEKDAKRSYV  1800
PRRLVIGNDE YVRIGNGNQL IRNPKKRTRI LANERVRWSL HTARLRLAKK RKYCQFFTRF  1860
GKCNKDDGKC PYIHDPSKIA VCTKFLKGLC SNPNCKLTHK VIPERMQDCS YFLQGLCSNE  1920
SCPYRHVNVN PKASTCEGFL KGYCADGNEC RKKHSYVCPS FEATGTCPHG PKCKLHHPRN  1980
RTKGKKRKRT REQKNAWGRY FVSKDINFSE PRAVSGKHCA QNGDGIFDDG RAADFISIDA  2040
SDEEAGESND PINEQAASCD SDSSELELDD LDELIKPVRL LDRSLKTNIL
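
The sequence is laid out in groups of ten residues with a running position counter at the end of each line, so it has to be cleaned before the domain and NLS coordinates in this record can be used as indices. A minimal sketch follows, assuming that layout. The helper names `parse_sequence_block` and `subsequence` are hypothetical, and only two lines of the record are included to keep the example short.

```python
import re

def parse_sequence_block(lines):
    """Join numbered sequence lines into one string of residues.

    Drops the spaces between 10-residue groups and the running
    position counter at the end of each line.
    """
    residues = []
    for line in lines:
        # Keep letters only; digits and whitespace are layout.
        residues.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(residues)

def subsequence(seq, start, end, first_pos=1):
    """Slice `seq` using 1-based inclusive protein coordinates."""
    return seq[start - first_pos : end - first_pos + 1]

# Two lines copied from the record; they cover residues 1921-2040.
lines = [
    "SCPYRHVNVN PKASTCEGFL KGYCADGNEC RKKHSYVCPS FEATGTCPHG PKCKLHHPRN  1980",
    "RTKGKKRKRT REQKNAWGRY FVSKDINFSE PRAVSGKHCA QNGDGIFDDG RAADFISIDA  2040",
]
fragment = parse_sequence_block(lines)
domain = subsequence(fragment, 1958, 1979, first_pos=1921)
nls = subsequence(fragment, 1985, 1990, first_pos=1921)
print(domain, nls)
```

With the full 2090-residue sequence, `first_pos` can be left at its default of 1 and the coordinates from the record apply directly.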
Nuclear Localization Signal (NLS)
No.  Start  End   Sequence
1    836    841   KKKRKV
2    1985   1990  KKRKRT