PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022721311.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C3H
Protein Properties Length: 950 aa    MW: 104938 Da    pI: 8.7706
Description C3H family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
XP_022721311.1   genome  NCBI    View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  28.3   3.1e-09  188    209  3          25
                     S---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   3 telCrffartGtCkyGdrCkFaH 25 
                     t lC++f++ G C++G++C+F H
  XP_022721311.1 188 TQLCKDFMA-GRCRRGSQCHFLH 209
                     78*******.************* PP

2    zf-CCCH  23.7   8.4e-08  251    270  6          26
                     -SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaHg 26 
                     C ++++ G C++G++C+FaH+
  XP_022721311.1 251 CTNYLK-GNCRRGASCRFAHD 270
                     ******.*************5 PP
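
Both zf-CCCH hits above show the same cysteine/histidine spacing in the query sequence, C-x7-C-x5-C-x3-H. As a rough cross-check, the sketch below scans residues 181-360 of the protein sequence for that spacing with a regular expression. The pattern is an assumption read off the two alignments, not the Pfam HMM that produced the scores, and `find_ccch` is a hypothetical helper name.

```python
import re

# Residues 181-360 of XP_022721311.1, copied from the Protein Sequence block.
FRAGMENT_START = 181
FRAGMENT = """
RNRSGVSTQL CKDFMAGRCR RGSQCHFLHQ DIQSREDGWD NRQRRVGASN YIPRNDSKDY
LTRSGRSTDF CTNYLKGNCR RGASCRFAHD GASDGFSRGS VSRERENNKR NRVKTPEQDG
EREAQRSSDI PCKYFVAGNC RDGKYCRFSH HGQARAGPER SLGDRGVWGQ GSVSVDKLQD
"""

# Assumed spacing (C-x7-C-x5-C-x3-H), taken from the two aligned hits.
# This is a fixed-spacing regex, not the zf-CCCH profile HMM.
CCCH_PATTERN = re.compile(r"C.{7}C.{5}C.{3}H")


def find_ccch(fragment, start):
    """Return (first, last, motif) tuples with 1-based inclusive positions."""
    residues = "".join(fragment.split())  # drop spaces and line breaks
    return [
        (m.start() + start, m.end() + start - 1, m.group())
        for m in CCCH_PATTERN.finditer(residues)
    ]


hits = find_ccch(FRAGMENT, FRAGMENT_START)
for first, last, motif in hits:
    print(first, last, motif)
```

A fixed-spacing regex is much cruder than an HMM search: it reports only the core motif, so its coordinates fall inside the Start/End ranges of the table rather than matching them exactly, and it carries no score or E-value. On this fragment it also matches a third stretch (312-330) that the domain table does not list.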

Sequence
Protein Sequence    Length: 950 aa
MSRSRKRSSK WDSKEESQHS LENVRDSACP AKAGVSFHDR ESEHGSFSLE VGRNGNKWSV  60
AGASDAMKSK RGLPSRESLH GSRGGENDDK INVDCVKNWK TTAQLDGDET YSMKMSPGLD  120
DWRQQNHHHS PKSDWSRSRS FTQRSRSRSR SRSWSRSRSR SRSRSPVRGI RRQSGFSERT  180
RNRSGVSTQL CKDFMAGRCR RGSQCHFLHQ DIQSREDGWD NRQRRVGASN YIPRNDSKDY  240
LTRSGRSTDF CTNYLKGNCR RGASCRFAHD GASDGFSRGS VSRERENNKR NRVKTPEQDG  300
EREAQRSSDI PCKYFVAGNC RDGKYCRFSH HGQARAGPER SLGDRGVWGQ GSVSVDKLQD  360
GAKLRGADAS FSVEKSWNAP KRGDDNASNE AEKPLASPKW SDVDASDDVD NSWTGSKWND  420
RGAYSGDTKL SKDTNGKMAA SGSRFSGWSM DERWQHDNDV SGKNSETNVH YKTVDIDKDE  480
GIPRRIQNSG VNMGVSEPKD SEELLGDMEM SPEWNYGIHS SVKKEHSYSS KSTAVDTSLP  540
TNEKNVTEEA SGQACNGLAA LQPISTEKSN FQQDHMMRAS SAVALPCDSN AVSRNASISL  600
IDVNFSTNVL PMTSFDQPGP SSSSLPYSNL NAVGQSQVAI PSNEVNMKAT QNSLLFQEEK  660
PSNKLNILDT NILHGNSGSQ TTQNMVSNEQ LTQLTNLSAS LAQLFGKGQV TSFSNCGGPV  720
EPDSVPTVQP GQDITFPKQY DPISDSIEPA KKQDTNTKPL GFSLHPVSQK NTADGKPELS  780
ANRLLPSSLV GGTNDGDYHN DQSSKSEPGF DSHKPNQLEP AASSEVTKEN GGVEETEKAE  840
EENKNGLSEK IDADDRTDEG KKSKDGKGIR AFKFALVEFV KDLLKPSWKE GQIGKEAYKN  900
IVKKVVDKVT ATMQGTNIPQ TPEKIDQYLS FSKPKLGKLV QAYVEKFQKS
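
The sequence above is laid out in blocks of 10 residues with a running position counter at the end of each full line. A minimal sketch of turning that layout back into a plain residue string is shown below, using only the first two lines as sample input. `parse_numbered_sequence` is a hypothetical helper, and the assumption that a trailing all-digit field is always a position counter is mine, not something the page states.

```python
SAMPLE = """
MSRSRKRSSK WDSKEESQHS LENVRDSACP AKAGVSFHDR ESEHGSFSLE VGRNGNKWSV  60
AGASDAMKSK RGLPSRESLH GSRGGENDDK INVDCVKNWK TTAQLDGDET YSMKMSPGLD  120
"""


def parse_numbered_sequence(text):
    """Join 10-residue blocks into one string, checking the line counters."""
    parts = []
    total = 0
    for line in text.strip().splitlines():
        fields = line.split()
        counter = None
        # Assumption: a trailing all-digit field is the running position.
        if fields and fields[-1].isdigit():
            counter = int(fields.pop())
        chunk = "".join(fields)
        total += len(chunk)
        if counter is not None and counter != total:
            raise ValueError(
                f"counter {counter} does not match {total} residues read"
            )
        parts.append(chunk)
    return "".join(parts)


seq = parse_numbered_sequence(SAMPLE)
print(len(seq), seq[:10])
```

Applied to the full block, the same function gives a string whose length can be compared with the stated 950 aa; a line without a counter, such as the last one, is accepted as-is.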
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    144    152  RSRSRSRSR
2    156    164  RSRSRSRSR
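
The Start and End columns are 1-based, inclusive positions in the protein sequence. The sketch below checks both NLS rows against the sequence line covering residues 121-180; `slice_inclusive` is a hypothetical helper introduced only for this illustration.

```python
# Residues 121-180 of XP_022721311.1, copied from the Protein Sequence block.
ROW_START = 121
ROW = "DWRQQNHHHS PKSDWSRSRS FTQRSRSRSR SRSWSRSRSR SRSRSPVRGI RRQSGFSERT"

# (No., Start, End, Sequence) as listed in the NLS table.
NLS_TABLE = [
    (1, 144, 152, "RSRSRSRSR"),
    (2, 156, 164, "RSRSRSRSR"),
]


def slice_inclusive(row, row_start, start, end):
    """Return residues start..end (1-based, inclusive) from a sequence row."""
    residues = row.replace(" ", "")
    return residues[start - row_start : end - row_start + 1]


# Map each NLS number to whether the table entry matches the sequence.
checks = {
    no: slice_inclusive(ROW, ROW_START, start, end) == expected
    for no, start, end, expected in NLS_TABLE
}
print(checks)
```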
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G33835.1  3e-18    C3H family protein