PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021301161.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Herrania
Family C3H
Protein Properties Length: 967aa    MW: 107170 Da    PI: 8.7415
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021301161.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH26.98.6e-09184205325
                     S---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   3 telCrffartGtCkyGdrCkFaH 25 
                     t +C+ f++ G C++G++C+F H
  XP_021301161.1 184 TQICKEFMA-GRCRRGSQCQFLH 205
                     789******.************* PP

2zf-CCCH22.81.6e-07247266626
                     -SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaHg 26 
                     C+++++ G C++G++C+FaH+
  XP_021301161.1 247 CNDYLK-GNCRRGASCRFAHD 266
                     ******.*************5 PP

3zf-CCCH17.95.7e-06308328425
                     ---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   4 elCrffartGtCkyGdrCkFaH 25 
                     ++C++fa+ G C+ G  C+F+H
  XP_021301161.1 308 VPCKYFAA-GNCRNGKYCRFSH 328
                     59**9999.************* PP

Sequence ? help Back to Top
Protein Sequence    Length: 967 aa     Download sequence    
MSGSRKRGSK WDSKEERQYS LENVRDAAWP AKAGVSFHDR ESEHGYFSPE VGRNGNKWSF  60
VEASDMMKSK HGLPSRESLT GGRGARKDEN INVDCVKNWK TTTPWDGDET YSMRMSPGLD  120
DWRQQNRRHS PKSDWTRSQS FTHKSRSRSW SRSRSRSRSR SPVRGIRRQS GFHERTRSRS  180
GVSTQICKEF MAGRCRRGSQ CQFLHQDIQS HEDGWDNRQK KAGGSKYCTS NDGKECLMKS  240
GRSSDCCNDY LKGNCRRGAS CRFAHDGASD GFSRGSINEV SRERESNKRN RVATPERDGE  300
REARRSDVPC KYFAAGNCRN GKYCRFSHHG QARASPERSR GDRGGWGQSS VSVDKLRDGA  360
KFRDADASYN VEKSRNGLKW SDADASNEAE KCWAGPKWSD VDASNDVDKS WTGSKWGDTG  420
TYAGAANMSK DINGKVGASE SRFPDWSMDE RWQRNYDVSG KSSETKVHHE TVDIDKDETI  480
PRKIENAGLS TGVSEPRGAE ESLGDMEMSP EWNYRIPSSV KKEPSHSSMS QTPIDSSLTA  540
HEKDIVEEAS GRVCDGLAAS QPISIQKSNF QHDQVMRGNS AVALPCDSNA ASRNSAISHI  600
DLNFSSSILQ MKSFDQPGPS SSSLPYSNLN VVGQSQVAIP SDSNEVNVKV MQNNLLFQEE  660
KPSNKMNFGD TNTSNGNSGT QSTQNMVSNE QLTHLTNLSA SLAQLFGKGQ QLPLLHVALN  720
AHDAMQVNSF ASSGGPIEPD SIPTVQPGQD VTFPKQYDPI SDSIEPVKKQ DTNTKPLGFS  780
AHPVAQKNTA DGKPELSANM LLPSSLVGST NGGDYHNDLS SKRKPDSDSH MPNQVERVAS  840
SEVTKENEGV EETKKAQEEN KNGPSENIDA DDRTDEGKKS KDGKGIRAFK FALVEFVKDL  900
LKPTWKEGQI GKDAYKNIVK KVVDKVTATM QGANIPQTPE KIDQYLSFSK PKLSKLVQAY  960
VEKFQKN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1152160RSRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.15e-19C3H family protein