PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023884362.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family C3H
Protein Properties Length: 1647aa    MW: 179368 Da    PI: 4.3672
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023884362.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH17.48.2e-0616241647226
                      -S---SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                      +++ C+ f+++G+Ck G++C + H+
  XP_023884362.1 1624 GQRVCK-FHESGHCKKGASCDYLHP 1647
                      6789**.*****************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1647 aa     Download sequence    
METEEDEASN LNQPSLPLQE EDSAAIVCDP LEAEGPKNSE PQWAPEPENE PESESASESH  60
IVGASESQQS PPSQVAGEEA LEVADTDLGT GEKVVSDVAE KVEKEEEEEE DQGESNTVAE  120
VVEEKEEAQD VADVAVEETE VTDLAKETTN LASPAAEDTD FLGPVVAEET ESAETAVAEE  180
TDLAEPEAAV ETDLKETAVA EPVAAADDTG LEEPAAVEET ALVEPTRVEE TDLVELAVAE  240
ETDLVEPKVE EESDVAVAET DLAEEMDTIE PAVAEETELV APVDEMEDAE VEAEAVEEAN  300
VGADLTEETV EAEEHLVEEE EEEAMETEDN VAEETTDAEE AEAAEEEVEM AEETTEAEET  360
EAAEEMEVVE AGRTSGGKRK RGKSTTRAAA ARASTRKRVE EDVCFICFDG GDLVLCDRRG  420
CPKAYHPSCI NRDEAFFRAK GRWNCGWHLC SSCGKNANYM CYTCTFSLCK GCIKDAVILC  480
VRGNKGFCET CMKTVLLIEH NLQGNKDMGQ VDFDDKSSWE YLFKDYWIDL KERLSLTLDE  540
LNQAKNPWKG GSDVPTGKQE SPDELYDANN DERSDSDSSS DNIETSNSKR RRGKRRLKPR  600
TKVSNSSASA ATGSGGPSMV DNAEWASKEL LEFVMHMKNG DRSVLSQFDV QALLLEYIKR  660
NKLRDPRRKS QIICDARLES LFGKPRVGHF EMLKLLESHF LIKEDSQADD LQGSVVDTET  720
SQLETDGNSD SLVKASKDKK RKTRKKGDER GPQSNLEDFA AIDIHNINLI YLRRNLVEEL  780
IEDMEKFQDK VVGSFVRIRI SGSGQKQDLY RLVQVAGTCK AAEPYKVGKR MTDTLLEILN  840
LNKTEIISID IISNQEFTED ECKRLRQSIK CGFINRLTVG DLQEKAMALQ EARVKDWLET  900
EIVRLSHLRD RASEKGRRKE LRECVEKLQL LKTPEERQRR LEEIPEIHAD PNMDPSYESE  960
EDGGETDDKK QETYMRPRGA GFGRRPREPV SPQTGGPAFS DSWSGTRNYS TGNRDLTRNM  1020
SNKGFSHKGD DTTGVGEILN EWNQGRDRET QQINSWEKQK IAASLEIDAS KTQSAVKSES  1080
TSAAAEASLS TTVAQSAVTI NETEKMWHYK DPSGKVQGPF SMVQLRKWNN TGYFPVDLRI  1140
WRITEKQDNS LLLTDALDGK FQLDPPLVDN SFLKAQGVHN LNLPSSYSSK PHGAPLQHGI  1200
EGQVGERSNF DQNRAALSSQ SPAGMLATLS VEVPKISIDG RSSNYQNDST NLPSPTPQST  1260
TSATKGQPYE NKWFQNPIQP ADSVMGAGSF SAGNGGLHPP AAVIPESVLR VSESNGASSH  1320
LGVTPIPKPE KSMLAGSTDT LQMHAQSISP GADMKNSVTN FQNLVQSVAN NNHPVETQGW  1380
GGASSQKVEP NNLANMPAQP ATHGHWGNSQ SVHNSATSFS TGNPVGNFPS AGFSGLPPSD  1440
SWRHPVPVNQ SNIQPPAPPP QTWGMGVAEN QTAGTRPENQ NTSWGPMAGN QNMPWGGLVP  1500
ANANMNWGAS GPGPVPGNAT AGWVAPVQTP GNAGPGWVPP NQVPPPVNAN PGWVASGQGP  1560
PPRNANPGWA APTGNAGNNG DRFTNQRDRV SHGGDSGHGG GKPWNRQSSF GGGGGGGSRP  1620
SFKGQRVCKF HESGHCKKGA SCDYLHP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1589597KRRRGKRRL
2589598KRRRGKRRLK
3590598KRRRGKRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16485.10.0C3H family protein