PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_020704450.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Orchidaceae; Epidendroideae; Malaxideae; Dendrobiinae; Dendrobium
Family C3H
Protein Properties    Length: 906 aa    MW: 101601 Da    pI: 7.7066
Description C3H family protein
Gene Model
Gene Model ID     Type     Source    Coding Sequence
XP_020704450.1    genome   NCBI      View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  19     2.4e-06  236    256  4          25
                     ---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   4 elCrffartGtCkyGdrCkFaH 25 
                     e Cr fa+ G C++G+ C++ H
  XP_020704450.1 236 EACRAFAA-GNCRRGNECRYLH 256
                     68******.************* PP

2    zf-CCCH  17     1e-05    395    414  5          25
                     --SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   5 lCrffartGtCkyGdrCkFaH 25 
                     +C++f++ G+C +G+ CkF H
  XP_020704450.1 395 PCKYFLQ-GHCIHGEGCKFLH 414
                     7******.************* PP

Sequence
Protein Sequence    Length: 906 aa
MIQMEQKAFC WFNIEKCNNR TITEGSRQFK LFLMDEKMKR RIPKWDVAAE LHVDTVGHES  60
ALLEKVDDPC MNLTVVDSMN PIISMNLDGS DVNQSATQTG EEGNNNIIHS DDICEKTQIS  120
GQFDLEDDCK YGLKKQDWEP ASQDRNIAEH TENQDKSHKA VYDMPLDPEE RHQSTRNMSP  180
VDDRRRLHRK RSRSRSLSRD SRSRSWSANR GKSPSYVSKQ GSERWTDKGR MRGMNEACRA  240
FAAGNCRRGN ECRYLHEDGC LRQIDRHHDT GLTDGRGIRL ERERQLVYGS NETLTDTEER  300
NKYSRDKLSG MHEEHHEDGS AANIQHPSYR SNERCFDFIR GRCFRGEACR YIHDDASSDG  360
GWSVKDEFSG RVHDRRGESA SFTRRIHHQR LSDIPCKYFL QGHCIHGEGC KFLHQPGPLN  420
IPLDRSLDGS QHSFVSAGPA GNSTSTNASE QQHQTARFSS TNPHRENLRG VGDTQPIDKQ  480
KFSKRSLSQD GHKRPHSSEQ QATTSEVCEE LSVNNPLPTT LIACQNADSR GQNQSAPKIL  540
HAQNHFAPPI PSKGQMLQVM HSHYQNGKNQ FAALEKPPDA HSFNANIQNR SANPSSHQLN  600
PQNLDSRAQV QKNISILSNS GSNFNHNGQI QQNGPLILVS HSGLGQQEIA GDSQNVLPQN  660
VQNQVQNIPA YISNGQNNLI VVPQPSLNPQ ILNSDDKNCQ LTMKKQLPHD TDSSEVKPSQ  720
FNSDNSITQK VVTIEQAAKI TDLSASLAQF FGSAAQNSQS SAAITPQPFL NPNSLASFAT  780
PIAPPLPTEV DTKQIDDKTE AHGTNGEAEN GKDVDTKKIK EAKEIRLFKC ALVEFVKDAL  840
KPKWKEGQLS KEAYKTIVKK VVEKVCGSLQ GPNVPQTQEK IDLYLLNCKA KLTKLVQAYV  900
EKFVNS
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    190    195  KRSRSR
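
The Start and End columns in this record read as 1-based, inclusive positions in the protein sequence. The following minimal Python sketch checks that reading against two rows of the sequence block above. The helper names `clean_block` and `region` and the `first_pos` parameter are illustrative assumptions and are not part of PlantTFDB.

```python
def clean_block(block: str) -> str:
    """Keep only residue letters: drops spaces, line breaks and running position counts."""
    return "".join(ch for ch in block if ch.isalpha())


def region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the sequence position of seq[0]; it lets a partial
    sequence (here, residues 181 onward) be indexed by absolute position.
    """
    return seq[start - first_pos : end - first_pos + 1]


# Rows covering residues 181-300, copied from the sequence block of this record.
rows_181_300 = clean_block(
    """
    VDDRRRLHRK RSRSRSLSRD SRSRSWSANR GKSPSYVSKQ GSERWTDKGR MRGMNEACRA  240
    FAAGNCRRGN ECRYLHEDGC LRQIDRHHDT GLTDGRGIRL ERERQLVYGS NETLTDTEER  300
    """
)

# NLS row: Start 190, End 195
print(region(rows_181_300, 190, 195, first_pos=181))  # KRSRSR

# First zf-CCCH domain: Start 236, End 256 (the alignment shows it with one gap)
print(region(rows_181_300, 236, 256, first_pos=181))  # EACRAFAAGNCRRGNECRYLH
```

With the full 906-residue sequence, `first_pos` can be left at its default of 1.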
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT2G33835.1   9e-17    C3H family protein