PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_020672526.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Orchidaceae; Epidendroideae; Malaxideae; Dendrobiinae; Dendrobium
Family C3H
Protein Properties Length: 631aa    MW: 74550.6 Da    PI: 7.3568
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_020672526.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH23.78.1e-08219238625
                     -SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                     C+f+++tG C++G+rC++ H
  XP_020672526.1 219 CPFHLKTGACRFGSRCSRVH 238
                     ******************99 PP

2zf-CCCH30.18.3e-10309336126
                     --S---SGGGGTS..--TTTTT-SS-SS CS
         zf-CCCH   1 yktelCrffartG..tCkyGdrCkFaHg 26 
                     +k ++C  +++t   tC++G+ C+F+H+
  XP_020672526.1 309 WKVAICGEYMKTRlkTCSHGSACNFIHC 336
                     899************************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 631 aa     Download sequence    
MANPVNGEMS QEEAFAVCGG KQSRKEKRKV LKKLKRKQAR RIAAIREREM AEFLLNDPEE  60
QLLLRLREQE EADNAEKERR EYEERERVWL EAAAARKSRE EEDERRKKLL QEEERKQNEH  120
DYGAEGDVEH EYVDEGPAEI IWQGNEIIVK KRRVKVAKKI VEQIPDKEDD DRPTSNTLPP  180
QSATFASYVQ GETITAQEII DSVAQKIPNF GTEQDKAHCP FHLKTGACRF GSRCSRVHFY  240
PEKSSTLLIK NMYNGPGLTC EQDEGLEYTD EEVDHSYEEF YEDVHTEFLK FGEIVNFKII  300
CEFVGVTRWK VAICGEYMKT RLKTCSHGSA CNFIHCFRNP GGDYEWADWD NPPPTYWIKR  360
MIFLFGTSVE SGYGKELDSE DYKRHREQER MRKPKNYRHQ SERFHYKEMD LNNSSSDGEE  420
LNSKKLQQTR YSTKRDRSIK RRGRLVSENK NELLSENHRS EERFHSKSFF SHCDDAGECT  480
SKEAIEGSIK KHRRYVSKTP EHDRRDESEN RSSSSRRKIK LLYSDEKENS EPKSHKKQKK  540
NQVKDRHLEN SDWDSFRSKD QFNEECISED PDSEKDSDGV RYHYSKRRKY DKHRSHRSHK  600
HRYITDKVRK GGEESDGSPG RWKVSEEDFS R
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1150155KKRRVK
2516539RRKIKLLYSDEKENSEPKSHKKQK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G10320.11e-111C3H family protein