PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Ro02_G23029
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rubus
Family C3H
Protein Properties Length: 1624aa    MW: 177950 Da    PI: 4.3355
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Ro02_G23029genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH18.82.9e-0616001622326
                   S---SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH    3 telCrffartGtCkyGdrCkFaHg 26  
                    + Cr ++++G+Ck G++C + H+
  Ro02_G23029 1600 GRVCR-YYESGHCKKGASCDYLHP 1622
                   57899.8999*************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1624 aa     Download sequence    
MAAEAEREDV EREREEVGAE REGVEVNREE VQGRREEVGD DGEKTDEVGG VIEDVKVDRE  60
EVQVEKEEVE LERAEVELER EEVELEREEV ELERDEVELD REEVGEERKE AEEDREEAED  120
EQQEEEKAEE QRHEGNVMDM VDETIETEAE AERVEGGHGP DVKDMVDQTF ETTDAVEEGK  180
GDEAEVHEAN VADREEGKEA AVEESEAIVT DMVDETIETE AMEEGKEAAE EGQEANVIVT  240
DMVDEIIGTT EAVEEGKVSE AEGHEANVKD MVDETIETEV AGEEGQEAEV EGREANATIE  300
TDAAVEEGQE AEAEGDEWNV TAKDMVDETI ETEVSGEEGQ EAEAEGHELN GTVTGMIDEA  360
IETTEAVEEA DVGDDVMEET TEAEETEAAE EEAEEMEADE TEAAEEEEAE ETNTACVGGK  420
RKRGKYSKAA SEKVLSKKKE EDVCFICFDG GELVLCDRRG CPKAYHPSCV NRDEAFFRAK  480
GRWNCGWHLC SNCEKNAQYM CYTCTFSLCK ACTKDAVILC VKGNKGFCET CMKTVMMIEK  540
NEHGNKDKDA VDFDDKSSWE YLFKDYWIDL KEKLSLTLND LTLAKNPWKG SAGHANKHGS  600
HEEPYDANND GGSDSDNSEN LDLTNSKRRK SKKRLKTRAK GKNSSSPATG SGGRSADDST  660
KWASKELLEF VMHMRNGDSS ALSQFDVQAL LLEYIKRNKL RDPRRKSQII CDLRLQSLFG  720
KPRVGHFEML KLLESHFLMK EDSQVDDLQG SVVDTEGNQL EAEGNSDTPA KASKDKKRKR  780
KKGEPQSNVE DFAAIDIHNI SLIYLRRNLV EDLLEDMDNF QDKVAGSFVR IRISGSEIVT  840
IDIISNQEFT EDECKRLRQS IKCGLINRLT VGDVQEKAVV LQPVRVKDWL ETETVRLQHL  900
RDRASEKGRR KEYPYRFEEC VEKLQLLKTP EERQRRLEET LEIHADPNMD PSYESEEDED  960
EGGDKRQESY TRPTGSGFGR KGREPISPRR GASSLNDSWS GARNFSNMNR DFSRNMSGKG  1020
MFNKAENTTG AGEILNDTWG QGRETLQTNQ WENKQNISSS ETGSRSTQSV GPSEASPAGA  1080
SENRVAPLSA GVAQSVSNMN ETEKIWHYQD PSGKVQGPFS MIQLRKWNNT GYFPPNLRVW  1140
KNTENQEDSI LVTDALAGKF QKDSSIPKAQ MVHDSHLTPA SSGKPQGAQL QQSSESQSGG  1200
VGWGSHNEIN SSTGRGTPSS VEVPKYSADA WGTTNFPSPT PSQTPISAAK RQAYENNWSP  1260
SPGGNGVLQS PAVLTPDSAV RVAGNDRSSS LPGTFQMHGQ LTVSGPVLAN ASMKPVPDVQ  1320
NIVSNLQNLV QSVSSRTTAS DTQAWGSGTV PGSESQPWGG APSQKIEPNN ATTVPAQHPA  1380
HGYWANAPPT NNGTTSINTG NSAGNFPAQG FSGVPNSDAW RPPVPSNQSY IQPPAQPQVP  1440
WGLSVSDNQS AVPRMGQESQ NAGWVPVAGN PNMSWGGPVP GNTNINWVAP SQGPGWTASG  1500
QGPVRGNAVP SWAPPGQGPP SVSANPGWAP PGQGPALGNA NSGWAAPTAN QTQNGDRFSN  1560
QRDRSSHGGD SGFGGGKPWN RQPSFGGGGG GGSTRPPLKG RVCRYYESGH CKKGASCDYL  1620
HPDH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1776781KKRKRK
2776782KKRKRKK
3778782RKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16485.10.0C3H family protein