PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr6g0267521
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C3H
Protein Properties Length: 1749aa    MW: 190919 Da    PI: 4.3288
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr6g0267521genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH17.38.4e-0617241747226
                              -S---SGGGGTS--TTTTT-SS-SS CS
                 zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                              k + Cr ++++G+Ck G++C + H+
  RcHm_v2.0_Chr6g0267521 1724 KGRVCR-YFESGHCKKGSSCDYLHP 1747
                              7789**.8888*************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1749 aa     Download sequence    
MAAEAEREEV EAVEVVEAER EEVGGVAEEV QVDREEEEEV KLEGEGVELE KAEVELEREE  60
IREEEEKAEE QGRVTDMVDE TIETEGRAEA EAVEGGHVSD LNLKDTVDQT LETSDGADVE  120
DTLETTDAVE EGKGNEAEVH EVNATDMGDE TNGTEAVEEG KEGVVTDMVD ETIESEAMEE  180
EGNGCAAEGH EANVTVTESD EIIETTEAVE EGKLSEAEGD EANVKDMVHE SIETEAAGEE  240
GQKAEVEGQE ADAAIEAEEA VEVGQEAEAE GDEKNLNVKD MVDGAVETQV SREEEQEAEV  300
EEHEANTTIE DEEVGQEAAA EGGERNLNVK DMVDDTIETQ VSGEVGQEAE AEGGEQNLNV  360
KDMVDDTIQT QVSGEEEQEA EVEGHELTAT ATNMVDEADE AVVTTEAVEA DVSDDVMEET  420
TEAEETEAAE GAEEMEADEM EAAEEEEAEE MEMVEDTNTS GVGGKRKRRK NSKAAASEKV  480
LSKKKEEDVC FICFDGVSSC FVTGGQGCPK AYHPSCVNRD EAFFRAKGRW NCGWHLCSNC  540
EKNAHYMCYT CTFSLCKACT KDAVILCIKG NKGFCETCMK TVMLIEKNEH GNKEKDAVDF  600
DDKSSWEYLF KDYWTDLKER LSLTLDDLSQ AKNPWKGSAG HANKHGSHDE PYDANIDGGS  660
DSDNSENLDS TNSKRRKGKK RLKTRAKGKN SSSPATGSGG RYADDNTEWA SKELLEFVMH  720
MRNGDSSALS QFDVQALLLE YIKRNKLRDP RRKSQIICDL RLQSLFGKPR VGHFEMLKLL  780
ESHFFMKEDS QVDDHQGSVV DTEGNQLEAD GHSDTPAKAS KDKKRKRKKN EPQSNVEDFA  840
AIDIHNINLI YLRRNLVEDL LEDTDNFQEK VAGSFVRIRI SGSGQKQDLY RLVQVIGTCK  900
AAEPYKVGKR MTDILLEILN LNKTEIVTID IISNQEFTED ECKRLRQSIK CGLINRLTVG  960
DVQEKAVVLQ PVRVKDWLET ETVRLQHLRD RASEKGRRKE YPFLFQFNIY SCEFVHLFLL  1020
LEGSAAYFLL LFLLYLLSDK MRDSCELLKT PEERQRRLEE TLEIHADPNM DPSYESEEDE  1080
DEGGDKREES YTRPTGSGFG RKGREPISPR RGGSSLNDSW SGTRNFSNMN RDFGRSMSGK  1140
GIFNKAENTT GAGEIVNDAW GQGRETPQTN YWENKQNISS LETGSRSTQS VVPSEASPAR  1200
APENRAAPIS TGVAQSVANI NETEKIWHYQ DPSGKVQGPF SMIQLRKWNN TGYFPPNLRV  1260
WKNTDKQEDS ILVTDALVGK FQKDPSIAKA QMVHDSHLTP ASSGKAQGAQ LQQSLESQSG  1320
GGSWGAHNEI ISSTGRGTPS SVEVPKYTSD GWGTTNFPSP TPSQTPISGA KRQAYENNWP  1380
PSPGGNGVMQ SHAVLTPETA MRVSGNDPST SLTGMTAAPN ALQMHGQVTV SGPVLANASV  1440
KPLPDVQNIV SNLQNLVQSV TNRTTANDTR AWGSGTVPGS ESQPWGGAPS QKIEPNNATN  1500
VPAQHPSHGY WPPTNNGTSS INTGNSAGNF PAQGFSGVPN SDAWRPPVPS NQSYIQPPAQ  1560
PQVPWGSSVS DNQSTVPRMG QESQNSGWVP VAGNPNVAWG GPVPGNSNMN WAAPSQSPGW  1620
SASGQVPVRG NAVPSWAPPP GQGPPSVSAN PGWAPPGQGP ALGNAISGWS APTANQTQNG  1680
DRFSNQRDRG SHGGDSGFGG GKPWNRQPSF GSGGGGGSSR PPFKGRVCRY FESGHCKKGS  1740
SCDYLHPDH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1465470KRKRRK
2823828KKRKRK
3823829KKRKRKK
4823830KKRKRKKN
5825829RKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16485.10.0C3H family protein