PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID ONIVA03G01110.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa
Family C3H
Protein Properties Length: 1800aa    MW: 198127 Da    PI: 5.383
Description C3H family protein
Gene Model
Gene Model ID      Type    Source         Coding Sequence
ONIVA03G01110.1    genome  EnsemblPlants  View CDS
Signature Domain
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-CCCH  19.5   1.7e-06  1780   1800  5          26
                       --SGGGGTS--TTTTT-SS-SS CS
          zf-CCCH    5 lCrffartGtCkyGdrCkFaHg 26  
                       +C+ f+++G+C+ G++C++ H+
  ONIVA03G01110.1 1780 ICK-FHENGYCRKGASCNYLHP 1800
                       799.9999*************8 PP

Sequence
Protein Sequence    Length: 1800 aa
MEGAAAAAGG GEMLSPGEAD WPPELRLPPP PPPSAASEGE PPPARAAVGM DDSQFLGSII  60
GLPAQPPQAT AEALAVVGVK RRRGRPPKKR DGAAAATAVV PAARPARRRE DEEEVVCFIC  120
FDGGNLVVCD RRGCPKVYHP ACIKRDEAFF QSRSKWNCGW HICSSCEKAV HYMCYTCTYS  180
LCKVCIKQGK FFSVRGTKGF CDTCYSTILL IESKDEGDTK IVVDFDDQNS WEYLFKLYWV  240
DLKGKLSLTL EELTSAKARW NAPTTYTRKE KDESSDDLYD ANNDDDAGSD CSSGKRKRNS  300
SRKKGRKRRK PDSDCSIATK KVETVTRDDG TLPNKVPTEE ASLPVDTKWA SPELLEFVGH  360
MRDGDQSFIS QFDVQALLLD YIKQNNLRDP QRKSQIICDS RLHRLFRKTR VAHFEMLKLL  420
EMHFIVSEPS AVNDGSQGII NPDSAQIDHA SGYNDMAAKF SPDRRRRMHR KMEREPQANP  480
EDYAAIDMHN INLIYLRRSL MEDLIDDPTL SDKISGAFVR IRISGLGQKQ DMYRLVKVVG  540
THKVSEKYSI GKKMTNFALE IMNLNKKEII TMDTVSNQDF TEEECKRLRQ SMKYDLISRL  600
KVGDIQEKAK IFQFVRVNDW FENEKQKLCH LRDRASETGQ CVEKLQLLNT PEERARRINE  660
VLDVHVDSHM DPDYESDDEF GNKKAVERSV NWARSDPFVS PVKVKYSNSS QKNGDATRHL  720
KNLSKQNTER KSGAARNFEN SHSPVGMDIP KSGTNVKSTR CETTSPSSHG VVSSDMEPEK  780
VWHYKDPSGN VQGPFTLVQL SKWTSYFPRD MRVWLTFESE ERSLLLTEVL SKQPKDFGQP  840
ASVTTSSKST VADTGQNRNT EIVDLNKAPS PVGYSMLNSF ETTVQSTKHS APERESVNSL  900
DDRLSHSTDS VPPKDANASN SQAMCQIKHS GSLPSPGSPH QRSDLHHDEV QGGRSGEWNN  960
QHNSELWSPS MPQTSSSAHS NVESHHDHYP SWSQVQHDPK NSLQAGSGKD LNSRYDIAQK  1020
LPSQRITRDV PSPVFAWSPS ESRTASSQHE GSCLSSTTNL CTHDELHSSI ASAKAKSFAP  1080
ATPVEDRGSS SPSGMLSLSE RAPICSPQSA PSASASDTCK MEENMNQQKT LEADISNTSV  1140
NQSPQSKILP ESSPDNQDAE HEYRSPPPIS ESKELSPQSR TTPGSSPDNQ DTEREYPSPP  1200
PISGSKEISP QSRTILESSP DNQDNGHEYP SPPPIPESIE LSPHSKALPE SSPDNQDIEP  1260
ECPSPPQIPE SKELSRQSKI LPESSPGNQD IEPECPSPPQ IPESKELSQQ SKILPESSPD  1320
NHDIKCEYSS PTPIPESKEL SLQSKILPES SSDYQDIKCE DPSPTPISKS KEVSPQSKIL  1380
SESYLDNQDV ECKCPSSILI TESKELAVDL PGSISLAPEK TASTDVGENS SLAFIFPKST  1440
LAGDDALKSV FDMAKAHLEC EDSKVKEELY VESTVVIRDD MVVNPASGVE SIDMSENLLE  1500
SLMEQSCGTF YMDGTTALEG FLSGSTKEEP QCSSPIALST CSSPIALSPW GEHGYYQGDS  1560
VGSSLWGVQD DDPIGNIWPL SSQAPALQYS SGSTAHFIDE ATVTHGNNGV VLSSTPGEVG  1620
LPNSGVCTDW GLVEQVNPEA NDASVSMIDK NSGLVDSQPS ANDGSDVGTA RNTNHNTNLS  1680
LNHETAVPLS RSSGEASRKH GFITDLNVAT SEEALGNTKN WNPSAGNANR GSQRNHHRDR  1740
YSQISESWLL SSNYSRSRSD GFGTGGSSRS TPRGQTQRGI CKFHENGYCR KGASCNYLHP
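
The sequence above is printed in groups of 10 residues with a running position counter at the end of each line, while the domain and NLS tables use 1-based, inclusive coordinates. The sketch below is an illustration only, not part of PlantTFDB: the helper names `parse_grouped_sequence` and `slice_1based` are hypothetical. It shows one way to strip the layout and pull out a coordinate range, using the last two sequence lines (residues 1681-1800) and the zf-CCCH domain coordinates 1780-1800.

```python
def parse_grouped_sequence(lines):
    """Join 10-residue groups into one string, skipping numeric position counters."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isdigit():  # running counter at end of line, e.g. "1740"
                continue
            residues.append(token)
    return "".join(residues)


def slice_1based(seq, start, end, first_residue=1):
    """Return residues start..end (1-based, inclusive).

    first_residue is the position of seq[0] in the full protein, so a
    fragment of the sequence can be sliced with full-length coordinates.
    """
    lo = start - first_residue
    hi = end - first_residue + 1
    if lo < 0 or hi > len(seq):
        raise ValueError("requested range falls outside the supplied fragment")
    return seq[lo:hi]


# Last two lines of the protein sequence block (residues 1681-1800).
tail_lines = [
    "LNHETAVPLS RSSGEASRKH GFITDLNVAT SEEALGNTKN WNPSAGNANR GSQRNHHRDR  1740",
    "YSQISESWLL SSNYSRSRSD GFGTGGSSRS TPRGQTQRGI CKFHENGYCR KGASCNYLHP",
]
tail = parse_grouped_sequence(tail_lines)

# zf-CCCH domain coordinates taken from the Signature Domain table.
domain = slice_1based(tail, 1780, 1800, first_residue=1681)
print(domain)
```

The extracted fragment can be compared with the query row of the domain alignment after removing its gap character (`-`).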
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    295    310  KRKRNSSRKKGRKRRK
2    296    311  KRKRNSSRKKGRKRRK
3    296    311  RKRNSSRKKGRKRRKP
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G16485.1  0.0      C3H family protein