PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP015714.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family C3H
Protein Properties Length: 2207 aa    MW: 245652 Da    pI: 7.7683
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP015714.1 genome GDR View CDS
Signature Domain
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 zf-C2H2 12.2 0.00054 1397 1419 3 23
                   ET..TTTEEESSHHHHHHHHHHT CS
      zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                   Cp   Cgk F ++ +L++H r+H
  PCP015714.1 1397 CPvkGCGKKFFSHKYLVQHRRVH 1419
                   9999*****************99 PP

2 zf-CCCH 29.9 9.7e-10 1513 1537 3 27
                   S---SGGGGTS--TTTTT-SS-SSS CS
      zf-CCCH    3 telCrffartGtCkyGdrCkFaHgp 27  
                   + +C++++r+G CkyG++C+++H++
  PCP015714.1 1513 QTECKYYLRSGGCKYGENCRYSHSR 1537
                   568********************95 PP

3 zf-CCCH 32.4 1.6e-10 1557 1582 1 26
                   --S---SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH    1 yktelCrffartGtCkyGdrCkFaHg 26  
                   +++++C++++r+G Cky ++C+F+H+
  PCP015714.1 1557 PGERECPYYMRNGSCKYASNCRFNHP 1582
                   5789*********************9 PP

4 zf-CCCH 31.1 4.1e-10 2064 2088 3 27
                   S---SGGGGTS--TTTTT-SS-SSS CS
      zf-CCCH    3 telCrffartGtCkyGdrCkFaHgp 27  
                   + +C++++r+G CkyG++Ck++H++
  PCP015714.1 2064 QTECKYYLRSGGCKYGEKCKYSHSK 2088
                   568********************95 PP

5 zf-CCCH 27.7 4.6e-09 2103 2127 2 26
                   -S---SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                   + ++Cr++ r+G Cky ++C+F+H+
  PCP015714.1 2103 GKRECRYYIRNGSCKYASNCRFNHP 2127
                   789*********************9 PP
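
The four zf-CCCH hits above share the same cysteine/histidine spacing in their matched segments, C-x8-C-x5-C-x3-H. The following is a minimal sketch that checks this with a regular expression. The pattern is an assumption read off these alignments, not the Pfam HMM itself, and the names `CCCH_PATTERN`, `SEGMENTS`, and `find_ccch` are hypothetical. The segments and coordinates are copied from the alignment rows.

```python
import re

# Spacing C-x8-C-x5-C-x3-H, read off the four zf-CCCH alignments
# (an assumption for illustration; the database uses a profile HMM).
CCCH_PATTERN = re.compile(r"C.{8}C.{5}C.{3}H")

# Matched segments keyed by (start, end) protein coordinates from the table.
SEGMENTS = {
    (1513, 1537): "QTECKYYLRSGGCKYGENCRYSHSR",
    (1557, 1582): "PGERECPYYMRNGSCKYASNCRFNHP",
    (2064, 2088): "QTECKYYLRSGGCKYGEKCKYSHSK",
    (2103, 2127): "GKRECRYYIRNGSCKYASNCRFNHP",
}


def find_ccch(segment, start):
    """Return 1-based protein coordinates of the first CCCH match, or None."""
    match = CCCH_PATTERN.search(segment)
    if match is None:
        return None
    # match.end() is exclusive, so subtract 1 for an inclusive end coordinate.
    return (start + match.start(), start + match.end() - 1)


for (start, end), segment in SEGMENTS.items():
    # Each segment length should agree with its Start/End columns.
    assert len(segment) == end - start + 1
    print(start, end, find_ccch(segment, start))
```

A fixed-spacing regex like this is far less sensitive than the profile HMM behind the scores and E-values in the table, so it is only a quick sanity check on the reported coordinates.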

Sequence
Protein Sequence    Length: 2207 aa
MAASSGLIAE ANPEVFPWLK TLPVAPEYHP TWAEFQDPIA YIFKIEKEAS QYGICKIVPP  60
VPPSTKKTTI GNLNRSLAAR AGVSGSSCPK SEPTFTTREQ QIGFCPRRPR PVHRPVWQSG  120
EQYTFQEFEV KAKSFEKSYL RKCSKKGGLS PLEIETLYWK ATVDKPFSVE YANDMPGSAF  180
MPLGARKSCS TSRDAGDNVT LGETAWNMRG VSRSKGSLLR FMKEEIPGVT SPMVYIAMLF  240
SWFAWHVEDH DLHSLNYLHM GAGKTWYGVP KDAAVAFEEV VRVQGYAGEI NPLVLRVSDA  300
PFAFLFTNLQ IYLPSVTFST LGEKTTVMSP EVFVSSGIPC CRLVQNAGEF VVTFPRAYHT  360
GFSHGFNCAE AANIATPEWL RVAKDAAIRR ASINYPPMVS HFQLLYDLAL ALCSRMPSRI  420
SSEPRSSRLK DKRKCEGDIV VKELFVQDVI QNNDLLNVLG KGSPIVLLPQ SSSDLSVCSK  480
LRVGSQSRVN PGFSQGSQSL REEMKSSGSV SDGLMIDRQQ VKGFYSVRGK LASPSESNTL  540
ASLSGSNNVR GLNSKRSNLN CERESNVEGE GLSDQRLFSC VTCGILSFAC VAIIQPTEEA  600
ARYLMSADCS FFNDWVVGSG LASEVFPVAT GDPITSKNAP CTGLEEYNAP AGLYDVLVQS  660
GDCQIQVVDQ GNEVVSNTEM PRETSALGLV ALNYGNSSDS EEDQVEPDVP VCSDEPNMTN  720
CSLESRYRDQ SASPPWRNPY AGTSGAHSPS SQGSGCENEL HLQTFDHYAT DGRKIANFKD  780
SSLQNFDCSA DFKTNNSAST ATGFGKAIVP IQKKSMSFHP GCDEDSSRMH VFCLEHAVEV  840
EQQLRSIGGV HILLLCHPDY PRIEDEAKSM AEELGISYLW NDMAFMNAAK EDETRIQLAL  900
DSEEAIAGNG DWAVKLGINL FYSASLSRSH LYSKQMPYNS VTYKAFGRSS PASSPTRIDA  960
YGRRGGKPKK VVAGKWCGKV WMSNQVHSFL VKRDPEEEVE VAEDEEERTF CAWAMPDEDH  1020
EVKSEITRKT EKTVKKYARK RKMTADTRTT KKARCFDKED AVSDYSVDDN SLQQQRRLPK  1080
SKQAKHTESG RTKKPKHVET EDAVSDDSMQ DDDSLQQNGR FLHSEQVKYI ERSDVSDDSM  1140
GVESHQQHKR TAKSKQFKPV ETDVVSDDSF EGSSHQPQRV LRSKTTKCTG RENLISEDVH  1200
GFGSHQQRRS ISRSKQARAR FIEREDTALD ETPEDNFQQH KRILQNKQTK PETRGKMRQE  1260
TPRQVKQGTA PLVKQGTRTP RNQQTPQQTK KQTPRLRNNQ SEQNSFDLYA EEETEGGPST  1320
RLRKRAPKPS KVPGTKPKEQ QQTARKKAKN ASAGKAQGGR NETKLKEEDA EFVCDFEGCT  1380
MSFASRHDLS LHKRNICPVK GCGKKFFSHK YLVQHRRVHT DDRPLRCPWK GCKMTFKWAW  1440
ARTEHIRVHT GARPYVCAEP GCAQTFRFVS DFSRHKRKTG HSVKKREKGR ARVSPVPKER  1500
VKDKDESAEN PSQTECKYYL RSGGCKYGEN CRYSHSREKP SVAPVLELNF LGLPIRPGER  1560
ECPYYMRNGS CKYASNCRFN HPDPTTTGGS NRPSGYGNGG PASLQGASQS TVAPWSAPRP  1620
LNEAPVYSTL MIPPPQVVSS QNSEWNGYQD PAYVPERSMP ARPPYMMNKQ KFSEGVEVPL  1680
NPTSGPSPAT SDLDHEILDW IRTLDPAVLN RIRNPDPALL HEIHQFNPAV EIRNFDPSIV  1740
DRIRNLDPAF PSDFSESVDV PQNPPSVPSP EPSVVEILDW IGTLGPAVLD EIQQFDPAFL  1800
DEDRSFDPSI VDGIPKLSLK KKEEKAEEEE KERETEREKE NERERESEKS ESSSGGGGGN  1860
ENGGEVEEEV VEESSRRDQY PLRLGDCRYH LRTGNCSPPL PDPSQGAVLR SISSSSFFLE  1920
FPQFSTRRLG FGDVRRAARF GCWVLHGDGE GEGWVRQTFS RFPDGRRRGG RNPSEGGGNP  1980
SGENKKLMLT SSRPGIGSAD FCWAACWPPP EPSGFFGMDR VFGEGGEKMG CCPAKRGRAG  2040
VALSICVPKE SVKDKDESAQ NPSQTECKYY LRSGGCKYGE KCKYSHSKVK LELNFLGLPI  2100
RPGKRECRYY IRNGSCKYAS NCRFNHPDPT DTGESDPSSG YGNGGPAFLQ GASQSTVAPW  2160
SAPRPLNEDP VYSTLMIPPP QVVSSQNSEW NGYQVLLFNF PISESRV
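
The sequence above is displayed in blocks of ten residues with a running count at the end of each full row. As a minimal sketch, the helper below (the hypothetical `clean_sequence_line`) strips the spaces and the trailing count so the rows can be joined into one string. It is shown on the first and last rows only. The 36 full rows of 60 residues plus the 47-residue final row account for the stated length of 2207 aa.

```python
def clean_sequence_line(line):
    """Drop block spacing and the trailing running count from one display row."""
    # Residue blocks are purely alphabetic; the running count is numeric.
    return "".join(token for token in line.split() if token.isalpha())


def join_sequence(lines):
    """Concatenate cleaned display rows into a single sequence string."""
    return "".join(clean_sequence_line(line) for line in lines)


# First and last display rows, copied from the record.
first_row = "MAASSGLIAE ANPEVFPWLK TLPVAPEYHP TWAEFQDPIA YIFKIEKEAS QYGICKIVPP  60"
last_row = "SAPRPLNEDP VYSTLMIPPP QVVSSQNSEW NGYQVLLFNF PISESRV"

print(len(clean_sequence_line(first_row)))
print(len(clean_sequence_line(last_row)))
print(join_sequence([first_row, last_row])[:10])
```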
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT3G48430.1 0.0 C2H2 family protein