PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022729420.1
Organism
Taxonomic ID
Taxonomic Lineage cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C2H2
Protein Properties Length: 1662 aa    MW: 188474 Da    pI: 9.2639
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022729420.1 genome NCBI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 zf-C2H2 11.6 0.00083 1546 1571 1 23
                      EEET..TTTEEESSHHHHHHHHHH.T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                      y+C    C++sF +k +L+ H r+ +
  XP_022729420.1 1546 YQCDmeGCTMSFGSKQDLVLHKRNiC 1571
                      99********************9877 PP

2 zf-C2H2 11.3 0.001 1571 1593 3 23
                      ET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                      Cp   Cgk F ++ +L++H r+H
  XP_022729420.1 1571 CPvkGCGKKFLSHKYLVQHRRVH 1593
                      9999*****************99 PP

3 zf-C2H2 12.4 0.00049 1629 1655 1 23
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C+   Cg++F+  s++ rH r+  H
  XP_022729420.1 1629 YVCGeaGCGQTFRFVSDFSRHKRKtgH 1655
                      99********************99666 PP
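The three hits in the Signature Domain table can be held as structured records, which makes the coordinate arithmetic explicit. The following is a minimal sketch, not part of PlantTFDB: the `DomainHit` class and the `shared_residues` helper are hypothetical names introduced here, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the aligned segments shown above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the Signature Domain table (hypothetical container).

    Assumption: Start/End are 1-based, inclusive protein coordinates.
    """
    number: int
    domain: str
    score: float
    e_value: float
    start: int
    end: int
    hmm_start: int
    hmm_end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1 to the difference.
        return self.end - self.start + 1


# Values copied from the table rows for XP_022729420.1.
HITS = [
    DomainHit(1, "zf-C2H2", 11.6, 0.00083, 1546, 1571, 1, 23),
    DomainHit(2, "zf-C2H2", 11.3, 0.001, 1571, 1593, 3, 23),
    DomainHit(3, "zf-C2H2", 12.4, 0.00049, 1629, 1655, 1, 23),
]


def shared_residues(a: DomainHit, b: DomainHit) -> int:
    """Number of residues covered by both hits (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


for hit in HITS:
    print(hit.number, hit.domain, hit.start, hit.end, hit.length)
```

Under these assumptions, hits 1 and 2 share exactly one residue (position 1571), which matches the alignments: the first hit ends on the cysteine that opens the second.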

Sequence
Protein Sequence    Length: 1662 aa
MAGSSLSPEP SQEVFSWLKS LPLAPEYRPT LAEFQDPIAY IFKIEKEASL YGICKIIPPV  60
PAAPKKTAIG NLNRSLLARA TANASSDLKL AAPTFTTRQQ QIGFCPRKPR PVQKPVWQSG  120
DYYTFQEFEE KAKRFERNYL KRYSKKVSLS ALEVETLFWK ATVDKPSSVE YANDMPGSAF  180
GPLNSKKSSG AGREAGEGVT VGETPWNMRA VSRAKGSLLR FMKEEIPGVT SPMVYFAMLF  240
SWFAWHAEDH DLHSLSYLHV GAGKTWYGVP RNAAVAFEEV VRVDGYGGEF NPLVTFSTLG  300
EKTTVMSPEV FVLAGIPCCR LVQNAGEFVV TFPRAYHSGF SHGFNFGEAA NIATPEWLRV  360
ARDAAIRRAS INYPPMISHF QLLYDLALEL CSRVPMSISA KPKSSRLKDK KKSEGETLVK  420
ELFVQNIRQN NDLLHILGKG SSVVLLPKSS SDISLCSDLR AASQLRINHR MSLDFCNYKE  480
VVKSSKDLAS DKIMLGGNEE IKGVKGFYSM KGKFAPMYEG NQNSSFSGTD YFCRLPLETL  540
NARMERETAV QGDALSDQRL FSCVTCGILC FACVAILQPT EQAARYLMSA DCSFLNDWTV  600
GSGVTRDGFT AAHGDAITSE QNPSTSWTNK GAPNALYDVP IQSVEYKFQM VDRSNQVMED  660
AVKGGDTSAL GLLASIYGNS SDSEEDHVEP NATISGDETN SAKVSPERKF PYNDFGFRPV  720
GTNESHNPSL SRLDSGEEAP VHVINCYSEP GSRRVDIKNR SPQTFDLAVE FETDSLASRR  780
SNGLEDKSRD PITASHANLS YSPATHGTEK MRFSNAIEPM ENADIPFAPR SDEDSSRMHV  840
FCLEHAVEVE KQLRKIGGVH VFLLCHPEYP KIEAEAKLVA EELGIDYPRN DILFGDATKD  900
DEQRIQSALD SEDAIPGNGD WAVKLGINLF YSANLSRSTL YSKQMPYNCV IYNAFGRNSP  960
ASLPTKWNVY GRRSGKQKKV VAGKWCGKVW MSNQVHPFLA QMDPEEQEQE RSFHAWATSN  1020
ENLERKPENV QKAEITQVAK MFKRKRKTRA EIAPSKKVIC IEQEGAVSDD SLDDSSLRQQ  1080
QRFFRGKQPS LIEKEEAISY DSLEDDSLLQ QRNLFRKKRT KFIEKEDAES EDAEDDFTHL  1140
QHSRNLRGRL GKYIEEDDAV SCDSLDESSL KRDRRIRGNW QAKFKESRDI VSDDEQDEIS  1200
HQVHRRIPRG RQMKSFERND ANSYDSRADN SLKQYRRIPK GKQAKFFERD DAMSDDASDY  1260
DSQSQLRRIP RAKQMKYMER DDAFSDDSLE DNPQQQQKRI PRSKVAKFTD REDVVSFDSL  1320
KSNSHRQHSR VPRSQLTKFI EREDAVSSDS PDDSFLQQLW RIRRSKQTKI LDKEDAVSDD  1380
SLDGTSLQQF RKTPRSRQRK FIEREDAISY DSLDGNYHQP NRSIRSRKKK AQTQRQIKQE  1440
TPRNVKQGKR RTTKQVVSQQ MKQETPRNWN TKIEQSARQC NSYGEEEIEG GPSSRLRKRV  1500
RKPLKESETK PIEKKQARKK KVKNASNVKT LAGHNTAKVR HKEAEYQCDM EGCTMSFGSK  1560
QDLVLHKRNI CPVKGCGKKF LSHKYLVQHR RVHMDDRPLK CPWKGCKMTF KWAWARTEHI  1620
RVHTGARPYV CGEAGCGQTF RFVSDFSRHK RKTGHSAKKG RR
Nuclear Localization Signal
NLS
No. Start End Sequence
1 1495 1501 RLRKRVR
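The reported coordinates can be checked against the protein sequence by stripping the display formatting (10-residue blocks and trailing position counters) and slicing with 1-based, inclusive positions. This is a minimal sketch under stated assumptions: `parse_sequence_block` and `slice_1based` are hypothetical helpers, and only the three sequence lines covering residues 1441 to 1620 are embedded, so an offset of 1440 is applied.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue blocks, dropping whitespace and numeric position counters."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if not token.isdigit():  # skip counters such as "1500"
                residues.append(token)
    return "".join(residues)


def slice_1based(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede `seq` in the full
    protein, for the case where only an excerpt is available.
    """
    return seq[start - 1 - offset:end - offset]


# Excerpt of the record: residues 1441-1620 of XP_022729420.1.
EXCERPT = """
TPRNVKQGKR RTTKQVVSQQ MKQETPRNWN TKIEQSARQC NSYGEEEIEG GPSSRLRKRV  1500
RKPLKESETK PIEKKQARKK KVKNASNVKT LAGHNTAKVR HKEAEYQCDM EGCTMSFGSK  1560
QDLVLHKRNI CPVKGCGKKF LSHKYLVQHR RVHMDDRPLK CPWKGCKMTF KWAWARTEHI  1620
"""
EXCERPT_OFFSET = 1440

excerpt_seq = parse_sequence_block(EXCERPT)

# NLS row: Start 1495, End 1501.
print(slice_1based(excerpt_seq, 1495, 1501, EXCERPT_OFFSET))
# First zf-C2H2 hit: Start 1546, End 1571.
print(slice_1based(excerpt_seq, 1546, 1571, EXCERPT_OFFSET))
```

With the full 1662-residue sequence the same helpers apply with `offset=0`.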
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT3G48430.1 0.0 C2H2 family protein