PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022770015.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C2H2
Protein Properties Length: 1658 aa    MW: 187902 Da    pI: 9.0827
Description C2H2 family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_022770015.1  genome  NCBI    View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  12.8   0.00037  1567   1589  3          23
                      ET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                      Cp   Cgk F ++ +L++H r+H
  XP_022770015.1 1567 CPvkGCGKKFFSHKYLVQHRRVH 1589
                      9999*****************99 PP

2    zf-C2H2  11.6   0.00086  1625   1651  1          23
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C    Cg++F+  s++ rH r+  H
  XP_022770015.1 1625 YVCAeeGCGQTFRFVSDFSRHKRKtgH 1651
                      89********************99666 PP

Sequence
Protein Sequence    Length: 1658 aa
MAASNLSPEP SQEVFSWLKS LPLAPEYRPS LAEFQDPIAY IFKIEKEASQ YGICKIIPPV  60
PPAPKRTAIG NLNRSLVARA AANASSDSKP APTFTTRQQQ IGFCPRKQRP VQKPVWQSGE  120
YYTFQEFEAK AKSFERNYLK RYSKKGSLSA LEVETLFWKA TVDKPFSVEY ANDMPGSAFV  180
PLDSKKSSVG GREAGEGVTV GETPWNMRAV SRAKGSLLRF MKEEIPGVTS PMVYIAMLFS  240
WFAWHVEDHD LHSLNYLHMG AAKTWYGVPR DAAVAFEEVV RVDGYAGEFN PLVTFSTLGE  300
KTTVMSPEVF ICAGVPCCRL VQNAGEFVVT FPRAYHSGFS HGFNFGEAAN IATPEWLRVA  360
RDAAIRRASI NYPPMVSHFQ LLYDLALELC SRVPMSIGAK PKSSRLKDKN KSEGETLVKK  420
LFVQNLIQNN DLLHILGKGS SVVLLPKCSS DSSLCSELRV ASQLRTNPRI SLGLCNYKEV  480
RESSETLASD EIMLGRSEEI KAVKGFYSVK GKFVTIYEGN RDSSFRGTDY TCRLPTQTLN  540
TNIERESAVQ ADALSDQRLF SCVTCGILCF DCIAVLQPTE QAARYLMSAD CSFFNDWTVG  600
SGLTHDGFTA AHGDAITSEQ NPCTRRINRS APNALYDVPV QSVEYKFQMV DQTNQVVEDT  660
KEGGDTSALG LLASAYGNSS DSEDHVEPNA TVFGDETSSA NTSPGRKFQY NGSGFSPGNA  720
NRSHTSSLSR LDSEEEAPIH VTDSCSEPGS RRVDIKYRSP QTVDRAVEFE TDNLASGRSN  780
SLEDKFRDPI TVSHANPSCS PATHGTEKMR CSKAIAPMEN ADIPYAPRSD EDSSRMHVFC  840
LEHAVEAEQQ LRQIGGVHVF LLCHPEYPKI EAEAKLVAEE LEIDYQWNDI LFGDATKDDE  900
ERIHSALDSE DALHGNGDWA VKLGINLFYS ANLSRSTLYS KQMPYNCVIY SAFGRKSPAS  960
SPTKLNRYGR RSSKQKKVVA GKWCGKVWMS NQVHPFLAQR DPDEQEQERC FHAWATSDEN  1020
LERKRENVQK AETTKVAKVF NRKRKMRAEI APSKKVTCIE PQGAVSDDSL DGSSLRQQQR  1080
FFRGKQPRLI EKEEAISYDS LEDDSLLLCR NLSRRKQAKF LEREAAESED EEEDFNAQQH  1140
WRNLRGKQGK YFEEDDVVSG DSLDESSLKQ YRRIRRSRQA KFMEVEDAVS DDEQEEISHQ  1200
LQRRIPRGRQ IKSFERNDAI SDYSRADNAL KQYRRISKGK HPKFFERDYA MSDDASDDDS  1260
QRHLRRIPRG KHMKRMERDY AFSDDSLEDN PQQQHRRRSK VAKYTDREVV VSFGPLKGNS  1320
HQQHRKVPRS KRTKYIERED AVLSDSPGDS SLQQLRRIAR SKHTKILKRE DAVSDDSLDD  1380
TSPKQLRKTP RSRQGKFIER ENAVSYDSLE ENYHQPNRNL RSRKKKAQTP RQIQQETPQN  1440
VKQGKRRTTK QVVSQQMKLE TPRNRNTKIE QSARQCNSYG EDEIEGGPST RLRKRVRKPL  1500
KESEAKPKEK KQASKKKVKN ASNVKTLAGH NTAKVKDEEA EYQCDMEGCT MSFGLKQELV  1560
LHKRNICPVK GCGKKFFSHK YLVQHRRVHI DDRPLKCPWK GCKMTFKWAW ARTEHIRVHT  1620
GARPYVCAEE GCGQTFRFVS DFSRHKRKTG HSTKKARG
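
The Start and End columns of the Signature Domain table are 1-based positions in this protein sequence. As a minimal sketch, the grouped layout (blocks of 10 residues with a running position counter) can be collapsed to a plain residue string and the two zf-C2H2 regions sliced out. The helper names clean_sequence_block and slice_1based are hypothetical, not part of PlantTFDB, and the excerpt covers only residues 1561 to 1658, so the position of its first residue is passed as an offset.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Drop spaces, newlines and position counters; keep residue letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (inclusive, 1-based).

    offset is the 1-based position of seq[0] in the full protein; it is
    only needed when seq is an excerpt rather than the whole sequence.
    """
    return seq[start - offset : end - offset + 1]


# Last two lines of the sequence listing (residues 1561-1658).
tail_block = """
LHKRNICPVK GCGKKFFSHK YLVQHRRVHI DDRPLKCPWK GCKMTFKWAW ARTEHIRVHT  1620
GARPYVCAEE GCGQTFRFVS DFSRHKRKTG HSTKKARG
"""

tail = clean_sequence_block(tail_block)

# Coordinates taken from the Signature Domain table.
finger1 = slice_1based(tail, 1567, 1589, offset=1561)
finger2 = slice_1based(tail, 1625, 1651, offset=1561)

print(finger1)
print(finger2)
```

The extracted strings can be compared with the XP_022770015.1 rows of the domain alignments, where the same residues appear in mixed case.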
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1023   1045  RKRENVQKAETTKVAKVFNRKRK
2    1491   1497  RLRKRVR
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G48430.1  0.0      C2H2 family protein