PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OMO96830
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family C2H2
Protein Properties Length: 1579 aa    MW: 175294 Da    pI: 6.331
Description C2H2 family protein
Gene Model
Gene Model ID  Type    Source         Coding Sequence
OMO96830       genome  EnsemblPlants  View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  11.9   0.0007   1464   1489  1          23
                EEET..TTTEEESSHHHHHHHHHH.T CS
   zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                ++C    C++sF +k++L+ H r+ +
  OMO96830 1464 HRCDleGCHMSFETKEELRLHKRNrC 1489
                789999****************9877 PP

2    zf-C2H2  11.5   0.0009   1547   1573  1          23
                EEET..TTTEEESSHHHHHHHHHH..T CS
   zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                y+C+   Cg sF+  s++ rH r+  H
  OMO96830 1547 YQCKveGCGLSFRFVSDFSRHRRKtgH 1573
                99*********************9777 PP

Sequence
Protein Sequence    Length: 1579 aa
MGNVEIPKWL KGMPLAPEFR PTDTEFADPI AYISKIEKEA GAYGICKIIP PMPKPSKKYV  60
FNNLNRSLSK CPELGRDVDV SKNVASVLSS GDSGDDEGEG RAVFTTRQQE LGQSGKKARG  120
TVSSQQCVVQ KQVWQSGEIY TLEQFESKSK TFAKGLLSVL KEVSPLHIEA LFWKAASEKP  180
IYVEYANDVP GSAFGEAEGQ FQYFHRRRRR KRMSYRRESS GCKKDEIDTV KNSHLDDSKD  240
ASVKRDPDTC LETPKSCTTP SILASDKNSH SKRKSGNASC DMEGTAGWKL SNSPWNLQVV  300
ARSAGSLTRF MPDDIPGVTS PMVYIGMLFS WFAWHVEDHE LHSMNFLHTG SSKTWYAVPG  360
DYAFAFEEVI RTEAYGGNMD RLAALSLLGE KTTLLSPELI VASGIPCCRL IQNPGEFVVT  420
FPRAYHVGFS HGFNCGEAAN FGTPQWLEVA KEAAVRRAAM NYLPMLSHQQ LLYLLTMSFV  480
SRVPRSLLPG ARSSRFRDRQ KEEREVLVKK AFIEDMLTEN KLLSLLLKRG SSYRGIVWHP  540
DLLPYTSKDS EFSSGTAIST TPQENVLDIH SENNTNQTSL LDEMNLYMEN VNYLYLNDDD  600
LSCDFQVDSG TLACVACGIL GYPFMSVLQP CKGEKVELSP VDHLSIQGPT ASELKNAHFY  660
PDLDNPVECS VSDNDHHAAD LSLPSKDAPS PSITKFPEGW DTSNKYLRPR IFCLEHAVQV  720
EELLQSKGGS KLLVICHSDY QKIKAHAIPV AEDIGVPFNY NDVPLDAASQ EDLNLINLAI  780
DDERDEIGED WTSKLGLNLR YCVKIRKNSP FKQVQHALPL SGVFSDKNGS SELFNIKWKS  840
RKSRSRGKLS HPSPSKPRES VELKVDEVLV EKRDGNITDN EKKLICYSRR KKRKPDYSTK  900
AGGGLELVKH DLTRDDYAAS CQLLDVHGGN TCEANATSES SRLYSTPGAS RGQFEIQTAS  960
IVGVVQEDQG QNLEDSNLYG GSYSLVDGVS SEKKGDVKLM DTTSENDKIS LADKCSRCCD  1020
STAYERLVGS TCAEREVCNP LSEGQCEKHA DSYDLMSPSN TASSHPAEPS AGRFDPELDD  1080
KAVEKSCVNG GVSSFMTYNN KMEQETDGHC KNSDEEILCD NSLMNKPHLG SEDCSSALYL  1140
GDEVQQETDA RSGRQVEPFF SSPMLTMRPN TNSVEKGSEV PRQPGTAAES CDGAMSNNQA  1200
EKLGVLNASK EDTLSVSIAP VAVDLPTTSS EAVDSAISKN PCAKEDMHTD VTLDVVGLQE  1260
IQDTKATCVD EEVMSRSELP IKGKQSSPTV METCSKGHRE SSDQEKSCHE ATADDDRHGK  1320
DLIRNEKNEE ESVSCFVTPI NETSPVPIQK YSRTRRESHA TGNMNNGGTG LSVENRELES  1380
AVVDCRSSAV NGRKRRREVE ETPEKVDGNG FIRSPCEGLR PRARKDATSS IDVGKTALEM  1440
LPEKDTKKTS IHTCSKKITK KGSHRCDLEG CHMSFETKEE LRLHKRNRCP YEGCGKKFRS  1500
HKYAILHQRV HEDDRPLKCP WKGCSMSFKW AWARTEHIRV HTGERPYQCK VEGCGLSFRF  1560
VSDFSRHRRK TGHYADSSA
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    206    212   RRRRRKR
2    1393   1397  RKRRR
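
The Start and End columns in this record appear to be 1-based and inclusive: slicing the protein sequence at residues 206 to 212 gives RRRRRKR, and residues 1393 to 1397 give RKRRR. The sketch below checks this against the two relevant rows of the sequence block. The helper name `slice_1based` and the row constants are illustrative assumptions, not part of PlantTFDB.

```python
# Two rows of the OMO96830 protein sequence, copied from the sequence block
# with the 10-residue spacing removed. Keys are the 1-based position of the
# first residue in each row (assumption: rows hold 60 residues each).
ROWS = {
    181: "IYVEYANDVPGSAFGEAEGQFQYFHRRRRRKRMSYRRESSGCKKDEIDTVKNSHLDDSKD",
    1381: "AVVDCRSSAVNGRKRRREVEETPEKVDGNGFIRSPCEGLRPRARKDATSSIDVGKTALEM",
}


def slice_1based(row: str, row_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a row of the
    sequence whose first residue sits at position row_start."""
    lo = start - row_start        # convert to a 0-based offset within the row
    hi = end - row_start + 1      # inclusive end becomes an exclusive bound
    if lo < 0 or hi > len(row):
        raise ValueError("requested range falls outside this row")
    return row[lo:hi]


nls1 = slice_1based(ROWS[181], 181, 206, 212)
nls2 = slice_1based(ROWS[1381], 1381, 1393, 1397)
print(nls1, nls2)
```

The same conversion would apply to the Start and End columns of the signature domain table, assuming they follow the same convention.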
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G04240.1  0.0      C2H2 family protein