PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_21865_BGI-A2_v1.0
Common Name F383_11176
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family C2H2
Protein Properties Length: 319 aa    MW: 36355.6 Da    pI: 6.8048
Description C2H2 family protein
Gene Model
Gene Model ID                 Type     Source
Cotton_A_21865_BGI-A2_v1.0    genome   BGI
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  12.8   0.00036  175    199  1          23
                                 EEETTTTEEESSHHHHHHHHHH..T CS
                     zf-C2H2   1 ykCpdCgksFsrksnLkrHirt..H 23 
                                 + C+ C+k ++  +++++H+++  H
  Cotton_A_21865_BGI-A2_v1.0 175 FYCELCNKQYKLAMEFEVHLSSydH 199
                                 79*****************987555 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM00443   8.5E-17  64     109  IPR000467    G-patch domain
PROSITE profile  PS50174   16.477   66     111  IPR000467    G-patch domain
Pfam             PF01585   1.8E-15  67     107  IPR000467    G-patch domain
SuperFamily      SSF57667  6.95E-5  173    205  No hit       No description
Pfam             PF12171   2.2E-5   175    202  IPR022755    Zinc finger, double-stranded RNA binding
PROSITE pattern  PS00028   0        177    199  IPR007087    Zinc finger, C2H2
Gene Ontology
GO Term     GO Category         GO Description
GO:0003676  Molecular Function  nucleic acid binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 319 aa
MDYRRYNSSQ EANLGTKQQI SEEQAYQDSL VEELAEDFRL PINHKPTENV DLDNVQQATL  60
DTKLNSSNVG FRLLQKMGWK GKGLGKDEQG IIEPIRSGIR DPKLGIGKQE EDDFFTAEEN  120
IQRRKLDIEV EETEEHAKKR EKAAYSIYVC LLIGMVLAER EQKIQTEVKE IRKVFYCELC  180
NKQYKLAMEF EVHLSSYDHN HRKRFKEMRE MHGSSSRDDR QKREQQRQER EMAKFAQMAG  240
ARKQQQQESR EESGPATTPA PAPAPASAIA TALADQEQRK TLKFGFSSKS SSSKNASGSA  300
VKKPKVAVAS VFGNDSDDE
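
The sequence block can be cross-checked against the length and domain coordinates reported above. The following is a minimal sketch, not part of the database record: the helper names `clean_sequence` and `region` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the regular expression is a hand translation of the PROSITE PS00028 consensus C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H.

```python
import re

# Sequence block as displayed, with spacing and position counters intact.
RAW = """
MDYRRYNSSQ EANLGTKQQI SEEQAYQDSL VEELAEDFRL PINHKPTENV DLDNVQQATL  60
DTKLNSSNVG FRLLQKMGWK GKGLGKDEQG IIEPIRSGIR DPKLGIGKQE EDDFFTAEEN  120
IQRRKLDIEV EETEEHAKKR EKAAYSIYVC LLIGMVLAER EQKIQTEVKE IRKVFYCELC  180
NKQYKLAMEF EVHLSSYDHN HRKRFKEMRE MHGSSSRDDR QKREQQRQER EMAKFAQMAG  240
ARKQQQQESR EESGPATTPA PAPAPASAIA TALADQEQRK TLKFGFSSKS SSSKNASGSA  300
VKKPKVAVAS VFGNDSDDE
"""


def clean_sequence(raw: str) -> str:
    """Keep residue letters only; drops spaces, newlines and position counters."""
    return "".join(ch for ch in raw if ch.isalpha())


def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return sequence[start - 1:end]


# Hand-written regex form of the PS00028 consensus (an assumption of this sketch).
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

seq = clean_sequence(RAW)
zf = region(seq, 175, 199)          # span of the zf-C2H2 signature domain hit
hit = C2H2_PATTERN.search(seq)      # first pattern match in the protein

print(len(seq))                     # 319
print(zf)                           # FYCELCNKQYKLAMEFEVHLSSYDH
if hit:
    print(hit.start() + 1, hit.end())   # 177 199 (1-based inclusive)
```

Under these assumptions the cleaned sequence has the stated 319 residues, the 175-199 slice reproduces the aligned zf-C2H2 segment, and the pattern match falls on the 177-199 span listed for PS00028.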
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    136    142  AKKREKA
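
The reported NLS coordinates can be compared with the position of the motif in the protein sequence. This is a small sketch under the assumption that the sequence shown on this page is the one the coordinates refer to; the variable names are hypothetical, and the page itself does not state which indexing convention the NLS table uses.

```python
# Protein sequence from this record, with spaces and position counters removed.
SEQ = (
    "MDYRRYNSSQEANLGTKQQISEEQAYQDSLVEELAEDFRLPINHKPTENVDLDNVQQATL"
    "DTKLNSSNVGFRLLQKMGWKGKGLGKDEQGIIEPIRSGIRDPKLGIGKQEEDDFFTAEEN"
    "IQRRKLDIEVEETEEHAKKREKAAYSIYVCLLIGMVLAEREQKIQTEVKEIRKVFYCELC"
    "NKQYKLAMEFEVHLSSYDHNHRKRFKEMREMHGSSSRDDRQKREQQRQEREMAKFAQMAG"
    "ARKQQQQESREESGPATTPAPAPAPASAIATALADQEQRKTLKFGFSSKSSSSKNASGSA"
    "VKKPKVAVASVFGNDSDDE"
)

NLS = "AKKREKA"

# str.find returns the 0-based offset of the first occurrence, or -1 if absent.
offset = SEQ.find(NLS)
last = offset + len(NLS) - 1        # 0-based offset of the final NLS residue

print(offset, last)                 # 136 142
print(offset + 1, last + 1)         # 137 143 when counted from 1
```

The motif starts at offset 136 when counting from zero, which matches the table as printed. Counted from one it would be 137-143, so the NLS table appears to use zero-based positions, although the page does not say so.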
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017616735.1  0.0      PREDICTED: G patch domain-containing protein 8
TrEMBL  A0A0B0NFV1      0.0      A0A0B0NFV1_GOSAR; G patch domain-containing 8
STRING  EOY11772        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1266756