PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.5961s0001.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 311aa    MW: 35277.3 Da    PI: 9.7098
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.5961s0001.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix70.23.8e-2265145286
             trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkm.rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          Wt++e+ +Lie ++e++ +++rg lk ++Wee++ ++  + g+ rs++qC++k+e+++kr++ ++++++       s +p+++q+e
  Cagra.5961s0001.1.p  65 WTHDETFLLIESYKEKWYAIGRGPLKTNHWEEIAVAVsGRSGVDRSSTQCRHKIEKMRKRFRSERQSMGPI-----SIWPFYNQME 145
                          ***********************************998899****************************84.....68*******8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.5E-1965147No hitNo description
PROSITE profilePS500906.60965122IPR017877Myb-like domain
Sequence ? help Back to Top
Protein Sequence    Length: 311 aa     Download sequence    Send to blast
MATPSPTSSP PSDSNPNSAA TPPHQTQQPS PPHTNPSSTP PPHITVVALA ASTSSHARKT  60
QPILWTHDET FLLIESYKEK WYAIGRGPLK TNHWEEIAVA VSGRSGVDRS STQCRHKIEK  120
MRKRFRSERQ SMGPISIWPF YNQMEELDSN PAPISARPLT RLPPNSSNHY QEDQEDQEEE  180
DHYEEEDEDD ERQSKSRSIN YILRRPGSVN RFAGVGGGLL SWGQRDRSSK RKRNKNDGDG  240
GERRRKGVRA VAAEIRAFAE RVMVMEKKKM EFAKETVKLR KEMEIRRIDL IQSSQAQLLQ  300
FLNNAFDSSF *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1230245RKRNKNDGDGGERRRK
Cis-element ? help Back to Top
SourceLink
PlantRegMapCagra.5961s0001.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAB0286090.0AB028609.2 Arabidopsis thaliana genomic DNA, chromosome 3, TAC clone:K7P8.
GenBankAK1181570.0AK118157.1 Arabidopsis thaliana At3g24860 mRNA for unknown protein, complete cds, clone: RAFL19-47-G19.
GenBankBT0054910.0BT005491.1 Arabidopsis thaliana clone U50805 unknown protein (At3g24860) mRNA, complete cds.
GenBankCP0026860.0CP002686.1 Arabidopsis thaliana chromosome 3, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006298195.10.0trihelix transcription factor ASIL1
TrEMBLR0I4460.0R0I446_9BRAS; Uncharacterized protein
STRINGCagra.5961s0001.1.p0.0(Capsella grandiflora)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM102812232
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G24860.11e-146Trihelix family protein