PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG58949.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family NF-X1
Protein Properties Length: 2319aa    MW: 244812 Da    PI: 7.9132
Description NF-X1 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG58949.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-NF-X120.11.4e-0614021419118
    zf-NF-X1    1 CGkHkCqklCHeGpCppC 18  
                  CG+H+C + CHeG C++C
  GBG58949.1 1402 CGRHTCPRVCHEGACGVC 1419
                  ****************** PP

2zf-NF-X120.21.3e-0614661484119
    zf-NF-X1    1 CGkHkCqklCHeGpCppCp 19  
                  CG+H+C++ CH+GpC+ C+
  GBG58949.1 1466 CGNHRCERFCHPGPCGDCR 1484
                  ******************8 PP

3zf-NF-X119.22.7e-0615291546118
    zf-NF-X1    1 CGkHkCqklCHeGpCppC 18  
                  CGkH+C+++CH G CppC
  GBG58949.1 1529 CGKHRCEQTCHNGACPPC 1546
                  ****************** PP

4zf-NF-X119.22.7e-0616271644118
    zf-NF-X1    1 CGkHkCqklCHeGpCppC 18  
                  CG+H+Cq+lCH+G CppC
  GBG58949.1 1627 CGRHTCQELCHPGHCPPC 1644
                  ****************** PP

Sequence ? help Back to Top
Protein Sequence    Length: 2319 aa     Download sequence    
MPTSMIGLSI SEQSGGGDVA EAARQVCSSP AIRRGGGRGT RSRSDGGGTN SVLSPKPVVQ  60
QVWKEKRPSV DGGCNPGSVG TTGTEAPTPT SSTLTSPLTR GSGSGSGCAD LPLPGVSTFT  120
ATAVAVSATA VPVSRPLDFK LTDGARSEGG EAATGGGGGG GGRCGGAIFS SEASMTKSRE  180
CPNYERSVVS GISSNVVGAR NDGEKSASVS VVTSSGGSIS GKRDSGAPAE RGIAESVAIA  240
GAAAGGGNQG SEAGTADTTS VAAGMVALGG FSDSAVLSRP ASRSAAGAVS GNDAGGLTTD  300
GSQAASAGGM GVSVRAMEDR TQCPSNSASA VGAGKGFNGV GEDKKINGGE GGVTVPVKLS  360
VSVIKDPGGK YAWRARNSPT TPDNRGGGGG GVVGGGVMGE GVRREQDEPA PTPRRLSEVQ  420
SSGGGESEVI PSDRGPILPS PRKADGSDRG VGGGGWNGSG RVVGREEAIP APLFVPQPVP  480
GTSLVQVGNI WNKKTNVRGH RRSVSTSAVD LQVVLPRGPN ERAERHERGG IRRQFPPQPP  540
SLINGGFRDP LAQQQQNVPV GMQQGGVGGA GLNAPAAANG GPAPLNGAMG GGLGENGVNS  600
VSTNVRVQQQ RQQPGANARH VRSATWDGQN SRSGESSGGG SIRRVDPDHT HSEWRRGRNP  660
AEVDGQLMAM DRLEGIGGGG GIPLPLQAHL GTSIPSPLPQ QRGHYRRMSW SQESRHPQHQ  720
QQQLMMGMVP EHRDGHDATM HLRQDKERGM VMPAPEQDRQ SEQRDGSRQS GVVTQLLQQQ  780
QQQQQQTQLA LQQQQLLAHQ KHQQQLTHHH QQQQLAHQQQ QQLAHQQQQQ LAHQQQHQPQ  840
QWYGNSSGYG GREVGEKMGR TEDRGDDSGW EGHHQNLQAA GDDGLTGGGR AHAPSPALRG  900
REGGDAMGNR VSGGSMGGAD VVNNVERIHR RSRSSGGFAV EEQQMGVETG EAGTPSSKSR  960
SEYERMKRET GDRSRDGSGY DGPQDEDEKG SGDRINGRNK EKVKEGEAEV ERYQQQLQQV  1020
PPWVTRGGAP SVAAASAGAG RGNSTVQTRR GRARSDSFDW GGARHRSDMS SNWRSGGPRD  1080
GEESAGHHQF FHGDEKDWSS ERERVRDRGE ERLDRDEWLA MERVERRERE DPQIPRRVER  1140
EDKQQRWVGM GENSDRERDP PPPPPERRSG GGSIRRKEPS SLSAIPPVSP VVGGPVGASP  1200
RPSRLMFGRA VPQLVQELEE KLMKGQVECM ICCEIVGRSS SVWSCSSCYA IFHLSCTRKW  1260
ARVPVATASD LSAAGGQQQP GGSGEGTWRC PGCQTPQSIR ASELRYRCFC GQREDPPNDP  1320
YLTPHSCGDA CRKPLARPTS DGGYRCPHVC TMRCHPGPHA PCAAMAPTVF CYCRKSEITR  1380
KCADYEKHGR SCGSTCMRSL ACGRHTCPRV CHEGACGVCE VTVKTRCFCG KKEELQSCGR  1440
IEFKGEFPTS GGVFSCMERC SKILGCGNHR CERFCHPGPC GDCRLLPSLL LRCPCGKEEI  1500
KNLLGGMDRK GCLDPVPTCE GVCEKLLACG KHRCEQTCHN GACPPCEVPM EQKCRCVLAV  1560
RIVPCFQLDV GTVFTCDRKC GKLKNCRRHR CSAVCCAAPP VDSPGPTAPV PGDSHFCMIV  1620
CGKKLRCGRH TCQELCHPGH CPPCMDSVMT ELECACGRTS IPPPVPCGTP LPACPFPCSV  1680
PQPCGHPSTH LCHFGDCPPC TVAVAKECVG GHVVLRGVPC GSKDIRCNAV CGKLRRCGIH  1740
TCTRTCHRPP CDAVEIEGEG DHTSPGSRDG FSISRSSRRL PCGQTCGQPR RDCEHFCKAV  1800
CHPGEACPNS RCTAVVAIVC LCGRLKAQVQ CCAGGGDDDD GFDEAAFVSQ LGLKLQPVQQ  1860
SADGASPGYV PLGRRKIACD DECAKCEKKR ILANAFGVSM APGLEGEEGT AGGTESGAML  1920
MEMIRRDPIW VSAVELRLQF LLLGSRNALK MSTASATSGM RVHVFRYLPK ERRAAIHELA  1980
ARWLMESISV GWEPKRFIVV YTTAKSRAPL RGLISKPGSS VPAVGHAVAP LPNGEVDMDP  2040
RCVISFFDLP RDADISAGIL RFAGDCELVW LNDRNALSVF SNPIRATTAL RYLDHASAYW  2100
GAIAQQSQPS QSSSAGASTG MRVWGKGPSV QASSSSSSGQ PAHGSLAMRR NRSGSGASAA  2160
WHADAWSENE TQADLEGISG GSRADPLTQV WKQEQSGLAQ SNFWSALAEE GDGDATEENG  2220
EGSTAATPRT EGGDGQRGSH SRSDSGESSK KGASGLNAAG TVPSSPSGGG SPLSTSSVII  2280
RNVGDDNKWG AGGLALAGLG DGCGKGDDEE DWESVLDDH
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G10170.10.0NF-X1 family protein