PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG76848.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family NF-X1
Protein Properties Length: 1671aa    MW: 178679 Da    PI: 8.2292
Description NF-X1 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG76848.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-NF-X1217.4e-0712711289119
    zf-NF-X1    1 CGkHkCqklCHeGpCppCp 19  
                  CGkH+C+++CH GpC+ C+
  GBG76848.1 1271 CGKHRCESICHAGPCGDCK 1289
                  ******************9 PP

2zf-NF-X117.21.1e-0513341351118
    zf-NF-X1    1 CGkHkCqklCHeGpCppC 18  
                  CG+H C++ CH+G CppC
  GBG76848.1 1334 CGEHNCKQVCHTGACPPC 1351
                  ****************** PP

3zf-NF-X126.51.4e-0814691486118
    zf-NF-X1    1 CGkHkCqklCHeGpCppC 18  
                  CG+H+CqklCH+G+CppC
  GBG76848.1 1469 CGRHTCQKLCHPGDCPPC 1486
                  ****************** PP

Sequence ? help Back to Top
Protein Sequence    Length: 1671 aa     Download sequence    
MSPARSVNSP MSGEKAVEVE RTRGCVGIPE TTPCGSSASG GGGRRSKPAS SSQSPVSPAK  60
SPKQQIWRQK RSPGSDGNST SAASSSNGLS AARLSSPPSS SVVDEDKSGA SPSAVTTPPS  120
RKGGRKKNNG PVRALDMDGS LDERAVAVPV TKDAQVQNKA SRSAPSKRAE DASDVRSNGM  180
IGVSKHRPSR GQENAPSMCS AGSIPAAGED GWLGGKEESH VASGGGDDGA LSSVRRSEMG  240
EKEDKAKLAW RARAPSSAEA CCKGSMDGRG DSAEGMLTPP SLKLAGSDGS GQRRTGIDTS  300
STSQDGGSGG NGGPLVNKIK AVARGHRRSV SSSSAADLRD FFAAPSPREW SSDERSRDSR  360
EKMGVAQQRQ SIARQQPLQL QQHNIMREPP LEEPKQKSVS RFHPINSSAL SDGGKNAMLP  420
NASATASVGA NGRGVGGGVG DADPCEGQKG DWRSGKGQHQ LLLQQRRGGS RHARSATWDG  480
QNLSPHLMEI AGGGSWRSNS PQLRPNWESG GREASALDSE GGRANLAGMT SRSLAAGFES  540
ARGSPRGSPR DSPLVSPLVS PLVSPRVSPR VSPRVSPHAS PRGSPRAPDN PRFLDGPRTR  600
DNPRDSALAQ GTQNEYRRQR GHFRNLSWTQ DSLDHGVNFL PQKPLQHERG GYQHRRMGWG  660
QESNWQREQG SYLKENRRSE SGTSTWRGDS TVTPSSSVSS FSSLSSDKAL KKEYGARGEC  720
FSAEQQQQQQ QEGEEEINKC GQTEEAMLAE VDGATATNNV RLEERRGRSP WRKLENSLRN  780
EAGLVDGETS EKRGEPPESE RQVPPWAAGK GGAPPAGESF ARLRRARARS DSFAWADGSW  840
NRGDDSSNWR QQSREDGGEE DSANRRSRSR NTSAERDEER ERRDGTRNYD KSSSGGGDKK  900
GYRGRSISRD ANHNERFERR QQLHAGSLAA NGIEREGEWW TAARKKPIVG SPSSPLRSPG  960
KNQKLSGQLL AFSRTVVPQL VQELEDKLVC GLVECVICCD SVTRRTPVWS CNSCYAIGHL  1020
SCIKKWARIP IPPLLSTCCL ARTAAVSEML EEGDYTPPRT LAKGMWFCAV CRAVQTVPAD  1080
ELSCRCFCGQ VEDPSTDSST AIAPHSCGRT CRKPLDRTKD GASAVHGNLL PSAAEVDSSG  1140
KRGYRCPHVC TLKCHPGPHA TCTALAPPVL CYCRKSEITC KCVDFGRQER SCGAICMREM  1200
PCGRHICPRV CHDGPCGACD ANVKVRCFCG KREEMWACGR VETRREELLL PAIGGRGLYS  1260
CQERCLKLLD CGKHRCESIC HAGPCGDCKL LPAVLRNCPC GKVQIEVLLG GAERKGCLDP  1320
VPTCGAECGK LLPCGEHNCK QVCHTGACPP CEMPVEQSCC CGSVVRSMPC FQFGPACGNG  1380
PAGTDIGRSG DSSQNRGKTI GVFTCDRRCG LLKNCGKHVC SMVCCVSVQQ QTQGTLSALS  1440
VPDVAADPVR VMRVDPHVCL IACSKKLQCG RHTCQKLCHP GDCPPCMDSI ATELKCACGK  1500
SVIPPPVPCG TPLPVCSFEC PKPQLCGCPP NHACHSGDCP PCTLASHCHQ SLERDEQPCI  1560
SENHAESVQT EAREESATRT VAETLAGQET PKPAQLAGPQ EVLEGAISEN RGVIVESSGG  1620
EHDTDVVQEQ QEQQDLTDAI HVTSNGLVTP VGLARRPSFK NALLGIDDNG N
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G10170.11e-112NF-X1 family protein