PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_028111007.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family C2H2
Protein Properties Length: 1161aa    MW: 130684 Da    PI: 7.9698
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_028111007.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H211.30.00110701093223
                      EET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                      +Cp   Cgk Fs++ + + H+r+H
  XP_028111007.1 1070 QCPhkGCGKKFSSHKYAVIHQRVH 1093
                      69999*****************99 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1161 aa     Download sequence    
MRDVKIPNWL KGLPLAPEFR PTDTEFADPI XHISRIEKEA SAFGICKVIP PLPKPSKKYV  60
ISNLNKSFLK CPELGPNTNL AEVCSSSKTD SGNEANDREA RAVFTTRHQE LGQNARTIKK  120
VVRKQVWQSG QIYTPEQFES KSKTFARSQF GMVKDLSPLD IEALFWKAVS EKPINIEYAN  180
DVPGSGFGEP EGSFQYFYRQ TRRRKRTYNR NSGGSCDSKK YQVNNTSNGG ENRGVSTKNS  240
SNSFRETSRV SISLPTILSN QTSGCSRQKS SNAANEMEGT AGWKLSNSPW NLQVIARSPG  300
SLTRFMPDDI PGVTSPMAYI GMLFSWFAWH VEDHELHSLN FLHTGSPKTW YAVPGDYAFS  360
FEEVIRNQAY GGNIDHLASL TLLGEKTTLL SPEVVVKSGI PCCRLIQNPG EFVATFPRAY  420
HVGFSHGFNC GEAANFGTSK WLKVAKEAAV RRAAMNYLPM LSHQQLLYLL TMSFVSRVPR  480
SLMPGARKSR LRDCQKEERE LSVKKAFLED ILSENNLLTI LLGKNSSCHA VLWDLESLPS  540
PSKESERCIA TVVTTPREDV CSENKNIETM DDLYFNDADF SCDFEVDSRT LACVACGILG  600
FPFMCVVQPS ERVSMSLLQD VGVSKPLRCH SHPVLHDMIE GFISDGMKDK IEQNESNVLH  660
HQNGAFSSAK SIKSTLLRGS INFQKMKAHA AVIAEEIGSP FGYNEIPLDN ASQEGLNLID  720
LAIDDEEHDE CREDWTSKLS INLRHCVKVR KNSPSEQVHH ALALGGLFSD RCPSSNTFNL  780
KWPSRKSWSK QNSYHSVHMK PSESIQMEKD EVWGEESDGI MVRREGELIQ YSRRFKSKPG  840
DYATASKIFE HPAKHMHNEV SVTNSGGPDE SITLIAVEST ENFDIQRENQ ITEEISMKYV  900
ACADASEGQC NIQSDGDVLM KEAPDLVNTM NTDEVLSSLE NCSSHDKIEL ENTGFTMVCP  960
RSTAQNGRKR RIDVELQTED QLDFNCFIKS PCERLRPRAA KDAKISGIDS NSMFEDKPAV  1020
KSVRKSSNDF LSRKNKKKNE NVKGSHRCDL EGCLMKFQTK ADLLLHKRNQ CPHKGCGKKF  1080
SSHKYAVIHQ RVHDDDRPLK CLWKGCTMTF KWAWARTEHL RVHTGERPYQ CKVEGCGLTF  1140
RFVSDFSRHR WKTGHYVNVP T
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110241037RKSSNDFLSRKNKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein