PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_024963416.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family C3H
Protein Properties Length: 809aa    MW: 90437.4 Da    PI: 8.5945
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_024963416.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH23.68.9e-08186206425
                     ---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   4 elCrffartGtCkyGdrCkFaH 25 
                     ++C++f++ G C++G +C+F H
  XP_024963416.1 186 RPCKDFSA-GNCRRGIQCRFLH 206
                     79******.************* PP

Sequence ? help Back to Top
Protein Sequence    Length: 809 aa     Download sequence    
MSGRRRRSES LWDRKEEEAT RTEISEKVPY SSFIGDGSPK RSNSEANDAG DISGQPSREP  60
MQGNQNISMD NDNSSGRHGM KMDPAFDEWE NQYSSRPSDN SYDLPFRSGE RGVGGTGNMS  120
KDRDRSRSPH RSRGRDRDRD RDRGSTRGLA RSRSRSRSRS RGRGRSPFSD ERRESYESGD  180
SRVSSRPCKD FSAGNCRRGI QCRFLHQELV DSRGGDLMEK YSGGLRFRDD KNSVSKGSDG  240
YRKDTGRKSG VDHTTSSGDD YHKGNRNAYD DQDHRRQSNQ TTRAPCRFFI MGKCNRTNCK  300
FSHDVPKSGG HEGRSHDNSR PWNDRQPQDQ VSASGFSDFP KSSHDFDDKN KSWSGPLWND  360
LESGGFDEMS RDNILDDKNR TWDPPGWNGS EKNSDILSQP RSVTNHMDMN QESQIIHDSS  420
QLHSQHVMPE TSGSQLTNAT TTVIPEVPRI PCFQQHQKQG EGSVSMMIDS RNMVSGQTNK  480
QIHSYGSDFV PSVHNMAFQY PSTLNGTEKS SDMLSLSSLN GPKLQSNGSV QGMFLHVDLQ  540
KQTDIQNKRG GKPLETSKND IPQLDATVAS NEQVSQVSSL PVSLPQISQN LNLANALELL  600
YSLPNTASSG APVDSMVVQQ NLDTKSKEHY ESSRDMVVNE SNISVPMGIL SNSMEQGNQV  660
SVEQVSSIDP NPSKLVAVGM PDGKQEPVGI LQVNENRKAD KNENTNPGKS EAQGKVEEGN  720
ISNDEKAMRQ FKIALVEFVK EILKPTWKEG KMSREVYKTI VKKVVDKVTS TIQGNQIPRT  780
QEKIDQYLTF SKSKITKLVE AYVGRFLKA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1131141RSRGRDRDRDR
2133143RSRGRDRDRDR
3151161RSRGRDRDRDR
4151159RSRSRSRSR
5153163RSRGRDRDRDR
6153161RSRSRSRSR
7155165RSRGRDRDRDR
8155163RSRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.17e-14C3H family protein