YALI0B22308g


uniprot|O74996 Yarrowia lipolytica Hexokinase

Genomic environment map

Element type: CDS
Element length: 2041 nucleotides,
on sense strand of
Yali0B: join(2934434..2934472,2934909..2936474).
Other names:
YALI-IPF40545
Coding sequence: 535 codons.
Database cross references:
EMBL: CR382128
GeneID: 2906865
GenomeReviews: CR382128_GR
HOGENOM: HBG522186

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0087
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0B22308p  


uniprot|O74996 Yarrowia lipolytica Hexokinase; SubName: Full=YALI0B22308p;

Protein domain map

Protein length: 534 amino acids
Protein family: GL3C0087
Database cross references:
HSSP: 1IG8
InterPro: IPR001312
InterPro: IPR019807
InterPro: IPR022672
InterPro: IPR022673
KEGG: yli:YALI0B22308g
PANTHER: PTHR19443
PRINTS: PR00475
PROSITE: PS00378
Pfam: PF00349
Pfam: PF03727
RefSeq: XP_501216.1
SMR: F2Z672
UniProtKB/TrEMBL: O74996
UniProtKB: F2Z672_YARLI

Computed results for YALI0B22308p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>YALI0B22308g.nt
ATGGTTCATCTTGGTCCCCGAAAACCCCCGTCCCGAAAGgtgagtatgattgcacattag
ggggatgagcgattagcgacatgcaaagacaacatggcatagcggtattacagggagctc
cacagatacacggttctggcgaaaacagatacacggctccgatggtgcaaggacgtgata
gtaaacagacgtccaagtcgtcgccttggaacatgacgagtcataccctcacacaaccac
cctgtcacgtccagcacgccgtcggacatgacttccgtctgtgttgattcacagccattg
tgagggatgtgattggtgctgcgacatggaaggaaaggtgactgtctaccgttgccaccg
gtttggttctgtacacgttcggtgtccaagtaatagccaccttgtcacatgctgttgcct
gcaggatgttgtcttcgctcgccagaaaaggcaactcgcagccatactaacacagGGCTC
AATGGCAGACGTCCCGCGGGACCTGCTGGAGCAAATCTCCCAGCTTGAAACCATCTTCAC
CGTTTCGCCCGAAAAGCTGCGTCAAATCACCGACCACTTTGTGTCCGAGCTCGCTAAAGG
CCTCACAAAGGAGGGTGGAGATATCCCCATGAACCCCACCTGGATTCTGGGATGGCCCAC
CGGAAAGGAGAGCGGCTGCTATCTGGCTCTCGACATGGGTGGCACCAACCTGCGAGTTGT
CAAGGTGACTCTGGACGGCGACCGAGGCTTCGACGTCATGCAGTCCAAGTACCACATGCC
CCCCAACATCAAGGTCGGCAAGCAAGAGGAGCTGTGGGAGTACATTGCCGAATGTCTGGG
CAAGTTCTTGGCCGACAATTATCCTGAGGCTCTTGATGCCCATGAGCGAGGACGAGATGT
CGACAGAACCGCTGCGCAGAGCTTCACTCGAGACAAGTCTCCTCCTCCCCACAACCAGCA
CATTTCGTGTTCTCCTGGCTTCGACATCCACAAGATTCCTCTCGGTTTCACCTTTTCATA
TCCCTGCTCTCAGCCCGCCGTCAACCGAGGTGTACTGCAGCGATGGACCAAGGGTTTCGA
CATTGAGGGAGTCGAGGGCGAGGACGTGGTCCCCATGCTGGAAGCTGCCCTCGAAAGAAA
GAACATTCCTATTTCCATCACCGCCCTGATCAACGACACCACCGGAACTATGGTGGCCTC
CAACTACCACGACCCCCAGATCAAGCTGGGTAACATCTTTGGTACTGGTGTCAACGCCGC
CTACTACGAGAAGGTCAAGGACATTCCCAAGCTCAAGGGTCTCATCCCCGACAGCATTGA
TCCCGAGACCCCCATGGCCGTCAATTGCGAGTATGGAGCCTTCGACAATGAGCACAAGGT
TCTCCCTAGAACCAAGTGGGACATCATCATCGATGAGGAGTCTCCCCGACCCGGTCAGCA
GACCTTCGAGAAGATGAGTGCTGGCTACTACCTGGGAGAATTGCTTCGTCTGGTTCTTCT
GGACCTGTACAAGGACGGGTTTGTGTTCGAGAACCAGGGCAAGAACGGTCAGGAGCTTGG
AAACGGCAACATCAACAAGTCGTATTTCTTCGACACCTCTTTCCTGTCTCTGATTGAGGA
GGATCCCTGGGAGAACTTGACTGATGTCGAGATTCTCTTCAAGGAGAAGCTTGGTATTAA
CACCACTGAGCCCGAGCGAAAGCTCATTCGTCGACTGGCCGAGCTCATTGGTACTCGATC
CGCTCGAATCTCTGCCTGTGGTGTCGCTGCCATCTGTAAGAAGGCTGGCTACAAGGAGGC
TCACGCTGGAGCTGACGGATCCGTGTTCAACAAGTACCCCGGATTCAAGGAGCGAGGCGC
CCAGGCTCTCAACGAGATTTTTGAGTGGAACCTGCCCAACCCTAAGGACCACCCCATCAA
AATCGTTCCCGCTGAGGATGGTAGCGGTGTTGGAGCTGCTCTGTGCGCTGCTCTCACCAT
CAAGCGAGTCAAGCAGGGTCTTCCCGTTGGTGTCAAGCCCGGTGTCAAGTACGATATTTA
G

Coding sequence    

>YALI0B22308g.cds
ATGGTTCATCTTGGTCCCCGAAAACCCCCGTCCCGAAAGGGCTCAATGGCAGACGTCCCG
CGGGACCTGCTGGAGCAAATCTCCCAGCTTGAAACCATCTTCACCGTTTCGCCCGAAAAG
CTGCGTCAAATCACCGACCACTTTGTGTCCGAGCTCGCTAAAGGCCTCACAAAGGAGGGT
GGAGATATCCCCATGAACCCCACCTGGATTCTGGGATGGCCCACCGGAAAGGAGAGCGGC
TGCTATCTGGCTCTCGACATGGGTGGCACCAACCTGCGAGTTGTCAAGGTGACTCTGGAC
GGCGACCGAGGCTTCGACGTCATGCAGTCCAAGTACCACATGCCCCCCAACATCAAGGTC
GGCAAGCAAGAGGAGCTGTGGGAGTACATTGCCGAATGTCTGGGCAAGTTCTTGGCCGAC
AATTATCCTGAGGCTCTTGATGCCCATGAGCGAGGACGAGATGTCGACAGAACCGCTGCG
CAGAGCTTCACTCGAGACAAGTCTCCTCCTCCCCACAACCAGCACATTTCGTGTTCTCCT
GGCTTCGACATCCACAAGATTCCTCTCGGTTTCACCTTTTCATATCCCTGCTCTCAGCCC
GCCGTCAACCGAGGTGTACTGCAGCGATGGACCAAGGGTTTCGACATTGAGGGAGTCGAG
GGCGAGGACGTGGTCCCCATGCTGGAAGCTGCCCTCGAAAGAAAGAACATTCCTATTTCC
ATCACCGCCCTGATCAACGACACCACCGGAACTATGGTGGCCTCCAACTACCACGACCCC
CAGATCAAGCTGGGTAACATCTTTGGTACTGGTGTCAACGCCGCCTACTACGAGAAGGTC
AAGGACATTCCCAAGCTCAAGGGTCTCATCCCCGACAGCATTGATCCCGAGACCCCCATG
GCCGTCAATTGCGAGTATGGAGCCTTCGACAATGAGCACAAGGTTCTCCCTAGAACCAAG
TGGGACATCATCATCGATGAGGAGTCTCCCCGACCCGGTCAGCAGACCTTCGAGAAGATG
AGTGCTGGCTACTACCTGGGAGAATTGCTTCGTCTGGTTCTTCTGGACCTGTACAAGGAC
GGGTTTGTGTTCGAGAACCAGGGCAAGAACGGTCAGGAGCTTGGAAACGGCAACATCAAC
AAGTCGTATTTCTTCGACACCTCTTTCCTGTCTCTGATTGAGGAGGATCCCTGGGAGAAC
TTGACTGATGTCGAGATTCTCTTCAAGGAGAAGCTTGGTATTAACACCACTGAGCCCGAG
CGAAAGCTCATTCGTCGACTGGCCGAGCTCATTGGTACTCGATCCGCTCGAATCTCTGCC
TGTGGTGTCGCTGCCATCTGTAAGAAGGCTGGCTACAAGGAGGCTCACGCTGGAGCTGAC
GGATCCGTGTTCAACAAGTACCCCGGATTCAAGGAGCGAGGCGCCCAGGCTCTCAACGAG
ATTTTTGAGTGGAACCTGCCCAACCCTAAGGACCACCCCATCAAAATCGTTCCCGCTGAG
GATGGTAGCGGTGTTGGAGCTGCTCTGTGCGCTGCTCTCACCATCAAGCGAGTCAAGCAG
GGTCTTCCCGTTGGTGTCAAGCCCGGTGTCAAGTACGATATTTAG

Predicted translation product    

>YALI0B22308g.aa
MVHLGPRKPPSRKGSMADVPRDLLEQISQLETIFTVSPEKLRQITDHFVSELAKGLTKEG
GDIPMNPTWILGWPTGKESGCYLALDMGGTNLRVVKVTLDGDRGFDVMQSKYHMPPNIKV
GKQEELWEYIAECLGKFLADNYPEALDAHERGRDVDRTAAQSFTRDKSPPPHNQHISCSP
GFDIHKIPLGFTFSYPCSQPAVNRGVLQRWTKGFDIEGVEGEDVVPMLEAALERKNIPIS
ITALINDTTGTMVASNYHDPQIKLGNIFGTGVNAAYYEKVKDIPKLKGLIPDSIDPETPM
AVNCEYGAFDNEHKVLPRTKWDIIIDEESPRPGQQTFEKMSAGYYLGELLRLVLLDLYKD
GFVFENQGKNGQELGNGNINKSYFFDTSFLSLIEEDPWENLTDVEILFKEKLGINTTEPE
RKLIRRLAELIGTRSARISACGVAAICKKAGYKEAHAGADGSVFNKYPGFKERGAQALNE
IFEWNLPNPKDHPIKIVPAEDGSGVGAALCAALTIKRVKQGLPVGVKPGVKYDI*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites