YALI0E33517g


highly similar to uniprot|P20967 Saccharomyces cerevisiae YIL125w KGD1 2-oxoglutarate dehydrogenase complex E1 component singleton

Genomic environment map

Element type: CDS
Element length: 3783 nucleotides,
on sense strand of
Yali0E: join(3975215..3975406,3976175..3978997).
Other names:
YALI-CDS0531.1
YALI-IPF4345
Coding sequence: 1005 codons.
Database cross references:
EMBL: CR382131
GeneID: 2912855
GenomeReviews: CR382131_GR
HOGENOM: HBG289950

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3R1547
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0E33517p  


highly similar to uniprot|P20967 Saccharomyces cerevisiae YIL125w KGD1 2-oxoglutarate dehydrogenase complex E1 component singleton; SubName: Full=YALI0E33517p;

Protein domain map

Protein length: 1004 amino acids
Protein family: GL3R1547
Database cross references:
InterPro: IPR001017
InterPro: IPR005475
InterPro: IPR011603
KEGG: yli:YALI0E33517g
PANTHER: PTHR23152
PIRSF: PIRSF000157
Pfam: PF00676
Pfam: PF02779
RefSeq: XP_504734.2
SMART: SM00861
TIGRFAMs: TIGR00239
UniProtKB/TrEMBL: Q6C3M8
UniProtKB: Q6C3M8_YARLI

Computed results for YALI0E33517p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>YALI0E33517g.nt
ATGCTCAGACACGCTTTATTGAAAAGGACCCCCGTCCTCCCGCGACTCACAAGTCGCAAG
GCCTTCGTGCCATTGATCCAGAAACGAAAGTACTCTGATGACGTCTTCCTTACCACCAAC
GCCGCAAACTACATCGATGAGATGTATGCCGCATGGAAGGACGACCCCAAGAGTGTCCAC
GTCTCTTGGCAGgtgagtgatagtggaggaatcggaagctgatggcttgtactacgactt
gtgatggttatttttggttgttttggtggacattcgacacttggcgtgtgaagtgatgga
gttgggcccaaggtcatggaggcaagtggcttatttgcaaccgaactgtatcacggatac
agtgagagtgtacagataagtcgttttgttgacacgacacctacgaatggttgacccctt
cctccttgctccagaatgcactagctagcacaattgagagacctgctgttgcatttttct
gcgcctgcctgtagcaatttgtatgtctctcgattggctcagtacctctgcatctctgca
cactagacctgcccttcgtttgcggttgccaccactagactcagggtgtcttcccgcatt
gggagttacgggttgagccggagcatggaaaaacgtgcatgcaggacgcagaattgttgg
tcggaacggactaacaggtgtatgcgccgagagagatcccggcgtgtgtgattagtaaag
gttctggacgtgtttttgacataccccacctgactattcctcaattgtccattcctgggc
tttttggattttagtttcattactttttgcacatgcttggtgtttatctctaactcctgc
cccagcaagtcccaactgcaccttaatttaggcctccctgacttaactaaatcccaacta
aacctacccagcaaatgtatcaaccagaagcacggccgcaattgacctatactaactcag
TCTTACTTCAAGAACCTGGATGGTGGCCTTCCTGCCGACAAGGCATTCTCTGCCCCTCCC
ACTATTGTGCCCTCTCCCTCTGGAGGTGTTCCCACCCCCGCTGCCCCCTCCGGCGCTCCT
TCTGACATCACCAACCACATGAAGGCCCAGCTGCTGGTTCGAGCCTACCAGGTCCGAGGT
CACACCAAGGCCAAGATTGATCCTCTTGGAATCTCCTTTGGCTCCGACAAGAACAAGAAG
CCCCCTAAGGAGCTGACCCTCGAGTTCTACGGATGGACCGACAAGGATCTCGACACCGAG
ATCACTCTCGGCCCCGGTATCCTCCCCCGATTCGTCGAGAACGGTAAGAACAAGCGAACT
CTCCGTGAGATTATCATGGACTGTGAACGAATCTACTGTGGCTCCTACGGTGTTGAGTAC
ATCCATATTCCCTCTCGAGAGGAGTGTGAGTGGATCCGAGACCGAGTCGAGACCCCCAAG
CCTTACAACTACACCCCTGACCAGAAGCGACGAATGCTCGACCGACTTATCTGGGCTAAC
CTCTTCGAGACCTTCCTTGCCTCCAAGTTCCCCAACGACAAGCGATTCGGTCTTGAGGGT
GCCGAGACTGTCGTTGTCGGTATGAAGACTCTGATCGACCGATCCGTCGATGCCGGAATC
GAGGACATTGTTATCGGTATGCCCCATCGAGGTCGACTCAACATGCTTTCCAACGTTGTG
CGAAAGCCCAACGAGTCCATTTTCGCTGAGTTCCAGGGATCTGCTGTCTTCGACGAGGGT
TCTGGAGATGTCAAGTACCATCTGGGTGCCAACTACCAGCGACCCACCCCCTCTGGAAAG
AAGGTCAACCTCTCTCTTGTCGCTAACCCCTCACATCTTGAGGCTGAGGACCCCGTTGTC
CTGGGTAAGACCCGAGCTATCCAGCACATGAAGCATGACGTCGGCACCTTCGACAAGGCC
ATGGGTGTGCTCATGCACGGTGACGCTGCCTTTGCCGGCCAGGGTGTTGTCTACGAGACC
ATGGGCATGCACTCTCTGCCTGCCTACTCTACCGGTGGAACCATCCACATCATCGTTAAC
AACCAGATTGGTTTCACCACCGATCCTCGATTCTCCCGATCTACCCCCTACCCCTCCGAT
CTGGCTAAATCCATCGATGCCCCCATCTTCCACGTCAACGCTGACGATATGGAGGCCGTC
GACTTCATCTTCAACCTGGCTGCTGACTGGCGAGCTACCTTCAAGTCCGATGTCATCATC
GATCTTGTCTGCTACCGAAAGTTCGGTCACAACGAGACCGATCAGCCCTCGTTCACTCAG
CCTCTCATGTACAAGAAGATTGCCGACAAGCCCAACCCTCTTGACATCTATGTCGACAAG
CTTCTCAAGGAGAAGACTTTTACCAAGGAGGACATTGAGGAGCACAAGCAGTGGGTCTGG
GGAATGCTCGAGGAGTCTTTCAAGAAGTCCAAGGACTACGTGCCCCATCAGAAGGAGTGG
CTCGCTTCTCCTTGGGACGACTTCAAGACCCCCAAGGAGCTTGCCACCGAGATCCTGCCC
CATCTCCCCACATCTGTTGAGGAGAAGAAGCTCAAGGAGATTGGAAAGGTCATCTCCTCT
GTTCCGGAGGGATTCACCCTTCACCGAAACCTCAAGCGAATCTTGTCCAACCGAGGCAAG
TCCGTTGAGGAGGGCCATGGCATTGACTGGTCCACTGGTGAGGCTCTTGCCTTCGGTACT
CTGCTTGAGGAGGGCCACCACGTCCGACTTTCCGGTCAGGATGTCGAGCGAGGTACCTTC
TCTCAGCGACACGCTGTTGTCCACGACCAGGTTAACGAGACCACATATGTTCCTCTGAAC
CACCTGACCAAGGATCAGGCCGACTTCACCGTCTCCAACTCCCATCTTTCCGAGTACGGT
GTCATGGGCTTTGAGTACGGTTACTCCCTGGCTTCTCCTGAGGCCCTTGTCATCTGGGAG
GCTCAGTTTGGTGACTTCGCCAACACTGCCCAGGTCATCATTGATCAGTTCATTGCCTCC
GCCGAGACCAAGTGGTCTCAGCGATCCGGCTTGGTTCTGTCTCTGCCCCACGGATACGAT
GGACAGGGTCCCGAGCATTCTTCCGGACGAATTGAGCGATACCTGCTGCTCGGAAACGAG
GATCCTCTCCACTTCCCCTCTCCCGATAAGCTTGAGCGACAGCACCAGGACTGCAACATC
CAGATTGCTTACCCCACTACCCCCGCCAACATCTTCCATCTGTACCGACGACAGATGCAC
CGAGCTTTCCGAAAGCCTCTGGCCTGCTTCTTCTCTAAGAACCTGCTGCGAAACCCCATG
GCCAAGTCCGACCTCTCTGAGTTTGTTGGTGAGTCTCACTTCCAGTGGGTCATTGAGGAC
GACCAGCATGGCAAGACCATCAACAACAAGGAGGGCATCGAGCGAGTTCTCTTCTGTTCC
GGCCAGGTCTGGACTGCTCTCTTCAAGCGACGAGAGGATCTTGCTGACAAGAAGACTGCT
ATCATCCGAATCGAGCAGCTGCACCCCTTCCCTTGGGAGCAGGTCCGAGAGCTTCTGGAC
TCTTACCCCAACCTTAAGGATATCTGCTGGGCTCAGGAGGAGCCTCTTAACGCTGGTGCC
TGGGTCCACATCCAGCCTCGAATGTACACCACCTTCCAGGCTACCAAGAACCACAAGCAT
GCCCACATTAGATACGCTGGCCGAAAGCCTTCTGCATCTGTTGCTGCCGGTACTAAGAAG
CTGCATCTTGCTGAGGAGGAGGCTCTTCTGAAGCAGGCTTTCCAGCAGGAGGATAAGGCC
TAA

Coding sequence    

>YALI0E33517g.cds
ATGCTCAGACACGCTTTATTGAAAAGGACCCCCGTCCTCCCGCGACTCACAAGTCGCAAG
GCCTTCGTGCCATTGATCCAGAAACGAAAGTACTCTGATGACGTCTTCCTTACCACCAAC
GCCGCAAACTACATCGATGAGATGTATGCCGCATGGAAGGACGACCCCAAGAGTGTCCAC
GTCTCTTGGCAGTCTTACTTCAAGAACCTGGATGGTGGCCTTCCTGCCGACAAGGCATTC
TCTGCCCCTCCCACTATTGTGCCCTCTCCCTCTGGAGGTGTTCCCACCCCCGCTGCCCCC
TCCGGCGCTCCTTCTGACATCACCAACCACATGAAGGCCCAGCTGCTGGTTCGAGCCTAC
CAGGTCCGAGGTCACACCAAGGCCAAGATTGATCCTCTTGGAATCTCCTTTGGCTCCGAC
AAGAACAAGAAGCCCCCTAAGGAGCTGACCCTCGAGTTCTACGGATGGACCGACAAGGAT
CTCGACACCGAGATCACTCTCGGCCCCGGTATCCTCCCCCGATTCGTCGAGAACGGTAAG
AACAAGCGAACTCTCCGTGAGATTATCATGGACTGTGAACGAATCTACTGTGGCTCCTAC
GGTGTTGAGTACATCCATATTCCCTCTCGAGAGGAGTGTGAGTGGATCCGAGACCGAGTC
GAGACCCCCAAGCCTTACAACTACACCCCTGACCAGAAGCGACGAATGCTCGACCGACTT
ATCTGGGCTAACCTCTTCGAGACCTTCCTTGCCTCCAAGTTCCCCAACGACAAGCGATTC
GGTCTTGAGGGTGCCGAGACTGTCGTTGTCGGTATGAAGACTCTGATCGACCGATCCGTC
GATGCCGGAATCGAGGACATTGTTATCGGTATGCCCCATCGAGGTCGACTCAACATGCTT
TCCAACGTTGTGCGAAAGCCCAACGAGTCCATTTTCGCTGAGTTCCAGGGATCTGCTGTC
TTCGACGAGGGTTCTGGAGATGTCAAGTACCATCTGGGTGCCAACTACCAGCGACCCACC
CCCTCTGGAAAGAAGGTCAACCTCTCTCTTGTCGCTAACCCCTCACATCTTGAGGCTGAG
GACCCCGTTGTCCTGGGTAAGACCCGAGCTATCCAGCACATGAAGCATGACGTCGGCACC
TTCGACAAGGCCATGGGTGTGCTCATGCACGGTGACGCTGCCTTTGCCGGCCAGGGTGTT
GTCTACGAGACCATGGGCATGCACTCTCTGCCTGCCTACTCTACCGGTGGAACCATCCAC
ATCATCGTTAACAACCAGATTGGTTTCACCACCGATCCTCGATTCTCCCGATCTACCCCC
TACCCCTCCGATCTGGCTAAATCCATCGATGCCCCCATCTTCCACGTCAACGCTGACGAT
ATGGAGGCCGTCGACTTCATCTTCAACCTGGCTGCTGACTGGCGAGCTACCTTCAAGTCC
GATGTCATCATCGATCTTGTCTGCTACCGAAAGTTCGGTCACAACGAGACCGATCAGCCC
TCGTTCACTCAGCCTCTCATGTACAAGAAGATTGCCGACAAGCCCAACCCTCTTGACATC
TATGTCGACAAGCTTCTCAAGGAGAAGACTTTTACCAAGGAGGACATTGAGGAGCACAAG
CAGTGGGTCTGGGGAATGCTCGAGGAGTCTTTCAAGAAGTCCAAGGACTACGTGCCCCAT
CAGAAGGAGTGGCTCGCTTCTCCTTGGGACGACTTCAAGACCCCCAAGGAGCTTGCCACC
GAGATCCTGCCCCATCTCCCCACATCTGTTGAGGAGAAGAAGCTCAAGGAGATTGGAAAG
GTCATCTCCTCTGTTCCGGAGGGATTCACCCTTCACCGAAACCTCAAGCGAATCTTGTCC
AACCGAGGCAAGTCCGTTGAGGAGGGCCATGGCATTGACTGGTCCACTGGTGAGGCTCTT
GCCTTCGGTACTCTGCTTGAGGAGGGCCACCACGTCCGACTTTCCGGTCAGGATGTCGAG
CGAGGTACCTTCTCTCAGCGACACGCTGTTGTCCACGACCAGGTTAACGAGACCACATAT
GTTCCTCTGAACCACCTGACCAAGGATCAGGCCGACTTCACCGTCTCCAACTCCCATCTT
TCCGAGTACGGTGTCATGGGCTTTGAGTACGGTTACTCCCTGGCTTCTCCTGAGGCCCTT
GTCATCTGGGAGGCTCAGTTTGGTGACTTCGCCAACACTGCCCAGGTCATCATTGATCAG
TTCATTGCCTCCGCCGAGACCAAGTGGTCTCAGCGATCCGGCTTGGTTCTGTCTCTGCCC
CACGGATACGATGGACAGGGTCCCGAGCATTCTTCCGGACGAATTGAGCGATACCTGCTG
CTCGGAAACGAGGATCCTCTCCACTTCCCCTCTCCCGATAAGCTTGAGCGACAGCACCAG
GACTGCAACATCCAGATTGCTTACCCCACTACCCCCGCCAACATCTTCCATCTGTACCGA
CGACAGATGCACCGAGCTTTCCGAAAGCCTCTGGCCTGCTTCTTCTCTAAGAACCTGCTG
CGAAACCCCATGGCCAAGTCCGACCTCTCTGAGTTTGTTGGTGAGTCTCACTTCCAGTGG
GTCATTGAGGACGACCAGCATGGCAAGACCATCAACAACAAGGAGGGCATCGAGCGAGTT
CTCTTCTGTTCCGGCCAGGTCTGGACTGCTCTCTTCAAGCGACGAGAGGATCTTGCTGAC
AAGAAGACTGCTATCATCCGAATCGAGCAGCTGCACCCCTTCCCTTGGGAGCAGGTCCGA
GAGCTTCTGGACTCTTACCCCAACCTTAAGGATATCTGCTGGGCTCAGGAGGAGCCTCTT
AACGCTGGTGCCTGGGTCCACATCCAGCCTCGAATGTACACCACCTTCCAGGCTACCAAG
AACCACAAGCATGCCCACATTAGATACGCTGGCCGAAAGCCTTCTGCATCTGTTGCTGCC
GGTACTAAGAAGCTGCATCTTGCTGAGGAGGAGGCTCTTCTGAAGCAGGCTTTCCAGCAG
GAGGATAAGGCCTAA

Predicted translation product    

>YALI0E33517g.aa
MLRHALLKRTPVLPRLTSRKAFVPLIQKRKYSDDVFLTTNAANYIDEMYAAWKDDPKSVH
VSWQSYFKNLDGGLPADKAFSAPPTIVPSPSGGVPTPAAPSGAPSDITNHMKAQLLVRAY
QVRGHTKAKIDPLGISFGSDKNKKPPKELTLEFYGWTDKDLDTEITLGPGILPRFVENGK
NKRTLREIIMDCERIYCGSYGVEYIHIPSREECEWIRDRVETPKPYNYTPDQKRRMLDRL
IWANLFETFLASKFPNDKRFGLEGAETVVVGMKTLIDRSVDAGIEDIVIGMPHRGRLNML
SNVVRKPNESIFAEFQGSAVFDEGSGDVKYHLGANYQRPTPSGKKVNLSLVANPSHLEAE
DPVVLGKTRAIQHMKHDVGTFDKAMGVLMHGDAAFAGQGVVYETMGMHSLPAYSTGGTIH
IIVNNQIGFTTDPRFSRSTPYPSDLAKSIDAPIFHVNADDMEAVDFIFNLAADWRATFKS
DVIIDLVCYRKFGHNETDQPSFTQPLMYKKIADKPNPLDIYVDKLLKEKTFTKEDIEEHK
QWVWGMLEESFKKSKDYVPHQKEWLASPWDDFKTPKELATEILPHLPTSVEEKKLKEIGK
VISSVPEGFTLHRNLKRILSNRGKSVEEGHGIDWSTGEALAFGTLLEEGHHVRLSGQDVE
RGTFSQRHAVVHDQVNETTYVPLNHLTKDQADFTVSNSHLSEYGVMGFEYGYSLASPEAL
VIWEAQFGDFANTAQVIIDQFIASAETKWSQRSGLVLSLPHGYDGQGPEHSSGRIERYLL
LGNEDPLHFPSPDKLERQHQDCNIQIAYPTTPANIFHLYRRQMHRAFRKPLACFFSKNLL
RNPMAKSDLSEFVGESHFQWVIEDDQHGKTINNKEGIERVLFCSGQVWTALFKRREDLAD
KKTAIIRIEQLHPFPWEQVRELLDSYPNLKDICWAQEEPLNAGAWVHIQPRMYTTFQATK
NHKHAHIRYAGRKPSASVAAGTKKLHLAEEEALLKQAFQQEDKA*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites