YALI0E33517g
highly similar to uniprot|P20967 Saccharomyces cerevisiae YIL125w KGD1 2-oxoglutarate dehydrogenase complex E1 component singleton
Element type: CDS
Element length: 3783 nucleotides,
on sense strand of
Yali0E: join(3975215..3975406,3976175..3978997).
Other names:
YALI-CDS0531.1
YALI-IPF4345
Coding sequence: 1005 codons.
Element length: 3783 nucleotides,
on sense strand of
Yali0E: join(3975215..3975406,3976175..3978997).
Other names:
YALI-CDS0531.1
YALI-IPF4345
Coding sequence: 1005 codons.
Database cross references:
EMBL: CR382131
GeneID: 2912855
GenomeReviews: CR382131_GR
HOGENOM: HBG289950
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR382131
GeneID: 2912855
GenomeReviews: CR382131_GR
HOGENOM: HBG289950
Homologs and Orthologs
Homologs in protein family: GL3R1547Orthologs: strict determination not possible; homologs must be refined manually
Protein YALI0E33517p 
highly similar to uniprot|P20967 Saccharomyces cerevisiae YIL125w KGD1 2-oxoglutarate dehydrogenase complex E1 component singleton; SubName: Full=YALI0E33517p;
Protein domain map
Database cross references:
InterPro: IPR001017
InterPro: IPR005475
InterPro: IPR011603
KEGG: yli:YALI0E33517g
PANTHER: PTHR23152
PIRSF: PIRSF000157
Pfam: PF00676
Pfam: PF02779
RefSeq: XP_504734.2
SMART: SM00861
TIGRFAMs: TIGR00239
UniProtKB/TrEMBL: Q6C3M8
UniProtKB: Q6C3M8_YARLI
InterPro: IPR001017
InterPro: IPR005475
InterPro: IPR011603
KEGG: yli:YALI0E33517g
PANTHER: PTHR23152
PIRSF: PIRSF000157
Pfam: PF00676
Pfam: PF02779
RefSeq: XP_504734.2
SMART: SM00861
TIGRFAMs: TIGR00239
UniProtKB/TrEMBL: Q6C3M8
UniProtKB: Q6C3M8_YARLI
Sequence data 
>YALI0E33517g.nt ATGCTCAGACACGCTTTATTGAAAAGGACCCCCGTCCTCCCGCGACTCACAAGTCGCAAG GCCTTCGTGCCATTGATCCAGAAACGAAAGTACTCTGATGACGTCTTCCTTACCACCAAC GCCGCAAACTACATCGATGAGATGTATGCCGCATGGAAGGACGACCCCAAGAGTGTCCAC GTCTCTTGGCAGgtgagtgatagtggaggaatcggaagctgatggcttgtactacgactt gtgatggttatttttggttgttttggtggacattcgacacttggcgtgtgaagtgatgga gttgggcccaaggtcatggaggcaagtggcttatttgcaaccgaactgtatcacggatac agtgagagtgtacagataagtcgttttgttgacacgacacctacgaatggttgacccctt cctccttgctccagaatgcactagctagcacaattgagagacctgctgttgcatttttct gcgcctgcctgtagcaatttgtatgtctctcgattggctcagtacctctgcatctctgca cactagacctgcccttcgtttgcggttgccaccactagactcagggtgtcttcccgcatt gggagttacgggttgagccggagcatggaaaaacgtgcatgcaggacgcagaattgttgg tcggaacggactaacaggtgtatgcgccgagagagatcccggcgtgtgtgattagtaaag gttctggacgtgtttttgacataccccacctgactattcctcaattgtccattcctgggc tttttggattttagtttcattactttttgcacatgcttggtgtttatctctaactcctgc cccagcaagtcccaactgcaccttaatttaggcctccctgacttaactaaatcccaacta aacctacccagcaaatgtatcaaccagaagcacggccgcaattgacctatactaactcag TCTTACTTCAAGAACCTGGATGGTGGCCTTCCTGCCGACAAGGCATTCTCTGCCCCTCCC ACTATTGTGCCCTCTCCCTCTGGAGGTGTTCCCACCCCCGCTGCCCCCTCCGGCGCTCCT TCTGACATCACCAACCACATGAAGGCCCAGCTGCTGGTTCGAGCCTACCAGGTCCGAGGT CACACCAAGGCCAAGATTGATCCTCTTGGAATCTCCTTTGGCTCCGACAAGAACAAGAAG CCCCCTAAGGAGCTGACCCTCGAGTTCTACGGATGGACCGACAAGGATCTCGACACCGAG ATCACTCTCGGCCCCGGTATCCTCCCCCGATTCGTCGAGAACGGTAAGAACAAGCGAACT CTCCGTGAGATTATCATGGACTGTGAACGAATCTACTGTGGCTCCTACGGTGTTGAGTAC ATCCATATTCCCTCTCGAGAGGAGTGTGAGTGGATCCGAGACCGAGTCGAGACCCCCAAG CCTTACAACTACACCCCTGACCAGAAGCGACGAATGCTCGACCGACTTATCTGGGCTAAC CTCTTCGAGACCTTCCTTGCCTCCAAGTTCCCCAACGACAAGCGATTCGGTCTTGAGGGT GCCGAGACTGTCGTTGTCGGTATGAAGACTCTGATCGACCGATCCGTCGATGCCGGAATC GAGGACATTGTTATCGGTATGCCCCATCGAGGTCGACTCAACATGCTTTCCAACGTTGTG CGAAAGCCCAACGAGTCCATTTTCGCTGAGTTCCAGGGATCTGCTGTCTTCGACGAGGGT TCTGGAGATGTCAAGTACCATCTGGGTGCCAACTACCAGCGACCCACCCCCTCTGGAAAG AAGGTCAACCTCTCTCTTGTCGCTAACCCCTCACATCTTGAGGCTGAGGACCCCGTTGTC CTGGGTAAGACCCGAGCTATCCAGCACATGAAGCATGACGTCGGCACCTTCGACAAGGCC ATGGGTGTGCTCATGCACGGTGACGCTGCCTTTGCCGGCCAGGGTGTTGTCTACGAGACC ATGGGCATGCACTCTCTGCCTGCCTACTCTACCGGTGGAACCATCCACATCATCGTTAAC AACCAGATTGGTTTCACCACCGATCCTCGATTCTCCCGATCTACCCCCTACCCCTCCGAT CTGGCTAAATCCATCGATGCCCCCATCTTCCACGTCAACGCTGACGATATGGAGGCCGTC GACTTCATCTTCAACCTGGCTGCTGACTGGCGAGCTACCTTCAAGTCCGATGTCATCATC GATCTTGTCTGCTACCGAAAGTTCGGTCACAACGAGACCGATCAGCCCTCGTTCACTCAG CCTCTCATGTACAAGAAGATTGCCGACAAGCCCAACCCTCTTGACATCTATGTCGACAAG CTTCTCAAGGAGAAGACTTTTACCAAGGAGGACATTGAGGAGCACAAGCAGTGGGTCTGG GGAATGCTCGAGGAGTCTTTCAAGAAGTCCAAGGACTACGTGCCCCATCAGAAGGAGTGG CTCGCTTCTCCTTGGGACGACTTCAAGACCCCCAAGGAGCTTGCCACCGAGATCCTGCCC CATCTCCCCACATCTGTTGAGGAGAAGAAGCTCAAGGAGATTGGAAAGGTCATCTCCTCT GTTCCGGAGGGATTCACCCTTCACCGAAACCTCAAGCGAATCTTGTCCAACCGAGGCAAG TCCGTTGAGGAGGGCCATGGCATTGACTGGTCCACTGGTGAGGCTCTTGCCTTCGGTACT CTGCTTGAGGAGGGCCACCACGTCCGACTTTCCGGTCAGGATGTCGAGCGAGGTACCTTC TCTCAGCGACACGCTGTTGTCCACGACCAGGTTAACGAGACCACATATGTTCCTCTGAAC CACCTGACCAAGGATCAGGCCGACTTCACCGTCTCCAACTCCCATCTTTCCGAGTACGGT GTCATGGGCTTTGAGTACGGTTACTCCCTGGCTTCTCCTGAGGCCCTTGTCATCTGGGAG GCTCAGTTTGGTGACTTCGCCAACACTGCCCAGGTCATCATTGATCAGTTCATTGCCTCC GCCGAGACCAAGTGGTCTCAGCGATCCGGCTTGGTTCTGTCTCTGCCCCACGGATACGAT GGACAGGGTCCCGAGCATTCTTCCGGACGAATTGAGCGATACCTGCTGCTCGGAAACGAG GATCCTCTCCACTTCCCCTCTCCCGATAAGCTTGAGCGACAGCACCAGGACTGCAACATC CAGATTGCTTACCCCACTACCCCCGCCAACATCTTCCATCTGTACCGACGACAGATGCAC CGAGCTTTCCGAAAGCCTCTGGCCTGCTTCTTCTCTAAGAACCTGCTGCGAAACCCCATG GCCAAGTCCGACCTCTCTGAGTTTGTTGGTGAGTCTCACTTCCAGTGGGTCATTGAGGAC GACCAGCATGGCAAGACCATCAACAACAAGGAGGGCATCGAGCGAGTTCTCTTCTGTTCC GGCCAGGTCTGGACTGCTCTCTTCAAGCGACGAGAGGATCTTGCTGACAAGAAGACTGCT ATCATCCGAATCGAGCAGCTGCACCCCTTCCCTTGGGAGCAGGTCCGAGAGCTTCTGGAC TCTTACCCCAACCTTAAGGATATCTGCTGGGCTCAGGAGGAGCCTCTTAACGCTGGTGCC TGGGTCCACATCCAGCCTCGAATGTACACCACCTTCCAGGCTACCAAGAACCACAAGCAT GCCCACATTAGATACGCTGGCCGAAAGCCTTCTGCATCTGTTGCTGCCGGTACTAAGAAG CTGCATCTTGCTGAGGAGGAGGCTCTTCTGAAGCAGGCTTTCCAGCAGGAGGATAAGGCC TAA
>YALI0E33517g.cds ATGCTCAGACACGCTTTATTGAAAAGGACCCCCGTCCTCCCGCGACTCACAAGTCGCAAG GCCTTCGTGCCATTGATCCAGAAACGAAAGTACTCTGATGACGTCTTCCTTACCACCAAC GCCGCAAACTACATCGATGAGATGTATGCCGCATGGAAGGACGACCCCAAGAGTGTCCAC GTCTCTTGGCAGTCTTACTTCAAGAACCTGGATGGTGGCCTTCCTGCCGACAAGGCATTC TCTGCCCCTCCCACTATTGTGCCCTCTCCCTCTGGAGGTGTTCCCACCCCCGCTGCCCCC TCCGGCGCTCCTTCTGACATCACCAACCACATGAAGGCCCAGCTGCTGGTTCGAGCCTAC CAGGTCCGAGGTCACACCAAGGCCAAGATTGATCCTCTTGGAATCTCCTTTGGCTCCGAC AAGAACAAGAAGCCCCCTAAGGAGCTGACCCTCGAGTTCTACGGATGGACCGACAAGGAT CTCGACACCGAGATCACTCTCGGCCCCGGTATCCTCCCCCGATTCGTCGAGAACGGTAAG AACAAGCGAACTCTCCGTGAGATTATCATGGACTGTGAACGAATCTACTGTGGCTCCTAC GGTGTTGAGTACATCCATATTCCCTCTCGAGAGGAGTGTGAGTGGATCCGAGACCGAGTC GAGACCCCCAAGCCTTACAACTACACCCCTGACCAGAAGCGACGAATGCTCGACCGACTT ATCTGGGCTAACCTCTTCGAGACCTTCCTTGCCTCCAAGTTCCCCAACGACAAGCGATTC GGTCTTGAGGGTGCCGAGACTGTCGTTGTCGGTATGAAGACTCTGATCGACCGATCCGTC GATGCCGGAATCGAGGACATTGTTATCGGTATGCCCCATCGAGGTCGACTCAACATGCTT TCCAACGTTGTGCGAAAGCCCAACGAGTCCATTTTCGCTGAGTTCCAGGGATCTGCTGTC TTCGACGAGGGTTCTGGAGATGTCAAGTACCATCTGGGTGCCAACTACCAGCGACCCACC CCCTCTGGAAAGAAGGTCAACCTCTCTCTTGTCGCTAACCCCTCACATCTTGAGGCTGAG GACCCCGTTGTCCTGGGTAAGACCCGAGCTATCCAGCACATGAAGCATGACGTCGGCACC TTCGACAAGGCCATGGGTGTGCTCATGCACGGTGACGCTGCCTTTGCCGGCCAGGGTGTT GTCTACGAGACCATGGGCATGCACTCTCTGCCTGCCTACTCTACCGGTGGAACCATCCAC ATCATCGTTAACAACCAGATTGGTTTCACCACCGATCCTCGATTCTCCCGATCTACCCCC TACCCCTCCGATCTGGCTAAATCCATCGATGCCCCCATCTTCCACGTCAACGCTGACGAT ATGGAGGCCGTCGACTTCATCTTCAACCTGGCTGCTGACTGGCGAGCTACCTTCAAGTCC GATGTCATCATCGATCTTGTCTGCTACCGAAAGTTCGGTCACAACGAGACCGATCAGCCC TCGTTCACTCAGCCTCTCATGTACAAGAAGATTGCCGACAAGCCCAACCCTCTTGACATC TATGTCGACAAGCTTCTCAAGGAGAAGACTTTTACCAAGGAGGACATTGAGGAGCACAAG CAGTGGGTCTGGGGAATGCTCGAGGAGTCTTTCAAGAAGTCCAAGGACTACGTGCCCCAT CAGAAGGAGTGGCTCGCTTCTCCTTGGGACGACTTCAAGACCCCCAAGGAGCTTGCCACC GAGATCCTGCCCCATCTCCCCACATCTGTTGAGGAGAAGAAGCTCAAGGAGATTGGAAAG GTCATCTCCTCTGTTCCGGAGGGATTCACCCTTCACCGAAACCTCAAGCGAATCTTGTCC AACCGAGGCAAGTCCGTTGAGGAGGGCCATGGCATTGACTGGTCCACTGGTGAGGCTCTT GCCTTCGGTACTCTGCTTGAGGAGGGCCACCACGTCCGACTTTCCGGTCAGGATGTCGAG CGAGGTACCTTCTCTCAGCGACACGCTGTTGTCCACGACCAGGTTAACGAGACCACATAT GTTCCTCTGAACCACCTGACCAAGGATCAGGCCGACTTCACCGTCTCCAACTCCCATCTT TCCGAGTACGGTGTCATGGGCTTTGAGTACGGTTACTCCCTGGCTTCTCCTGAGGCCCTT GTCATCTGGGAGGCTCAGTTTGGTGACTTCGCCAACACTGCCCAGGTCATCATTGATCAG TTCATTGCCTCCGCCGAGACCAAGTGGTCTCAGCGATCCGGCTTGGTTCTGTCTCTGCCC CACGGATACGATGGACAGGGTCCCGAGCATTCTTCCGGACGAATTGAGCGATACCTGCTG CTCGGAAACGAGGATCCTCTCCACTTCCCCTCTCCCGATAAGCTTGAGCGACAGCACCAG GACTGCAACATCCAGATTGCTTACCCCACTACCCCCGCCAACATCTTCCATCTGTACCGA CGACAGATGCACCGAGCTTTCCGAAAGCCTCTGGCCTGCTTCTTCTCTAAGAACCTGCTG CGAAACCCCATGGCCAAGTCCGACCTCTCTGAGTTTGTTGGTGAGTCTCACTTCCAGTGG GTCATTGAGGACGACCAGCATGGCAAGACCATCAACAACAAGGAGGGCATCGAGCGAGTT CTCTTCTGTTCCGGCCAGGTCTGGACTGCTCTCTTCAAGCGACGAGAGGATCTTGCTGAC AAGAAGACTGCTATCATCCGAATCGAGCAGCTGCACCCCTTCCCTTGGGAGCAGGTCCGA GAGCTTCTGGACTCTTACCCCAACCTTAAGGATATCTGCTGGGCTCAGGAGGAGCCTCTT AACGCTGGTGCCTGGGTCCACATCCAGCCTCGAATGTACACCACCTTCCAGGCTACCAAG AACCACAAGCATGCCCACATTAGATACGCTGGCCGAAAGCCTTCTGCATCTGTTGCTGCC GGTACTAAGAAGCTGCATCTTGCTGAGGAGGAGGCTCTTCTGAAGCAGGCTTTCCAGCAG GAGGATAAGGCCTAA
>YALI0E33517g.aa MLRHALLKRTPVLPRLTSRKAFVPLIQKRKYSDDVFLTTNAANYIDEMYAAWKDDPKSVH VSWQSYFKNLDGGLPADKAFSAPPTIVPSPSGGVPTPAAPSGAPSDITNHMKAQLLVRAY QVRGHTKAKIDPLGISFGSDKNKKPPKELTLEFYGWTDKDLDTEITLGPGILPRFVENGK NKRTLREIIMDCERIYCGSYGVEYIHIPSREECEWIRDRVETPKPYNYTPDQKRRMLDRL IWANLFETFLASKFPNDKRFGLEGAETVVVGMKTLIDRSVDAGIEDIVIGMPHRGRLNML SNVVRKPNESIFAEFQGSAVFDEGSGDVKYHLGANYQRPTPSGKKVNLSLVANPSHLEAE DPVVLGKTRAIQHMKHDVGTFDKAMGVLMHGDAAFAGQGVVYETMGMHSLPAYSTGGTIH IIVNNQIGFTTDPRFSRSTPYPSDLAKSIDAPIFHVNADDMEAVDFIFNLAADWRATFKS DVIIDLVCYRKFGHNETDQPSFTQPLMYKKIADKPNPLDIYVDKLLKEKTFTKEDIEEHK QWVWGMLEESFKKSKDYVPHQKEWLASPWDDFKTPKELATEILPHLPTSVEEKKLKEIGK VISSVPEGFTLHRNLKRILSNRGKSVEEGHGIDWSTGEALAFGTLLEEGHHVRLSGQDVE RGTFSQRHAVVHDQVNETTYVPLNHLTKDQADFTVSNSHLSEYGVMGFEYGYSLASPEAL VIWEAQFGDFANTAQVIIDQFIASAETKWSQRSGLVLSLPHGYDGQGPEHSSGRIERYLL LGNEDPLHFPSPDKLERQHQDCNIQIAYPTTPANIFHLYRRQMHRAFRKPLACFFSKNLL RNPMAKSDLSEFVGESHFQWVIEDDQHGKTINNKEGIERVLFCSGQVWTALFKRREDLAD KKTAIIRIEQLHPFPWEQVRELLDSYPNLKDICWAQEEPLNAGAWVHIQPRMYTTFQATK NHKHAHIRYAGRKPSASVAAGTKKLHLAEEEALLKQAFQQEDKA*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://192.168.122.177/elt/YALI/YALI0E33517g