YALI0F23793g


similar to uniprot|Q04458 Saccharomyces cerevisiae YMR110c

Genomic environment map

Element type: CDS
Element length: 1560 nucleotides,
on anti-sense strand of
Yali0F: complement(3119824..3121383).
Other names:
YALI-CDS2201.1
YALI-IPF1431
Coding sequence: 520 codons.
Database cross references:
EMBL: CR382132
GeneID: 2908746
GenomeReviews: CR382132_GR
HOGENOM: HBG752218

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0014
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0F23793p  


similar to uniprot|Q04458 Saccharomyces cerevisiae YMR110c; RecName: Full=Aldehyde dehydrogenase;

Protein domain map

Protein length: 519 amino acids
Protein family: GL3C0014
Database cross references:
Gene3D: G3DSA:3.40.309.10
Gene3D: G3DSA:3.40.605.10
InterPro: IPR012394
InterPro: IPR015590
InterPro: IPR016160
InterPro: IPR016161
InterPro: IPR016162
InterPro: IPR016163
KEGG: yli:YALI0F23793g
PANTHER: PTHR11699:SF15
PIRSF: PIRSF036492
PROSITE: PS00070
PROSITE: PS00687
Pfam: PF00171
RefSeq: XP_505802.1
UniProtKB/TrEMBL: Q6C0L0
UniProtKB: Q6C0L0_YARLI

Phylogeny  

PhylomeDB:YALI0F23793g

Computed results for YALI0F23793p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>YALI0F23793g.nt
ATGTCTACCTTTGATTGGGAATCCATTGTGCCTGCCACTCCTCTCGACCAGATTCCTGGC
GACATCCAGCGACTGCGAAAGGGCTTCCGATCCGGAAAGACCCTCGATCTCAACTACCGA
CTGGACCAGATTCGAAACTTGCACTACGTCCTCAGAGACAATGTCGAGGCCATCAAGGAC
GCCGTGTACAAGGATCTCGGCCGACCCAAGCACGAGACTGACCTGTGCGAGGTGGGTTTC
CTGTGGGGCGAGTTTAACAACGTGGTTGCCAACCTCAAGAAGTGGGCCGCCGACGAGGAC
GTCAAGACCAACCTGCAGTACTCCATCTCCTCCCCCAAGATCCGAAAGCGACCTCTTGGA
AACGTGCTCATCATCTCGCCCTGGAACTACCCCTTTATGCTGACCGTGTCTCCTCTCATT
GGAGCTCTGGCTGCCGGTAACACTGTGGCTGTCAAGTTCTCCGAAATGGCCCCCCACACT
TCCAAAATTGTTGGCGACTTGTGCACCAAGGCCCTCGACCCCGACGTCTTCCAGGCCATC
CAGGGAGGTGTCCCCGTCGTCACCAAGACCCTCGAGCAGAAGTTCGACAAGATTATGTAC
ACTGGTAACCACACTGTCGGTAAGATCATTGCCACTGCCGCCAACAAGTACCTGACACCC
GTCATCCTCGAGCTCGGAGGTAAGTCGCCCGTTTTTGTCACCAAGAACTGCAAGAACATC
AAGCTTGCCGCTAAGCGAGCCCTGTGGGGTAAGGTGGTAAACGCTGGCCAGACCTGTGTG
GCTCCCGACTACGTGATTGTCGAGCCCGAGGTGGAGCAGGAGTTTATCGACGCCTGCAAG
TACTGGATTAACGAGTTCTACAGTGGTAAGATTGACCAGTACAACCCCGACTTTGCCAAG
ATCGCCACCCCCAACCACTGGAACCGACTTACCTCCATGTTGAGCAAGTCCAAGGGAGAG
ATCATTACTGGAGGTAACACTGACGAGAAGACTCGATTCATCGCTCCTACTGTCGTCGCA
AAGGTCCCCGACAATGATTCCCTGATGGAGGACGAGATTTTCGGCCCTCTTCTGCCCATT
CTCACTGCCCGATCCGTCGAGGAGGGTATCAAGTACGTGCACGAGAACCACGACACCCCT
CTTGCCATGTACGTCTTCACTGACAAGGCCTCTGAGGGCGACTACATCCAGTCCCAGATC
AACTCTGGTGGCCTTATCTTCAATGACACTCTGATCCACGTTGGATGTGTCCAGGCTCCG
TTTGGTGGTGTCGGCATGTCCGGTTACGGTGCTTACCATGGCGAGGACTCCTTCCTGGCC
TTCACCCACCGACAAACCTACCTCAACCAGCCCAAGCTTCTGGAGCCTCTTCAGGACGTG
CGATACGCCCCCTACACCAAAACCAAGCGAAGCATGGTCAAGAACCTGCTGCTGGTCGGC
CCCATTTTCCCCCGAACCGGCTCCGTATACCCCAACGTGCTGATCCGAATCTTCCGAAAG
ATTTGGTTCTGGGTCCTTATTGTCGCCATCGGAGCTGCTGGTGCCAAGGCTCTGCTCTAG


Coding sequence    

>YALI0F23793g.cds
ATGTCTACCTTTGATTGGGAATCCATTGTGCCTGCCACTCCTCTCGACCAGATTCCTGGC
GACATCCAGCGACTGCGAAAGGGCTTCCGATCCGGAAAGACCCTCGATCTCAACTACCGA
CTGGACCAGATTCGAAACTTGCACTACGTCCTCAGAGACAATGTCGAGGCCATCAAGGAC
GCCGTGTACAAGGATCTCGGCCGACCCAAGCACGAGACTGACCTGTGCGAGGTGGGTTTC
CTGTGGGGCGAGTTTAACAACGTGGTTGCCAACCTCAAGAAGTGGGCCGCCGACGAGGAC
GTCAAGACCAACCTGCAGTACTCCATCTCCTCCCCCAAGATCCGAAAGCGACCTCTTGGA
AACGTGCTCATCATCTCGCCCTGGAACTACCCCTTTATGCTGACCGTGTCTCCTCTCATT
GGAGCTCTGGCTGCCGGTAACACTGTGGCTGTCAAGTTCTCCGAAATGGCCCCCCACACT
TCCAAAATTGTTGGCGACTTGTGCACCAAGGCCCTCGACCCCGACGTCTTCCAGGCCATC
CAGGGAGGTGTCCCCGTCGTCACCAAGACCCTCGAGCAGAAGTTCGACAAGATTATGTAC
ACTGGTAACCACACTGTCGGTAAGATCATTGCCACTGCCGCCAACAAGTACCTGACACCC
GTCATCCTCGAGCTCGGAGGTAAGTCGCCCGTTTTTGTCACCAAGAACTGCAAGAACATC
AAGCTTGCCGCTAAGCGAGCCCTGTGGGGTAAGGTGGTAAACGCTGGCCAGACCTGTGTG
GCTCCCGACTACGTGATTGTCGAGCCCGAGGTGGAGCAGGAGTTTATCGACGCCTGCAAG
TACTGGATTAACGAGTTCTACAGTGGTAAGATTGACCAGTACAACCCCGACTTTGCCAAG
ATCGCCACCCCCAACCACTGGAACCGACTTACCTCCATGTTGAGCAAGTCCAAGGGAGAG
ATCATTACTGGAGGTAACACTGACGAGAAGACTCGATTCATCGCTCCTACTGTCGTCGCA
AAGGTCCCCGACAATGATTCCCTGATGGAGGACGAGATTTTCGGCCCTCTTCTGCCCATT
CTCACTGCCCGATCCGTCGAGGAGGGTATCAAGTACGTGCACGAGAACCACGACACCCCT
CTTGCCATGTACGTCTTCACTGACAAGGCCTCTGAGGGCGACTACATCCAGTCCCAGATC
AACTCTGGTGGCCTTATCTTCAATGACACTCTGATCCACGTTGGATGTGTCCAGGCTCCG
TTTGGTGGTGTCGGCATGTCCGGTTACGGTGCTTACCATGGCGAGGACTCCTTCCTGGCC
TTCACCCACCGACAAACCTACCTCAACCAGCCCAAGCTTCTGGAGCCTCTTCAGGACGTG
CGATACGCCCCCTACACCAAAACCAAGCGAAGCATGGTCAAGAACCTGCTGCTGGTCGGC
CCCATTTTCCCCCGAACCGGCTCCGTATACCCCAACGTGCTGATCCGAATCTTCCGAAAG
ATTTGGTTCTGGGTCCTTATTGTCGCCATCGGAGCTGCTGGTGCCAAGGCTCTGCTCTAG


Predicted translation product    

>YALI0F23793g.aa
MSTFDWESIVPATPLDQIPGDIQRLRKGFRSGKTLDLNYRLDQIRNLHYVLRDNVEAIKD
AVYKDLGRPKHETDLCEVGFLWGEFNNVVANLKKWAADEDVKTNLQYSISSPKIRKRPLG
NVLIISPWNYPFMLTVSPLIGALAAGNTVAVKFSEMAPHTSKIVGDLCTKALDPDVFQAI
QGGVPVVTKTLEQKFDKIMYTGNHTVGKIIATAANKYLTPVILELGGKSPVFVTKNCKNI
KLAAKRALWGKVVNAGQTCVAPDYVIVEPEVEQEFIDACKYWINEFYSGKIDQYNPDFAK
IATPNHWNRLTSMLSKSKGEIITGGNTDEKTRFIAPTVVAKVPDNDSLMEDEIFGPLLPI
LTARSVEEGIKYVHENHDTPLAMYVFTDKASEGDYIQSQINSGGLIFNDTLIHVGCVQAP
FGGVGMSGYGAYHGEDSFLAFTHRQTYLNQPKLLEPLQDVRYAPYTKTKRSMVKNLLLVG
PIFPRTGSVYPNVLIRIFRKIWFWVLIVAIGAAGAKALL*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites