GENSCAN 1.0	Date run: 5-Nov-116	Time: 19:30:36

Sequence gi568815595r:87159897_87376462 : 216566 bp : 35.61% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   4152   4147    6                               1.05
 1.04 Term -  30404  30187  218  0  2   47   48   176 0.812   5.82
 1.03 Intr -  34725  34656   70  0  1   77  110    23 0.939   1.24
 1.02 Intr -  36432  36205  228  0  0   74   37   104 0.467   1.04
 1.01 Init -  41356  41288   69  1  0   48  116    87 0.502   8.60
 1.00 Prom -  43746  43707   40                              -6.85

 2.00 Prom +  48521  48560   40                              -7.25
 2.01 Init +  59414  59493   80  1  2   80   49   107 0.749   6.58
 2.02 Intr +  60587  60692  106  2  1   59   68    13 0.365  -4.30
 2.03 Intr +  66745  66872  128  0  2   77   99    58 0.678   4.36
 2.04 Intr +  67265  67439  175  2  1   66   33   206 0.253  11.92
 2.05 Intr +  67909  68120  212  0  2   66   52   169 0.189   7.89
 2.06 Intr +  73695  73777   83  0  2   88  103    30 0.549   2.86
 2.07 Intr +  80803  80894   92  2  2   70  119   120 0.998  12.09
 2.08 Intr +  85818  86012  195  2  0  111  106   137 0.957  16.49
 2.09 Intr +  89979  90081  103  2  1   -4   80   103 0.772  -0.87
 2.10 Intr +  93508  93614  107  2  2   82   86   218 0.995  20.01
 2.11 Term +  93816  93926  111  2  0   52   38   176 0.998   6.58
 2.12 PlyA +  94791  94796    6                               1.05

 3.07 PlyA -  95585  95580    6                               1.05
 3.06 Term - 100208  99998  211  1  1   51   44   218 0.943   9.48
 3.05 Intr - 101437 101377   61  2  1   71   61    59 0.962  -1.63
 3.04 Intr - 102339 102175  165  1  0   95   64   140 0.996  11.31
 3.03 Intr - 104616 104392  225  1  0   99   77   197 0.980  16.73
 3.02 Intr - 113600 113451  150  0  0   42  110    41 0.602   0.91
 3.01 Init - 116566 116425  142  1  1   52   84    88 0.891   5.14
 3.00 Prom - 116727 116688   40                              -3.35

 4.00 Prom + 125161 125200   40                              -2.75
 4.01 Sngl + 163372 164082  711  0  0   86   36   926 0.997  83.17
 4.02 PlyA + 165021 165026    6                               1.05

 5.03 PlyA - 165149 165144    6                               1.05
 5.02 Term - 183456 181985 1472  1  2   43   36   534 0.108  33.50
 5.01 Init - 184245 183579  667  0  1   49  -33   261 0.140   5.52
 5.00 Prom - 184337 184298   40                              -6.15

 6.02 PlyA - 184506 184501    6                               1.05
 6.01 Sngl - 185714 185385  330  2  0   88   44   320 0.903  23.37
 6.00 Prom - 210254 210215   40                              -3.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:87159897_87376462|GENSCAN_predicted_peptide_1|194_aa
MMKTDENQKDWPQVPELFGGLEKVGIECLCFSRCTVQAVSESTILESGGRWPSFTTPPGP
APVGTLHVGSSPTFPLCTALAEVFHEGSVPSADFCLYIQRSKVERLSQRSTGIGVLSSLQ
TAVFVGYTRISCGLYEVGWVYSDGLRAYTLPEQSITSNINLLQQAGEKTKSKVVIVYISR
ENLKKDFETNNEKM

>gi568815595r:87159897_87376462|GENSCAN_predicted_CDS_1|585_bp
atgatgaaaactgatgaaaatcaaaaagattggccacaagttccagagctgtttggaggt
ctagagaaggttggcattgagtgtctgtgcttttccagatgcacagtgcaagctgtcagt
gaatctaccattctggagtctggaggacggtggccctctttcacaactccaccaggccct
gccccagtggggactctgcatgtgggctccagccccacatttcccttatgcactgcccta
gcagaggttttccatgagggctctgtcccttcagcagacttctgtctgtatatccagaga
tccaaagttgaaagactttcacaaagaagcacaggaattggagtcttatcttcattgcaa
actgcagtatttgtaggttacacaaggatcagctgtggcttatatgaagtcggttgggtt
tattctgatggtttacgagcttatactctgccagagcagtcaataacaagtaatataaac
ttgctacagcaagctggagagaaaacaaaaagcaaagttgttattgtctacatttcaaga
gaaaacctgaagaaagattttgaaacgaacaatgagaaaatgtga

>gi568815595r:87159897_87376462|GENSCAN_predicted_peptide_2|463_aa
MDNDSKGQHIRVNNVNSDLNKHMEDVRRTALERLTPMIQSPPTSSLPQHMGIMGATIQDE
IWARTLNKKCSIKKLPFVLETPPTLDTGSPETHAGSSESGAGGGSSRLAPLPTPGAGGTG
SRRREESGRGFKLRSAQAPHNAQAPPRSDFSKKCVSSRSPELRQDPERFEEASWPLSVNK
LGNLMDFLLVSSASIVALQLGDILLVGIHTGNWQSRNGEGGDNRLQLGNEVGSGTIRICP
VLLPVSQCPLFPLTLAVTELADVIKEQNRELRGTQRAIIRDRAALEKQEKQLELEIKKMA
KIGNKEACKVLAKQLVHLRKQKTRTFAVSSKVTSMSTQTKVMNSQMKMAGAMSTTAKTMQ
AVNKKMDPQKTLQTMQNFQKENMKMEMTEEMINDTLDDIFDGSDDEEESQDIVNQVLDEI
GIEISGKMAKAPSAARSLPSASTSKATISDEEIERQLKALGVD

>gi568815595r:87159897_87376462|GENSCAN_predicted_CDS_2|1392_bp
atggacaatgatagcaaaggacagcatattcgtgtcaataatgtcaacagtgaccttaat
aaacatatggaagatgttaggagaacagcactggaaagactcacccccatgattcaatca
cctcccaccagctccctcccacaacacatgggaattatgggagctacaattcaagatgag
atttgggcaagaaccttaaacaaaaagtgtagcattaaaaaacttcccttcgtcctcgag
acccccccaactctagacactggatccccggagacccacgctgggtcgtcggagtctgga
gccgggggcggcagctccagactcgccccgctgccaactcccggtgcaggaggaactggc
agccgcagacgtgaggaaagcggccgcggcttcaaactccgtagtgcgcaggcgccacac
aacgcgcaggcgccgcctagaagtgacttctccaaaaagtgtgttagttcccggtcacct
gagctccggcaggatcctgagcgtttcgaggaggctagctggcccttgagtgtcaataaa
cttggaaatctgatggatttcttgttggtttcctctgcttcgattgttgctcttcagttg
ggcgatatactacttgtaggcatccatacaggaaactggcagtctcggaatggggaggga
ggggataacaggctccagttaggaaatgaagttgggtctggcacaatcaggatttgtcca
gtgctgttaccagtgtcacagtgcccactattcccactgacactagctgttacagagctt
gcagatgtaataaaggaacagaatcgagagttacgaggtacacagagggctataatcaga
gatcgagcagctttagagaaacaagaaaaacagctggaattagaaattaagaaaatggcc
aagattggtaataaggaagcttgcaaagttttagccaaacaacttgtgcatctacggaaa
cagaagacgagaacttttgctgtaagttcaaaagttacttctatgtctacacaaacaaaa
gtgatgaattcccaaatgaagatggctggagcaatgtctactacagcaaaaacaatgcag
gcagttaacaagaagatggatccacaaaagacattacaaacaatgcagaatttccagaag
gaaaacatgaaaatggaaatgactgaagaaatgatcaatgatacacttgatgacatcttt
gacggttctgatgacgaagaagaaagccaggatattgtgaatcaagttcttgatgaaatt
ggaattgaaatttctggaaagatggccaaagctccatcagctgctcgaagcttaccatct
gcctctacttcaaaggctacaatctcagatgaagagattgaacggcaactcaaggcttta
ggagtagattag

>gi568815595r:87159897_87376462|GENSCAN_predicted_peptide_3|317_aa
MSCQAFTSADTFIPLNSDASATLPLIMHHSAAECLPVSNHATNVMSTVPSILSLIQTPKC
LCTHFSVTTLGNTATGLHYSVPSCHYGNQPSTYGVMAGSLTPCLYKFPDHTLSHGFPPIH
QPLLAEDPTAADFKQELRRKSKLVEEPIDMDSPEIRELEKFANEFKVRRIKLGYTQTNVG
EALAAVHGSEFSQTTICRFENLQLSFKNACKLKAILSKWLEEAEQVGALYNEKVGANERK
RKRRTTISIAAKDALERHFGEQNKPSSQEIMRMAEELNLEKEVVRVWFCNRRQREKRVKT
SLNQSLFSISKEHLECR

>gi568815595r:87159897_87376462|GENSCAN_predicted_CDS_3|954_bp
atgagttgccaagcttttacttcggctgatacctttatacctctgaattctgacgcctct
gcaactctgcctctgataatgcatcacagtgctgccgagtgtctaccagtctccaaccat
gccaccaatgtgatgtctacagtcccatctattttgtctttgatccaaactcctaaatgt
ttgtgcacacatttctcggtgacaacgttgggaaacacagcaacaggacttcattattct
gttccttcctgtcattatggaaaccagccatcaacctatggagtgatggcaggtagttta
accccttgtctttataaatttcctgaccacaccttgagtcatggatttcctcctatacac
cagcctcttctggcagaggaccccacagctgctgatttcaagcaggaactcaggcggaaa
agtaaattggtggaagagccaatagacatggattctccagaaatcagagaacttgaaaag
tttgccaatgaatttaaagtgagacgaattaaattaggatacacccagacaaatgttggg
gaggccctggcagctgtgcatggctctgaattcagtcaaacaacaatctgccgatttgaa
aatctgcagctcagctttaaaaatgcatgcaaactgaaagcaatattatccaaatggctg
gaggaagctgagcaagtaggagctttgtacaatgaaaaagtgggagcaaatgaaaggaaa
agaaaacgaagaacaactataagcattgctgctaaagatgctctggagagacactttgga
gaacagaataaaccttcttctcaagagatcatgaggatggctgaagaactgaatctggag
aaagaagtagtaagagtttggttttgcaaccggaggcagagagaaaaacgggtgaaaaca
agtctgaatcagagtttattttctatttctaaggaacatcttgagtgcagataa

>gi568815595r:87159897_87376462|GENSCAN_predicted_peptide_4|236_aa
MSIRVTQKFYKVFTSGPRAFSSNSYTSGPGVHISFSSFSRVGSSSFWDGLGRGCDGASSK
GGITAVTVNQSLLSPLNLQVDPNIQAMCTQEKEQIKTLNNKFASFIDKVWLLEQQNKMLE
TKWSLLQQQKMTLSNMHSMLQSYINNLQRQLETLGQENLKLGAELGSMQGLVEGFKNKYK
DKINKRTEMENEFVLIKKVVDDAYMNKAELESHLEGLTDEINFLRQLYEEEIRELS

>gi568815595r:87159897_87376462|GENSCAN_predicted_CDS_4|711_bp
atgtccatcagggtgacccagaagttctacaaggtgttcacctctggcccccgggccttc
agcagcaactcctacacaagtgggcctggtgtccacatcagtttctcaagcttctcccga
gtgggcagcagcagcttctgggatggcctgggcagaggctgtgatggggccagcagcaag
ggaggcatcacagctgtcacagtcaaccagagcctgctgagcccccttaacctgcaggtg
gaccccaacatccaggccatgtgcacccaggagaaggagcagatcaagaccctcaacaac
aagtttgcctccttcattgacaaggtatggttgctggagcagcagaacaagatgctggag
acgaagtggagccttctgcagcagcagaagatgactctgagtaacatgcacagcatgtta
cagagctacatcaacaaccttcagcggcagctggagactctgggtcaggagaacctgaag
ctgggggcagagcttggcagcatgcaggggctagtggagggcttcaagaacaaatataag
gataagatcaataagcgtacagagatggagaatgaatttgtcctcatcaagaaggttgta
gatgacgcttacatgaacaaggcagagctggagtctcacctggaagggctgactgatgag
atcaacttcctcaggcagctgtatgaagaggagatccgggagctgtcctag

>gi568815595r:87159897_87376462|GENSCAN_predicted_peptide_5|712_aa
MGDFNTPLSILDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKGKEIITNYLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFMALNAHKRKQERSKIDTLTSQLK
EQEKQEQTYSKDSRRQEITKIRAELKEIETQKPSKKFMNPGAEIQTTIREYYEHLYANKL
ENLEEMDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQR
YKEELVPFLLKLFQSIEKEGILPNSFYEASVILIPKPGRDTTKKENFRPISLMNIDAKIL
NKILANRIQQHIKKLIHHDQVGFIPAMQGWFNTRKSINVIQHINKTKDKNHMIISIDAEK
AFHKIQQPFMLKTLNKLGIHGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPL
SPLLFNIVLEVLARAIRQEKEINGIQLRKEEVKLSLFAGDMIVYLENPIVSAQNLLKLIS
NFSKVSGYKINVQKSQAFLYTNNRQTESQIMSALPFTIASKRIQYLGIQLTKDVKELFKE
NYKPLLEEIKEGTNRWKNTPCSWVGRINIMNMAILPKVIYRFNAISIKLPMTFFIELEKT
TLKFIWNHKRDRIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQTEI

>gi568815595r:87159897_87376462|GENSCAN_predicted_CDS_5|2139_bp
atgggagattttaacaccccactgtcaatattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaagggaaaagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatggcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaacaagaaaagcaagagcaaacatattcaaaagatagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaccctccaaaaaattcatgaatcca
ggagctgaaatacaaactaccatcagagaatactatgaacacctctacgcaaataaacta
gaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaaccag
gaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatcaat
agcttaccaaccaaaaagagtccaggaccagatggcttcacagccgaattctaccagagg
tacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagaggga
atcctccctaactcattttatgaggccagcgtcatcctgataccaaagccgggcagagac
acaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctc
aataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaa
gtgggcttcatccctgcgatgcaaggttggttcaatacacgcaaatcaataaatgtaatc
cagcatataaacaaaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaag
gcctttcacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattcat
gggatgtatctcaaaataataagagctatctatgacaaacccacagccaatatcatactg
aatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctc
tcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggagaag
gaaataaacggtattcaattaagaaaagaggaagtcaaattgtccctgtttgctggcgac
atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac
accaacaacagacaaacagagagccaaatcatgagtgcactcccattcacaattgcttca
aagagaatacaatacctaggaatccaacttacaaaggatgtgaaggagctcttcaaggag
aactacaaaccactgctcgaggaaataaaagagggtacaaacagatggaagaacactcca
tgctcatgggtaggaagaatcaatatcatgaatatggccatactgcccaaggtaatttac
agattcaatgccatctccatcaagctaccaatgacattcttcatagaattggaaaaaact
actttaaagttcatatggaaccataaaagagaccggatcgccaagtcaattctaagccaa
aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta
accaaaacagcatggtactggtaccaaacagagatatag

>gi568815595r:87159897_87376462|GENSCAN_predicted_peptide_6|109_aa
MGKKKSQKTGNSKKQSASPPPKERSSSPATEQSWMENDFHELREEDFRRSNYSELLEDIQ
TKGKEVENFEKSLEECIIRITNTEKCLKELMELKTKARELREECRSLRS

>gi568815595r:87159897_87376462|GENSCAN_predicted_CDS_6|330_bp
atggggaaaaaaaagagccaaaaaactggaaactctaaaaagcagagtgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttcac
gagctgagagaagaagacttcagacgatcaaattactccgagctactggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaagtttagaagaatgtataattagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagctga