GENSCAN 1.0 Date run: 4-Aug-121 Time: 20:39:11 Sequence gi568815596r:183507438_183708119 : 200682 bp : 36.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 286 281 6 1.05 1.06 Term - 707 648 60 2 0 109 42 67 0.234 1.03 1.05 Intr - 2231 2130 102 2 0 73 92 62 0.186 4.55 1.04 Intr - 3414 3313 102 0 0 80 55 52 0.254 0.55 1.03 Intr - 4104 3989 116 1 2 74 97 98 0.852 8.55 1.02 Intr - 17107 17056 52 1 1 105 93 -18 0.063 -2.04 1.01 Init - 22096 22046 51 1 0 103 107 46 0.914 9.21 1.00 Prom - 26783 26744 40 -5.45 2.00 Prom + 34160 34199 40 -7.35 2.01 Init + 36851 37166 316 1 1 50 71 200 0.975 12.04 2.02 Intr + 38538 38771 234 1 0 47 22 206 0.658 6.64 2.03 Intr + 39162 39340 179 1 2 27 59 155 0.668 5.22 2.04 Term + 41826 42350 525 2 0 24 41 224 0.391 4.57 2.05 PlyA + 42458 42463 6 1.05 3.00 Prom + 43219 43258 40 -3.75 3.01 Init + 64578 64666 89 2 2 100 76 71 0.840 7.27 3.02 Term + 72320 72452 133 2 1 77 48 99 0.728 1.38 3.03 PlyA + 73436 73441 6 1.05 4.00 Prom + 77031 77070 40 -7.95 4.01 Sngl + 79791 80267 477 2 0 88 38 449 0.987 35.74 4.02 PlyA + 80663 80668 6 1.05 5.00 Prom + 81198 81237 40 -6.15 5.01 Sngl + 82464 83507 1044 2 0 37 41 367 0.620 23.78 5.02 PlyA + 83654 83659 6 1.05 6.03 PlyA - 84452 84447 6 1.05 6.02 Term - 100593 99998 596 1 2 6 53 549 0.998 36.90 6.01 Init - 100682 100613 70 2 1 92 51 86 0.411 6.56 6.00 Prom - 109687 109648 40 -4.65 7.04 PlyA - 109804 109799 6 1.05 7.03 Term - 114024 113759 266 1 2 21 43 247 0.479 8.19 7.02 Intr - 115324 115231 94 1 1 86 93 42 0.505 3.12 7.01 Init - 117469 117374 96 1 0 75 58 44 0.272 0.46 7.00 Prom - 129333 129294 40 -3.65 8.00 Prom + 152363 152402 40 -3.65 8.01 Init + 156228 156280 53 2 2 75 85 19 0.091 0.98 8.02 Intr + 183914 184010 97 2 1 100 66 31 0.764 1.19 8.03 Term + 184053 184271 219 2 0 123 48 156 0.957 11.16 8.04 PlyA + 185448 185453 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_1|160_aa MDHSRSIPEAKELAKGETHQQSIWSTFYVFPNLKGGQARVQVFENVSVRATKSDIPRSAL WSRRKTSVSAAVLKEISKEISKGPQKPPGYRLCPLQAVGGGEFGPTRLSSRSITIRGIMG QPVTRYFSHLLSCNWETLLQIDQGHQATDGLKNGTPNELN >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_1|483_bp atggaccactcaaggagcataccagaagcaaaggagcttgcaaagggggagactcatcaa caaagtatatggtcaacgttttatgtgttccccaatttgaaaggaggacaggcaagggtg caggtttttgagaatgtgtccgtaagggccactaaatctgacattcctcggtccgccttg tggtctaggaggaaaactagtgtttctgctgctgtgttgaaggaaataagcaaagaaatc tccaaaggaccacaaaaacccccaggctatcggttatgtccccttcaagctgtaggggga ggggaatttggcccaacccggctgtcctcaaggtccattaccatccgaggaatcatggga cagcctgtaaccaggtatttctcccacctcctcagttgtaattgggagactttgctacag atagatcaaggccatcaagctacagatggtcttaaaaatggaaccccaaatgagctcaac taa >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_2|417_aa MGRHLSAELDRHITQESSGWDLAGATLGRSFQGKEQAAIFAVLQPPLVIPRQTGSGVDLQ QTPTDLQQRGLLEEKLMNIKEEHQHQQKGCPHRNPIQRPPTSKTKGVKLRTLAVSVTALK AAYLELFVPPGGFVVSLASGVKLQTFKVCVTAHEGRVDPKSEQQQDLSQRVKEQSFHSVE GDPRAQLASPSGSCTGAAGGAACQYRAMHPHSSALGWSMGLGAVEQGAVLIEEAPAAQEP RGEIGNFSKVLGYKINVEKSQAFLYTNNRPTESQIMSELPFTIATKKIKFLGIQLTLDVK DLFKENYKPLLKEIRGHKQVEKHSMFMDKKNIMKMAIMPKGIYRFNAIPIKLPFTFFTEL EKTTLNFISNQKRACTAKTILSKRNKAGGITLLDFKLYYKATVTKTAWYWYGTKIDI >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_2|1254_bp atggggagacacctctcagcagagcttgacagacatatcacacaggagagctctggctgg gatctggcgggtgccactctgggacgaagcttccaggggaaggaacaggcagcaatcttt gctgttctgcagcctccactggtgatacctaggcaaacagggtctggagtggacctccag caaactccaacagacctgcagcagaggggcctgttagaagaaaaactaatgaacataaag gaagagcatcaacatcaacaaaaaggatgtccacacagaaaccccatccaaaggccacca acatcaaagaccaaaggagtgaagctgcggaccttggcggtgagtgttacagctcttaag gcagcgtatctggagttgttcgttcctcccggtgggtttgtggtctcgctggcttcagga gtgaagctgcagaccttcaaggtgtgtgttacagctcatgaaggcagggtggacccaaag agtgagcagcagcaagatttatcgcaaagagtgaaagaacaaagcttccacagtgtggaa ggggacccgagagcccagctggcttcacccagtggatcctgcactggggcagcaggtgga gctgcctgccagtaccgcgccatgcacccacactcctcagcccttgggtggtcgatggga ctgggcgccgtggagcagggggcggtgctcattgaggaggctccggcagcacaggagccc aggggggagataggcaacttcagcaaagtcttaggatacaaaatcaatgtggaaaaatca caagcattcttatacaccaataatagaccaacagagagccaaatcatgagtgaactccca ttcacaattgctacaaagaaaataaaattcctaggaatccaacttacactggatgtgaag gaccttttcaaggagaactacaaaccactgctcaaggaaataagaggacacaaacaagtg gaaaaacattccatgttcatggataagaagaatatcatgaaaatggccataatgcccaaa ggaatttatagattcaatgctatccccatcaagctaccattcactttcttcacggagtta gaaaaaactactttaaatttcatatcgaaccaaaaaagagcatgtacagcaaagacaatc ctaagcaaaaggaacaaagctggaggcatcacactacttgacttcaaattatactacaag gctacagtaaccaaaacagcatggtactggtatggtaccaaaatagatatatag >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_3|73_aa MAETGQCGAQAVASLGATSSFESFHVVLSLLVEIHPTADYLFWYHSIRQVIHVISSPFRT THSQLLSLCLMSR >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_3|222_bp atggctgaaacaggtcaatgtggagctcaggccgtggcttcattgggtgcaacctcaagc tttgaaagcttccatgtggtgttgagcctacttgtggaaattcaccccacagcagattat ctcttttggtatcacagcatccggcaggtgatccatgtaatcagtagccctttcaggaca actcacagtcagctcctgtctctgtgccttatgtctagataa >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_4|158_aa MGKKQSRKTGNSKKQSTCPPPKNRSSSPAMEQRWTENDFDELREEGFSRSNYELQEEIQT KDKEVKNFEKNLDECITRITNTEKCLKELTELKAKARELREECRSLRSQCDQLEERVSAM EDEVNEMKREGKFREKRIKRNEQSLQEIWDYVKNQIYV >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_4|477_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctgtcctcct ccaaagaatcgcagttcctcaccagcaatggaacaaaggtggacggagaatgactttgat gagttgagagaagaaggcttcagtcgatcaaactacgagctacaggaggaaatacaaacc aaagacaaagaagttaaaaactttgaaaaaaatttagacgaatgtataactagaataacc aatacagagaagtgcttaaaggagttgacggagctgaaagccaaggctcgagaactacgt gaagaatgcagaagcctcaggagccaatgcgatcaactggaagaaagagtatcagcgatg gaagatgaagtgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaaaga aatgaacaaagcctgcaagaaatatgggactatgtgaaaaaccaaatctatgtctga >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_5|347_aa MNIDAKILNKILANRIQQHIKNLIHHDQVGFIPGMQGWFNIRKSVNVTQHINRTKDKNHM IISIDAEKAFDKIQQLFMIKTLNKLGIDGTYLKIIRAIYDKPAANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMTVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR DMKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQKRACIAKSVLAKRTKLEASRYLTLNYTTRLQ >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_5|1044_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaaccttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat atacgcaaatcagtaaatgtaacccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacaactcttcatgataaaa actctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctatgac aaacccgcagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcggacgacatgactgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gacatgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatg gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgc attgccaagtcagtcctagccaaaagaacaaagctggaggcatcacgctacctgacttta aactatactacaaggttacagtaa >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_6|221_aa MASEELQKDLEEVKVLLEKATRKTEKSKIETEIKNKLQQKSQKKAELLDNEKPAAVVAPI TTGYTVKISNYGWDQSDKFVKIYITLTGVHQVPTENVQVHFTERSFDLLVKNLNGKSYSM IVNNLLKPISVEGSSKKVKTDTVLILCRKKVENTRWDYLTQVEKECKEKEKPSYDTETDP SEGLMNVLKKIYEDGDDDMKQTINKAWVKSREKQAKGDTEF >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_6|666_bp atggcttcagaagagctacagaaagatctagaagaggtaaaggtgttgctggaaaaggct actaggaagactgaaaaatccaagattgagaccgaaatcaagaacaagttgcaacagaaa tcgcagaagaaagcagaacttcttgataatgaaaaaccagctgctgtggttgctcccatt acaacgggctatacggtgaaaatcagtaattatggatgggatcagtcagataagtttgtg aaaatctacattaccttaactggagttcatcaagttcccactgagaatgtgcaggtgcat ttcacagagaggtcatttgatcttttggtaaagaatttaaatgggaagagttactccatg attgtgaacaatctcttgaaacccatctctgtggaaggcagttcaaaaaaagtcaagact gatacagttcttatattgtgtagaaagaaagtggaaaacacaaggtgggattacctgacc caggttgaaaaagagtgcaaagaaaaagagaagccctcctatgacactgaaacagatcct agtgagggattgatgaatgttctaaagaaaatttatgaagatggagatgatgatatgaag caaaccattaataaagcctgggtgaaatcaagagagaagcaagccaaaggagacacggaa ttttga >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_7|151_aa MTAVFIKRGNLDIDTNMRMPCEDEGRDWGDASISAYRIAASAYSQLFSLTATSTIKDISQ IPTGHTASSSGSLIPMAPDGETPPSRGQQTPHKGELQLASGRYPFGTNLPEEGAGSNICC SAASTGDTKANNVWSGPAANASRPAEERPDC >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_7|456_bp atgactgctgtcttcataaaaagaggtaatttggacatagacactaacatgagaatgcca tgtgaagatgaaggcagagattggggtgatgcttctatttctgcttatagaattgctgct tctgcctactcccaactattctccctaactgctacttccactattaaagacattagtcag attccaacaggacacactgcctcctcaagtgggtccctgatccccatggctcctgatggg gagacacctcccagcaggggtcaacagacacctcataaaggagagctccagctggcatct ggcaggtacccctttgggacaaatcttccagaggaaggagcaggcagcaatatttgttgt tctgcggcctccactggtgataccaaggcaaacaacgtctggagtggacctgcagcaaat gccagcagacctgcagaagagaggcctgactgttag >gi568815596r:183507438_183708119|GENSCAN_predicted_peptide_8|122_aa MTGWQKCEREEQKMHLGRSQGTILEPGIGVKHLRNLPGILLYCGQAGIQTVEEPIPMATT TASTQGGLPGHHQCSLKAQVIFSQLVVNAARARTLLSGQWAPLWPRAGPDMLYKNLGLDS GI >gi568815596r:183507438_183708119|GENSCAN_predicted_CDS_8|369_bp atgactggatggcagaaatgtgagagggaggaacagaagatgcatttgggaaggtcccaa ggtactatattggagccagggatcggagtcaaacaccttagaaatctacctggcatccta ttgtactgtggccaagctggcattcaaaccgtggaggagcctatccccatggccaccacc actgccagcacacagggaggtctgccaggccaccaccaatgttcacttaaggcccaagtt atcttcagtcagcttgtggtgaatgctgccagggctaggactctcctttcaggacagtgg gctcccctctggcccagggcaggtccagatatgctgtacaagaacctaggcctggactca gggatctga