GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:48:18 Sequence gi568815586r:114255535_114503898 : 248364 bp : 44.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 51 46 6 1.05 1.04 Term - 38699 38604 96 2 0 113 48 55 0.305 1.97 1.03 Intr - 39392 39361 32 0 2 98 78 21 0.164 -0.05 1.02 Intr - 51589 51539 51 2 0 55 87 71 0.522 2.58 1.01 Init - 52397 52322 76 2 1 85 36 113 0.479 7.05 1.00 Prom - 57985 57946 40 -3.76 2.00 Prom + 58524 58563 40 -5.56 2.01 Init + 60695 60802 108 1 0 92 70 39 0.662 2.72 2.02 Term + 63968 64177 210 1 0 66 52 138 0.597 5.29 2.03 PlyA + 66428 66433 6 1.05 3.00 Prom + 71511 71550 40 -3.36 3.01 Init + 84237 84393 157 2 1 75 78 61 0.627 3.87 3.02 Term + 91206 91318 113 1 2 66 44 96 0.443 1.72 3.03 PlyA + 91546 91551 6 1.05 4.09 PlyA - 92983 92978 6 1.05 4.08 Term - 100572 99998 575 1 2 124 39 434 0.967 36.72 4.07 Intr - 110857 110631 227 0 2 93 113 112 0.935 11.83 4.06 Intr - 130033 129942 92 1 2 119 97 68 0.895 9.59 4.05 Intr - 139359 139207 153 0 0 73 121 46 0.558 6.67 4.04 Intr - 143186 143039 148 1 1 85 84 158 0.999 15.34 4.03 Intr - 144098 143979 120 1 0 116 66 109 0.997 11.11 4.02 Intr - 146386 146292 95 1 2 132 100 42 0.982 8.56 4.01 Init - 148364 148218 147 2 0 110 110 212 0.632 25.69 4.00 Prom - 148770 148731 40 -10.45 5.00 Prom + 148864 148903 40 -10.25 5.01 Init + 150175 150501 327 0 0 92 94 135 0.655 10.59 5.02 Intr + 152260 152630 371 0 2 25 84 220 0.848 9.20 5.03 Intr + 153373 153473 101 1 2 -3 110 101 0.798 2.95 5.04 Intr + 158630 158735 106 0 1 114 0 154 0.777 8.47 5.05 Intr + 159285 159433 149 0 2 85 92 55 0.766 5.48 5.06 Intr + 162558 162687 130 1 1 83 77 54 0.798 3.55 5.07 Intr + 187727 187772 46 2 1 34 86 74 0.186 0.21 5.08 Intr + 188672 188761 90 1 0 70 43 76 0.239 1.49 5.09 Intr + 191842 192016 175 0 1 15 89 73 0.112 -0.39 5.10 Intr + 194575 194823 249 2 0 62 80 132 0.204 7.21 5.11 Term + 202720 202739 20 2 2 110 47 11 0.131 -2.32 5.12 PlyA + 204239 204244 6 1.05 6.06 PlyA - 211484 211479 6 1.05 6.05 Term - 224566 224480 87 1 0 120 45 45 0.496 1.06 6.04 Intr - 225017 224920 98 0 2 2 107 89 0.181 1.93 6.03 Intr - 225347 225244 104 1 2 19 67 106 0.144 1.42 6.02 Intr - 236515 236447 69 0 0 96 96 3 0.242 0.20 6.01 Init - 238876 238800 77 1 2 96 109 5 0.518 4.10 6.00 Prom - 239404 239365 40 -5.86 7.03 PlyA - 239515 239510 6 1.05 7.02 Term - 241984 241911 74 2 2 114 41 41 0.560 0.07 7.01 Intr - 245754 245613 142 0 1 99 28 129 0.897 7.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_1|84_aa MAAKIQAKKELPGFKEMQEGKYVMGITPAPAGGILSVLNGIQSCPKKGMSSGEAVEECGY TSQTQWISGASQKQLSTGNHKRNQ >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_1|255_bp atggctgctaaaatccaagccaagaaagagcttcctggattcaaggaaatgcaagaaggc aagtatgtcatgggcatcacccctgccccagccggtggcatcctgtccgttctgaacggg atccaaagctgtcccaagaaaggcatgtcctcgggtgaggctgttgaagaatgtgggtat acttcacagactcagtggatttctggagcatctcaaaaacaactcagcactggaaatcac aagagaaatcaatga >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_2|105_aa MNQKTISGNSDDMCWQETGGPVKRSNCENLRMLFIKFSEAFTLHGVTTDFKAFSTLGSDS WMCRWQEEEGAASRTPASCLLAKKRGLLQRPELIPPASQKANTLG >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_2|318_bp atgaatcagaaaactatcagcgggaattccgatgacatgtgttggcaggaaacaggcggc ccagtcaaaaggagtaactgcgagaatttaaggatgctatttataaagttctccgaagcc ttcaccctccacggcgtcactaccgacttcaaagccttctccacccttggaagtgactcg tggatgtgtaggtggcaggaggaggagggagccgcgagcaggactccagcctcttgcctt ttggcaaagaagcgagggctgctgcagaggccagaattaattcctcctgctagccagaag gccaacacgttgggatga >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_3|89_aa MRLSPTIGHLQAEEQESQSESQNLKSREANSEANSVAFSLWPKAQEPLANHWCRGSLSSL EIALLQKNRFVVELKVGCCQHALSLLPYV >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_3|270_bp atgaggttaagtcccacaataggccatctgcaagctgaggaacaagaaagccagtctgag tcacaaaacctcaaaagcagggaagccaacagtgaagccaacagtgtagccttcagtctg tggccaaaggcccaagagcccctggcaaaccactggtgccgagggagcctctcatcttta gagatagccctgctgcagaaaaacagatttgtggtggagctcaaggtgggatgctgtcag catgcactgagccttctgccctatgtctga >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_4|518_aa MADADEGFGLAHTPLEPDAKDLPCDSKPESALGAPSKSPSSPQAAFTQQGMEGIKVFLHE RELWLKFHEVGTEMIITKAGRRMFPSYKVKVTGLNPKTKYILLMDIVPADDHRYKFADNK WSVTGKAEPAMPGRLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDPFGHIILNSMHKYQ PRLHIVKADENNGFGSKNTAFCTHVFPETAFIAVTSYQNHKITQLKIENNPFAKGFRGSD DMELHRMSRMQSKEYPVVPRSTVRQKVASNHSPFSSESRALSTSSNLGSQYQCENGVSGP SQDLLPPPNPYPLPQEHSQIYHCTKRKEEECSTTDHPYKKPYMETSPSEEDSFYRSSYPQ QQGLGASYRTESAQRQACMYASSAPPSEPVPSLEDISCNTWPSMPSYSSCTVTTVQPMDR LPYQHFSAHFTSGPLVPRLAGMANHGSPQLGEGMFQHQTSVAHQPVVRQCGPQTGLQSPG TLQPPEFLYSHGVPRTLSPHQYHSVHGVGMVPEWSDNS >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_4|1557_bp atggccgacgcagacgagggctttggcctggcgcacacgcctctggagcctgacgcaaaa gacctgccctgcgattcgaaacccgagagcgcgctcggggcccccagcaagtccccgtcg tccccgcaggccgccttcacccagcagggcatggagggaatcaaagtgtttctccatgaa agagaactgtggctaaaattccacgaagtgggcacggaaatgatcataaccaaggctgga aggcggatgtttcccagttacaaagtgaaggtgacgggccttaatcccaaaacgaagtac attcttctcatggacattgtacctgccgacgatcacagatacaaattcgcagataataaa tggtctgtgacgggcaaagctgagcccgccatgcctggccgcctgtacgtgcacccagac tcccccgccaccggggcgcattggatgaggcagctcgtctccttccagaaactcaagctc accaacaaccacctggacccatttgggcatattattctaaattccatgcacaaataccag cctagattacacatcgtgaaagcggatgaaaataatggatttggctcaaaaaatacagcg ttctgcactcacgtctttcctgagactgcgtttatagcagtgacttcctaccagaaccac aagatcacgcaattaaagattgagaataatccctttgccaaaggatttcggggcagtgat gacatggagctgcacagaatgtcaagaatgcaaagtaaagaatatcccgtggtccccagg agcaccgtgaggcaaaaagtggcctccaaccacagtcctttcagcagcgagtctcgagct ctctccacctcatccaatttggggtcccaataccagtgtgagaatggtgtttccggcccc tcccaggacctcctgcctccacccaacccatacccactgccccaggagcatagccaaatt taccattgtaccaagaggaaagaggaagaatgttccaccacagaccatccctataagaag ccctacatggagacatcacccagtgaagaagattccttctaccgctctagctatccacag cagcagggcctgggtgcctcctacaggacagagtcggcacagcggcaagcttgcatgtat gccagctctgcgccccccagcgagcctgtgcccagcctagaggacatcagctgcaacacg tggccaagcatgccttcctacagcagctgcaccgtcaccaccgtgcagcccatggacagg ctaccctaccagcacttctccgctcacttcacctcggggcccctggtccctcggctggct ggcatggccaaccatggctccccacagctgggagagggaatgttccagcaccagacctcc gtggcccaccagcctgtggtcaggcagtgtgggcctcagactggcctgcagtcccctggc acccttcagccccctgagttcctctactctcatggcgtgccaaggactctatcccctcat cagtaccactctgtgcacggagttggcatggtgccagagtggagcgacaatagctaa >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_5|587_aa MVAPGFMRGLLLTQNSGYCCLLGRTYRWSLRGDCPPPTHTSSAVRLLSLNTSQLASAKEH KIASKSTGNHIISVPRSSFTPPGRESWKPSEWSKSFLSAAFRIKVPGFESSRVKLTSNYP TSSLERKSQECASLESRAQVKTLKRTTKRDLFRPGLLPTSVKKSTYLPAQGWFVVSDKLN ITVYYQAYPSKIKRQPGDSDYLTSRCTYRTWRRHESRNRPARCHVWLGGSGGRLSCKTYR VDAPGGAERRARPAQHRAGERRAFRKGRAFKGFPAVPEYTWRLLPGVLERLRAQSAEGAP VKILGYPPAQATPMLADIRLNVWGSSQISQPALLTSNGSHRSPASWFLVPVAPPLPSPSF HPVVPSSPQGCLSYLPLRKNEASAHVLIVFTTLQSHEHTQKPKKLREDAKGSVCSLQRPS CDQRSIKERRGQEFGIKGNTPAERGERVREGRFSFDSQTRAQPETARHRGVCLSERLLAG PLLPRVNSRRSWTGVRARGNKQTNNQVLWVRITATDSNTKHDDDDDDDDDDNEANIYRPH VSGSVLSASHVLTHLLFATTLEGRQYNCPILQPMTAEVVKSDSSMPN >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_5|1764_bp atggtggctccggggtttatgcggggtttactgcttacccagaatagcggctactgctgc ctactagggcgcacctaccgctggagcctccgcggcgactgcccacctccaacacacacc tcctctgctgtgcgcttgctctccctaaatacttcccagttggcaagcgccaaagaacac aaaatagcctccaagagcaccggcaaccatataatctcagtgccccgctcctcctttaca cccccagggagggaaagttggaagcccagtgaatggtcgaagtcctttctgagcgcagct ttcaggattaaagttcccggattcgagtcctccagggtgaaactcacctccaactatccc acctcttctttggagcgcaaaagccaggagtgcgcctcgctggagagcagggctcaggtg aagactttgaaaaggaccaccaaaagagacctctttaggccaggtcttcttccaacgtct gtcaagaagagcacgtacctcccagctcaaggttggtttgtggtctccgacaaattaaat attactgtttattaccaggcataccccagtaaaataaagaggcaaccaggcgatagcgac tatctcaccagccgctgcacctataggacttggagacgtcacgagtcacgcaaccggccc gcgcgctgtcacgtttggctggggggcagcggcgggaggttgagttgcaaaacgtaccgc gtagacgcccctggtggcgccgagagaagagctaggcctgcccagcacagagccggagag cgtcgggccttccggaagggtagagcgtttaagggattcccagcagttcccgagtacacg tggaggctgctccccggcgtgctggagcggctgcgggctcagtctgcggagggcgccccg gtgaagatcctcggctaccccccagcccaggccacccccatgctagcagacatcaggctc aacgtttggggttcctcccagatttctcagcctgctctgctcactagcaacggctcccac aggagccccgcatcctggttcctggtccccgtggcaccaccactgccatcaccatccttc cacccagtggtcccatctagcccccaaggatgcctgagctacctgccactgaggaaaaat gaagcatcagctcatgtcctgatcgtgttcacaaccctccaaagccatgaacatactcag aagcccaagaagctcagagaagatgccaaggggtctgtctgcagcctgcagcgccccagc tgtgaccagagaagcattaaggagcggcgcgggcaagaatttggcatcaaaggaaatact ccagcggagcggggtgagagggtgagagaaggacgcttctcctttgattcgcagacccgg gcgcagccggagaccgcccggcaccgcggcgtctgtctgtctgagcgcctactcgcgggt cctctgctgccccgagtgaactctaggcgcagctggaccggggtgagggcgcgcggaaac aaacaaacaaacaaccaagttctttgggtcaggataacagccacagatagtaacactaaa catgatgatgatgatgatgatgatgatgatgataatgaggccaatatttatcgaccacac gtgtcaggttctgtgctaagtgcttcacatgtgctaactcatctgctctttgcaacaacc ctagaaggcaggcaatacaattgccccattttacaaccaatgactgctgaagtcgtgaag tcagactcctcaatgcctaattga >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_6|144_aa MVGRGQNRGLRGYQLLWEDFAASIPRGPCVGNSFQLDRAEMQFITPSRRRSFRSTQDFLL RVRGRIGDPQLISVACRILPSPEGRRSIAPIRFSKSYEEPTTATTKRVKRLRTLTKTFFP PKKERDQRKMFNVFGKRAPGALRT >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_6|435_bp atggtggggagaggacagaacagaggacttcgggggtaccagctcctctgggaagacttt gctgcctccattcccagaggaccttgtgttggcaacagtttccaactggacagggcggag atgcagttcataacaccctctagaaggcggagcttcaggagcacccaggatttcctgctc cgagtccggggtcgcatcggagacccccagctaatcagcgtcgcctgccggatcttgcct tctccagaaggcagacggtccatagctcccattagattttcaaagagctacgaggagccc accaccgccaccaccaaaagggtgaaaaggttaaggacccttacaaagacatttttcccc ccaaaaaaagagcgggatcaaagaaaaatgtttaacgttttcggcaagcgagcaccaggc gccctgcgcacttaa >gi568815586r:114255535_114503898|GENSCAN_predicted_peptide_7|71_aa CLPLIELQIGGAGDEFNLKTSPPCAAKGNGVSATSKIDTVAQIYGQGGIRFLAKSNSDLK TKRTPGNIIPA >gi568815586r:114255535_114503898|GENSCAN_predicted_CDS_7|216_bp tgtctgcccctgatcgagctgcagatcggaggggccggagacgagtttaatttaaagacg agccccccttgtgctgctaaaggaaatggcgtgtcagctacgtcaaaaatagatactgtt gcacagatatatggacagggaggaattcggtttttggccaagtctaattcagacctgaag accaaaagaactcctggaaacataattccagcttaa