GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:52:07 Sequence gi568815584r:70630383_70909171 : 278789 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5883 5957 75 2 0 78 113 3 0.598 1.51 1.02 Intr + 11283 12153 871 2 1 117 99 657 0.733 60.61 1.03 Intr + 22843 23011 169 2 1 37 70 57 0.005 -2.20 1.04 Intr + 34988 35233 246 2 0 70 82 62 0.016 0.37 1.05 Intr + 36337 36518 182 1 2 33 30 183 0.322 6.51 1.06 Intr + 37182 37364 183 1 0 105 110 216 0.998 25.26 1.07 Term + 40694 40773 80 0 2 99 41 136 0.960 7.93 1.08 PlyA + 41623 41628 6 1.05 2.20 PlyA - 42309 42304 6 1.05 2.19 Term - 43366 43305 62 2 2 90 43 64 0.008 0.07 2.18 Intr - 54169 54098 72 2 0 112 64 31 0.011 2.48 2.17 Intr - 69269 69230 40 2 1 99 45 35 0.009 -1.90 2.16 Intr - 78953 78854 100 1 1 93 81 100 0.627 9.91 2.15 Intr - 88769 88718 52 0 1 120 53 8 0.072 -1.53 2.14 Intr - 88912 88829 84 2 0 67 97 80 0.226 6.59 2.13 Intr - 100482 100087 396 1 0 93 -13 205 0.007 5.45 2.12 Intr - 102960 102157 804 1 0 98 109 667 0.999 61.23 2.11 Intr - 103416 103375 42 1 0 83 98 76 0.967 6.31 2.10 Intr - 104116 104004 113 0 2 85 94 94 0.987 9.72 2.09 Intr - 108016 107863 154 2 1 35 110 56 0.961 1.63 2.08 Intr - 109782 109660 123 1 0 60 28 133 0.627 5.06 2.07 Intr - 112209 111969 241 0 1 96 65 483 0.951 43.92 2.06 Intr - 118622 118447 176 0 2 61 92 164 0.999 13.76 2.05 Intr - 119699 119551 149 1 2 97 115 140 0.999 17.48 2.04 Intr - 122266 122118 149 1 2 74 76 71 0.290 3.53 2.03 Intr - 122649 122556 94 2 1 61 110 -24 0.149 -2.93 2.02 Intr - 130800 130620 181 1 1 82 94 148 0.648 13.73 2.01 Init - 136213 136183 31 1 1 61 86 43 0.631 1.36 2.00 Prom - 136584 136545 40 -5.56 3.00 Prom + 144677 144716 40 -3.66 3.01 Init + 162682 162834 153 0 0 107 81 44 0.616 5.58 3.02 Term + 166346 166402 57 1 0 108 48 61 0.744 1.79 3.03 PlyA + 166681 166686 6 1.05 4.03 PlyA - 168413 168408 6 1.05 4.02 Term - 170698 170193 506 2 2 102 44 665 0.995 58.11 4.01 Init - 178789 178384 406 1 1 73 56 879 0.970 77.95 4.00 Prom - 190195 190156 40 -3.96 5.03 PlyA - 190852 190847 6 1.05 5.02 Term - 191794 191540 255 1 0 56 44 191 0.591 7.09 5.01 Init - 200736 200683 54 0 0 54 115 -22 0.092 -1.58 5.00 Prom - 201320 201281 40 -4.66 6.00 Prom + 202666 202705 40 -1.46 6.01 Init + 207284 207360 77 1 2 64 42 75 0.193 1.06 6.02 Intr + 212268 212393 126 0 0 85 93 20 0.250 1.99 6.03 Term + 218793 218895 103 0 1 33 55 130 0.304 1.95 6.04 PlyA + 220321 220326 6 1.05 7.04 PlyA - 221005 221000 6 1.05 7.03 Term - 240501 240369 133 2 1 119 44 70 0.090 3.26 7.02 Intr - 277533 276992 542 0 2 -5 74 420 0.665 23.00 7.01 Init - 277999 277547 453 1 0 38 54 207 0.532 6.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100482 99998 485 1 2 93 42 223 0.984 13.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_1|601_aa GWTTKSKLIPIAYFKESEGTSWSGKTRRASQSGIEWERAGPSLSLILPSFGQAKLLPGQR PSPPPHSGPRSPRSASPGPASTGPRHPQVAEGGGRERAARSREAGEGPASEAAAAAAGAA SGSSGENGSAGRQTAAGCHGRHEACEARGRRPRLLNPGRRGGGGGGGQIAARTRRRGGGR MERKGSAAGAKGNPSPPAAGEGQRPPPPLCVPGGGGGAPARGQVGAAAEPAELIRRAHEF KSQGAQCYKDKKFREAIGKYHRALLELKGLLPPPGERERDSRPASPAGALKPGRLSEEQS KTVEAIEIDCYNSLAVCMKTEENIGKERGKECSKEGYGETREASEQKGEDKTGARGKSAI SLSVCKTGEVRSCISTSTGSDNMQVRKGTEGPKIPPWREREERGGGLVASAYQKDPLVHQ KIVVLGEQELPSGVSSLLLLLTWLSALQPVKPDSFLTYKIETVIVPTLEYSEDYISNTCK TMRPASVQGKHSTNIGHDGDDDDEDDDEEMMIASACLLQAELVNYERVKEYCLKVLKKEG ENFKALYRSGVAFYHLGDYDKALYYLKEARTQQPTDTNVIRYIQLTEMKLSRCSQREKEA M >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_1|1806_bp ggttggaccacaaagtctaagctaattccaattgcctattttaaagagagtgagggtaca agctggagtggcaagactaggcgcgcctcgcagagcgggattgaatgggaaagagcgggg ccgtctttgtcattaatcctcccctccttcgggcaagccaagcttcttccgggtcagcga ccatccccgccaccccactcggggccacggtccccccgcagcgctagtcccgggcccgcg agcaccggcccccgccacccgcaggttgcggagggaggagggcgggagcgggcggcgcgg agccgggaggcgggggaggggccggcgtccgaggcggcggcggcggcggctggagcagcc tctggcagcagcggggagaatgggagtgcggggcgccagaccgccgcggggtgtcacggc cgccacgaagcctgcgaggcgcggggccggcgcccgcggcttttaaacccgggaaggcgc ggcggcggcggcggcggcgggcagatcgcggcgcgcaccaggcgccggggcggcggccga atggagagaaagggctcggcggccggggccaaggggaacccgagcccgcccgcggccgga gaggggcagcggccaccgccgccgctgtgcgtcccgggcggcggcggaggagccccagcg aggggccaggtcggggcggcggccgagccggccgagctcatccgacgagcgcacgagttc aaaagccaaggggcgcagtgctacaaggacaagaaattccgtgaagccataggcaaatac caccgggcgttgctggagctgaaggggctgctgccgccccccggggaacgggagcgggac tcgcgcccggcctccccggctggggccctgaagcccggccgcctctcggaggagcagagc aagacggtggaagccatcgagatcgactgttacaacagcctggcagtatgcatgaagaca gaggaaaatatagggaaagagagagggaaagaatgcagcaaagaaggctatggggaaaca cgggaagcaagtgagcaaaaaggtgaggacaaaacaggggccaggggaaagagtgccatc tccctaagtgtctgcaagacaggagaggtgagaagctgtataagcaccagcacaggctcc gacaacatgcaggtgaggaaggggacagaagggcctaagatcccaccatggagagagagg gaggagaggggaggggggcttgtagcatcagcttatcagaaggaccccctggtgcaccag aagattgtagtcctgggtgaacaagagctgccctcgggggttagcagcctgttgcttctg ttgacttggctttctgccctgcagccagtgaaacctgacagcttcctcacctataaaatt gaaacagtaatagtacccacattggagtattctgaggattatatcagtaatacatgtaaa acaatgagaccagcatctgtacaaggtaagcactcaacaaatattggccatgatggtgat gatgatgatgaagatgatgatgaagagatgatgattgcctctgcctgcctgctccaggct gagctggtaaactatgaacgagtcaaggaatattgcctcaaagtcttgaagaaggaaggg gagaacttcaaggccctttaccggtctggtgtggccttctaccaccttggggactatgac aaagcactctactacctgaaagaagcaaggacccaacaaccaacagacaccaacgtgatt cggtatatccagctgacggagatgaaactcagccgatgctcccagagagaaaaagaagcc atgtaa >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_2|1020_aa MDINSIVLLRILILQKVENGDLSNKILKITDFGLAREWHRTTKMSAAGTYAWMAPEVIRA SMFSKGSDVWSILPTWAVVLTFLLAVEPLKWDRMQTNKETSQDGGGHLFQEQAMEDNSGF DLGIWLAMLRVPGGHLVEAPQHDHYLDKSHESYGVLLWELLTGEVPFRGIDGLAVAYGVA MNKLALPIPSTCPEPFAKLMEDCWNPDPHSRPSFTNILDQLTTIEESGFFEMPKDSFHCL QDNWKHEIQEMFDQLRAKEKELRTWEEELTRAALQQKNQEELLRRREQELAEREIDILER ELNIIIHQLCQEKPRVKKRKGKFRKSRLKLKDGNRISLPSDFQHKFTVQASPTMDKRKSL INSRSSPPASPTIIPRLRAIQLTPGESSKTWGRSSVVPKEEGEEEEKRAPKKKGRTWGPG TLGQKELASGDEGLKSLVDGYKQWSSSAPNLVKGPRSSPALPGFTSLMEMALLAASWVVP IDIEEDEDSEGPGSGESRLQHSPSQSYLCIPFPRGEDGDGPSSDGIHEEPTPVNSATSTP QLTPTNSLKRGGAHHRRCEVALLGCGAVLAATGLGFDLLEAGKCQLLPLEEPEPPAREEK KRREGLFQRSSRPRRSTSPPSRKLFKKEEPMLLLGDPSASLTLLSLSSISECNSTRSLLR SDSDEIVVYEMPVSPVEAPPLSPCTHNPLVNVRVERFKRDPNQSLTPTHVTLTTPSQPSS HRRTPSDGALKPETLLASRSPSSNGLSPSPGAGMLKTPSPSRDPGEFPRLPDPNVVFPPT PRRWNTQQDSTLERPKTLEFLPRPRPSANRQRLDPWWFVSPSHARSTSPANSSSTETPSN LDSCFASSSSTVEERPGLPALLPFQAGPLPPTERTLLDLDAEGQSVPEATGFKVVPCPKQ YNEVIPEALLMEVESWCWNGEHLQTASRIQPLLNAIVAASFVWGNMRALLSHTREIKDAD AQEERSLGKACDHSHTCEYKACHKLKSHSQGWQIQVHTSSGNNYSESFLTHKDQGKTLAV >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_2|3063_bp atggacataaacagcatcgtgctgctacgtatattgatcctccagaaggtggagaatgga gacctgagcaacaagattctgaagatcactgattttggcctggctcgggaatggcaccga accaccaagatgagtgcggcagggacgtatgcttggatggcacccgaagtcatccgggcc tccatgttttccaaaggcagtgatgtgtggagcattctgcctacttgggcagtagttctc acattccttttagctgtagaacccctgaagtgggatagaatgcagacaaacaaagaaaca agccaggatggaggtgggcatttattccaagagcaagcaatggaggataacagtggcttt gacctaggaatctggttggcaatgctgagagttcctggaggtcacttggtagaagctccc cagcatgaccactacctggataagtcacatgaaagctatggggtgctactttgggagttg ctgactggtgaggtgccctttcgaggcattgatggcttagcagtcgcttatggagtggcc atgaacaaactcgcccttcctattccttctacgtgcccagaaccttttgccaaactcatg gaagactgctggaatcctgatccccactcacgaccatctttcacgaatatcctggaccag ctaaccaccatagaggagtctggtttctttgaaatgcccaaggactccttccactgcctg caggacaactggaaacacgagattcaggagatgtttgaccaactcagggccaaagaaaag gaacttcgcacctgggaggaggagctgacgcgggctgcactgcagcagaagaaccaggag gaactgctgcggcgtcgggagcaggagctggccgagcgggagattgacatcctggaacgg gagctcaacatcatcatccaccagctgtgccaggagaagccccgggtgaagaaacgcaag ggcaagttcaggaagagccggctgaagctcaaggatggcaaccgcatcagcctcccttct gatttccagcacaagttcacggtgcaggcctcccctaccatggataaaaggaagagtctt atcaacagccgctccagtcctcctgcaagccccaccatcattcctcgccttcgagccatc cagttgacaccaggtgaaagcagcaaaacctggggcaggagctcagtcgtcccaaaggag gaaggggaggaggaggagaagagggccccaaagaagaagggacggacgtgggggccaggg acgcttggtcagaaggagcttgcctcgggagatgaaggcctcaagtccctggtagatgga tataagcagtggtcgtccagtgcccccaacctggtgaagggcccaaggagtagcccggcc ctgccagggttcaccagccttatggagatggccttgctggcagccagttgggtggtgccc atcgacattgaagaggatgaggacagtgaaggcccagggagtggagagagtcgcctacag cattcacccagccagtcctacctctgtatcccattccctcgtggagaggatggcgatggc ccctccagtgatggaatccatgaggagcccaccccagtcaactcggccacgagtacccct cagctgacgccaaccaacagcctcaagcggggcggtgcccaccaccgccgctgcgaggtg gctctgctcggctgtggggctgttctggcagccacaggcctagggtttgacttgctggaa gctggcaagtgccagctgcttcccctggaggagcctgagccaccagcccgggaggagaag aaaagacgggagggtctttttcagaggtccagccgtcctcgtcggagcaccagcccccca tcccgaaagcttttcaagaaggaggagcccatgctgttgctaggagacccctctgcctcc ctgacgctgctctccctctcctccatctccgagtgcaactccacacgctccctgctgcgc tccgacagcgatgaaattgtcgtgtatgagatgccagtcagcccagtcgaggcccctccc ctgagtccatgtacccacaaccccctggtcaatgtccgagtagagcgcttcaaacgagat cctaaccaatctctgactcccacccatgtcaccctcaccaccccctcgcagcccagcagt caccggcggactccttctgatggggcccttaagccagagactctcctagccagcaggagc ccctccagcaatgggttgagccccagtcctggagcaggaatgttgaaaacccccagtccc agccgagacccaggtgaattcccccgtctccctgaccccaatgtggtcttccccccaacc ccaaggcgctggaacactcagcaggactctaccttggagagacccaagactctggagttt ctgcctcggccgcgtccttctgccaaccggcaacggctggacccttggtggtttgtgtcc cccagccatgcccgcagcacctccccagccaacagctccagcacagagacgcccagcaac ctggactcctgctttgctagcagtagcagcactgtagaggagcggcctggacttccagcc ctgctcccgttccaggcagggccgctgcccccgactgagcggacgctcctggacctggat gcagaggggcagagtgttccagaggccactggcttcaaggttgtcccctgccctaagcag tataacgaggtgattccagaagccctgcttatggaagtggagagttggtgctggaatggt gagcacctgcagactgctagcagaatccaacccttgctcaatgccattgtagcggcgtca tttgtctggggtaatatgcgagctttgttgtctcacaccagggaaatcaaggacgcagat gcacaagaagaaaggtccctgggcaaagcctgtgaccacagccacacctgtgagtataag gcctgccacaagctcaaatcccacagccagggatggcagatccaggttcacacttcctca gggaataattactcggagtccttcctgacgcacaaagaccaagggaagaccctggcagtt tga >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_3|69_aa MRSVTGSPQVGNKGSTEVGQNRLGVDPTAPGSGRLQQLLAAGNWLQWRRGPACDLDLITG SPAAIWTTG >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_3|210_bp atgaggagcgtgacaggcagtccccaagttgggaacaaagggagtactgaggttggacag aacagactgggagtagacccgacagcaccgggcagtggtcggctccagcagctcctagca gccggcaactggctacagtggagacgtgggccagcctgtgacttggacctcataactggc agtccagcagccatctggaccacagggtga >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_4|303_aa MEPSRALLGCLASAAAAAPPGEDGAGAGAEEEEEEEEEAAAAVGPGELGCDAPLPYWTAV FEYEAAGEDELTLRLGDVVEVLSKDSQVSGDEGWWTGQLNQRVGIFPSNYVTPRSAFSSR CQPGGEDPSCYPPIQLLEIDFAELTLEEIIGIGGFGKVYRAFWIGDEVAVKAARHDPDED ISQTIENVRQEAKLFAMLKHPNIIALRGVCLKEPNLCLVMEFARGGPLNRVLSGKRIPPD ILVNWAVQIARGMNYLHDEAIVPIIHRDLKSSNSEYEEMGLEGLRAVAVVSHDVSRSGLL MEP >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_4|912_bp atggagccctccagagcgcttctcggctgcctagcgagcgccgccgctgccgccccgccg ggggaggatggagcaggggccggggccgaggaggaggaggaggaggaggaggaggcggcg gcggcggtgggccccggggagctgggctgcgacgcgccgctgccctactggacggccgtg ttcgagtacgaggcggcgggcgaggacgagctgaccctgcggctgggcgacgtggtggag gtgctgtccaaggactcgcaggtgtccggcgacgagggctggtggaccgggcagctgaac cagcgggtgggcatcttccccagcaactacgtgaccccgcgcagcgccttctccagccgc tgccagcccggcggcgaggaccccagttgctacccgcccattcagttgttagaaattgat tttgcggagctcaccttggaagagattattggcatcgggggctttgggaaggtctatcgt gctttctggataggggatgaggttgctgtgaaagcagctcgccacgaccctgatgaggac atcagccagaccatagagaatgttcgccaagaggccaagctcttcgccatgctgaagcac cccaacatcattgccctaagaggggtatgtctgaaggagcccaacctctgcttggtcatg gagtttgctcgtggaggacctttgaatagagtgttatctgggaaaaggattcccccagac atcctggtgaattgggctgtgcagattgccagagggatgaactacttacatgatgaggca attgttcccatcatccaccgcgaccttaagtccagcaacagtgagtatgaagagatgggg ctggaggggctcagagcagttgcagttgtatcccacgacgtcagtaggagtggacttcta atggaaccttag >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_5|102_aa MTSNSRVTPGETSRRTAQPRVHVGKLSLLCREETTHRLAEARKLCAGRHGPEAASDRSSD PPAGGLPERGSRAQGPFSALAADLTLPGPLLPLGRERQRRGS >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_5|309_bp atgacttcaaactcacgagtaactccaggtgagaccagcagaagaactgcccagccccgc gtccacgtggggaaactgtccctcctgtgccgggaagaaacgactcaccgccttgcagag gcgcggaagctgtgcgcgggacgccacggccccgaagccgcgagtgaccgcagctccgat cccccggcgggaggcctgcccgagcgtgggagccgagcccagggccctttctccgcactg gctgcggacctaacccttcctgggcccctcctccccctggggagagaacgtcagcgccga ggatcctga >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_6|101_aa MYEISNTLRYVKEDRAKRLHTVGFRWCSTGSFHRDDRMHGRKSTLEGKHTVPEASSNSTS NAYIEVQRNILTCDENRLQTAKGKRKESDEETIAIIFGKIS >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_6|306_bp atgtatgaaatctcaaatacgttacgctacgtgaaagaagacagagccaaaaggttacat acagtaggattccgttggtgttctacaggctccttccatagagatgacaggatgcatgga aggaagagcaccctggaaggaaaacacacagttccagaggcctcttcaaacagtaccagt aatgcatacatagaagtgcaaagaaacattctgacctgtgatgaaaataggctgcagaca gccaagggcaaacgtaaagaatctgatgaggagaccattgcaattatttttggcaaaata tcatga >gi568815584r:70630383_70909171|GENSCAN_predicted_peptide_7|375_aa MRRPLKPQSARRLPHSPTGGGPRRGRPSRVAAGGNPEKGPRTLPLPSERAGAAGAGCRGL VARVTSETRKAPKRGVGARSGGGEPHTERVAAPGGRDPVEEEEGATPASPHRLAGKEPPA PRPHTHHVEGEGQAQQKEPQVEVQRVHEGGLYQPPLSAAHTPCRRICSVCDPIPTRRRRR RRRRRRRPRPGPSSASVAGACSCPVYPPPELRVSEPALRRSLVLLQEEEEAANEEEETRE EEAASGGAEDGGEEETGLLRAAAILSPNTRSRGPAPATRGPPAPPLPPPRPPPAASPGPG RERGPAPGSDREEAAPPPAPPLRSLRPLGRLRWVIVSLCYSTLPLDASSKLGKINIPYLK LVALELNSEEGNLEG >gi568815584r:70630383_70909171|GENSCAN_predicted_CDS_7|1128_bp atgcggcgtcccctaaagccgcagagcgctcggcgcctgccacacagccccacgggcggg gggccccggcggggacggcctagccgagtggccgcaggagggaaccccgagaaggggcct aggacgctgccgcttcccagtgagcgggcgggggcggccggcgcgggttgccgtgggctc gttgctcgtgtcacctccgaaaccaggaaagccccgaagagaggagtgggagcacgcagt gggggcggggagcctcacaccgagagggtggcggcccccgggggacgagaccccgtggaa gaggaagagggcgcgaccccagcatctccccaccggctcgcggggaaggagccacccgct ccccgcccccacactcaccatgtagagggtgaagggcaggcccagcagaaagagccacag gtagaggtgcagcgcgttcacgaaggtggcctgtaccagcccccgctgagcgcggcccac accccctgtcggaggatctgcagcgtctgcgaccccatccccacccggcgccgccgccgt cgccgccgccgccgccgccgtccccgccccggccccagctcggcctcggtcgccggagcc tgcagctgccccgtctatccacctccggagctccgggtgagcgagccggcgcttcggcgg tcgctggtgctgctgcaggaggaggaggaggcagcaaacgaggaggaggagacccgagag gaggaggcggcgagcggcggggcggaggacggcggagaggaggagaccggccttctgaga gccgccgccatcttgtctcctaacacccggagccgcggcccggcccccgccacccgcggc ccgccagcgcccccactgccgccgccgcgacccccgccggccgccagccctgggccaggc agagaaagaggcccggcgccagggagcgaccgagaggaagccgcgccccctccagctcct cctctccgctccctccgccccctgggccggctacgatgggtaattgtgtccctgtgctac agcacactccccttggatgcatcctccaaactgggaaaaattaatatcccctaccttaaa ctggttgccttagaattgaactcagaggaagggaacctagaaggatga