GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:40:13

Sequence gi568815596r:24847619_25071904 : 224286 bp : 46.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4906   5332  427  1  1  126   51   171 0.748  11.98
 1.02 PlyA +   7438   7443    6                               1.05

 2.00 Prom +  10998  11037   40                              -3.26
 2.01 Init +  13101  13167   67  2  1   91   82    16 0.443   2.55
 2.02 Term +  16826  17031  206  0  2   53   43   132 0.577   2.73
 2.03 PlyA +  17366  17371    6                               1.05

 3.09 PlyA -  17393  17388    6                               1.05
 3.08 Term -  18287  18234   54  2  0   90   36    35 0.027  -3.94
 3.07 Intr -  25101  24952  150  0  0   78   94   374 0.461  37.36
 3.06 Intr -  30398  30304   95  0  2   84   97    66 0.369   6.78
 3.05 Intr -  39163  39004  160  1  1  126  -32   167 0.126   8.06
 3.04 Intr -  39832  39766   67  0  1   51   89    35 0.587  -1.19
 3.03 Intr -  40167  40027  141  2  0   72   85    25 0.310   0.07
 3.02 Intr -  40389  40284  106  1  1   72   60    21 0.536  -3.03
 3.01 Init -  43066  43030   37  1  1   78   99    82 0.683   8.47
 3.00 Prom -  43579  43540   40                              -6.16

 4.00 Prom +  44273  44312   40                              -2.36
 4.01 Init +  45233  45281   49  1  1   86   -3    31 0.166  -7.09
 4.02 Intr +  50565  50853  289  1  1   80   84   179 0.670  13.00
 4.03 Term +  51502  51514   13  1  1  126   48     0 0.553  -2.53
 4.04 PlyA +  51722  51727    6                               1.05

 5.06 PlyA -  52268  52263    6                               1.05
 5.05 Term -  55905  55673  233  1  2   33   42   140 0.595   0.54
 5.04 Intr -  58395  58229  167  2  2   75   98    87 0.941   8.00
 5.03 Intr -  67511  67420   92  2  2   52   81    75 0.866   1.99
 5.02 Intr -  68722  68666   57  1  0   94   86    22 0.749   1.68
 5.01 Init -  71369  70695  675  2  0   77  105  1064 0.994 102.17
 5.00 Prom -  74175  74136   40                              -8.16

 6.05 PlyA -  74268  74263    6                               1.05
 6.04 Term -  75722  75591  132  2  0   92   47    51 0.421  -0.51
 6.03 Intr -  78767  78684   84  2  0   63   78    85 0.573   5.02
 6.02 Intr -  80539  80487   53  2  2   73   60    36 0.219  -2.07
 6.01 Init -  81771  81705   67  0  1   86  105    45 0.573   7.23
 6.00 Prom -  91659  91620   40                              -2.06

 7.08 PlyA -  93397  93392    6                               1.05
 7.07 Term - 100130  99998  133  1  1   96   43   189 0.952  12.76
 7.06 Intr - 103936 103776  161  1  2   71  121   104 0.996  10.79
 7.05 Intr - 109547 109425  123  2  0  103  115    91 0.999  14.08
 7.04 Intr - 110356 110192  165  1  0   50   86   129 0.778   9.06
 7.03 Intr - 115856 115787   70  1  1   37   94    36 0.921  -1.72
 7.02 Intr - 119675 119593   83  2  2   85  106    83 0.968   8.24
 7.01 Init - 124274 124200   75  2  0  114   87   127 0.673  16.29
 7.00 Prom - 130631 130592   40                              -1.06

 8.00 Prom + 138344 138383   40                              -7.26
 8.01 Init + 140871 141264  394  2  1   46  108   123 0.640   7.00
 8.02 Intr + 142479 142606  128  1  2   65    2    88 0.047  -1.70
 8.03 Term + 153797 153976  180  1  0   68   52   106 0.096   2.61
 8.04 PlyA + 156280 156285    6                               1.05

 9.00 Prom + 176123 176162   40                              -3.36
 9.01 Init + 179866 179899   34  0  1   83   80    34 0.030   2.14
 9.02 Intr + 184528 184773  246  2  0   75   85    92 0.036   5.03
 9.03 Intr + 187655 187766  112  0  1   88   80    65 0.072   5.14
 9.04 Term + 196183 196213   31  1  1  125   48    56 0.181   2.63
 9.05 PlyA + 199488 199493    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_1|142_aa
XLRRGGAHVAKPGLRAGMSGHRVLRKYSTGLSVRGLMPGQPRTSTEACCSPTPPADACLP
LHGQEPLPEDPSCQRPRKLGLPSSLPHRVAEPVLGVQPWFSRAHPQDFRLPYPECGWWGC
GLSITRAHQGGEATDREAHLCT
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_1|429_bp
nngctgaggagaggaggggcccacgtggcaaagcccggcctcagagcaggcatgagtggc
caccgtgtactcagaaagtactcaactgggctgagcgtgagaggcctgatgccaggacag
ccccggacctcgacagaggcctgctgctccccaacacccccagcagacgcctgcctgcct
ctccacggccaggagccgctgcccgaggacccgagctgccaacgcccgcggaaacttggg
ttgccctcgtcactgccacacagagtggccgagccggtgctgggagtgcagccctggttc
agccgagcccatccccaggacttcaggctgccctacccagagtgtggctggtggggctgt
ggactcagcatcacccgtgctcatcagggtggggaagccacggaccgtgaggcacacctg
tgcacctga
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_2|90_aa
MAGCPPIWELTVIRRLVIFITEAAPFPIAKPQKTPKCPNEQTKRGVYMQGDISLNGKEML
ICPTTQKNLEDIMLSEISQSKRDKYMIPLI
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_2|273_bp
atggctggctgcccacccatctgggagttgactgtcattaggaggctggtcatcttcata
acagaagcagcaccattcccaatagcaaagccgcagaaaacacccaaatgtccaaatgaa
caaacaaaacgtggtgtatacatgcaaggagatatcagccttaacgggaaagaaatgctg
atctgcccgacgacacagaagaaccttgaagacatcatgctaagtgaaataagccagtca
aaaagggacaaatacatgattccgcttatataa
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_3|269_aa
MREDTNDQYRDAGDENDLDSLKQTQKAGELGINSGVTLGIYNMPNNNRGGSEFKVWQLPR
PRPMMEMKFHSSLSLELHSSPPCFPAPSVGLTERRCSGWPGLRGKGAVELAGGRLERGRL
IRVSESSESVGIARNQLQFLRRVELLRCWELDDEWRELAWQRSSHPSRGGEPDRAVQSKG
DHGARRGDFGGVQSRLLNLMGLILANVFLYLCAIAVGIMSYYMADRKHRKAFLEARQSLE
VKMNLEEQSQQQPLDPGRKGKVVECIGLT
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_3|810_bp
atgagggaggacacaaatgaccagtaccgggatgcaggtgatgaaaatgaccttgactca
ctcaagcagactcagaaagctggagaattgggaataaattcaggagtaaccttgggaatt
tataatatgcctaacaataataggggaggcagcgagtttaaagtgtggcaacttccaagg
cccaggcccatgatggagatgaaattccactccagcctgagtttggagctgcattcttca
ccgccatgctttcctgccccgagtgttggcctcaccgagaggaggtgcagtggctggcct
ggtctccgagggaaaggagccgtggagctcgcaggtggaaggctagagagaggcaggctc
atccgagtgagtgagagcagtgagtccgtgggcatcgcccggaaccagctccagtttcta
aggagagtagagcttttgcgctgctgggagctggatgatgagtggcgagaactggcctgg
cagcgcagctcacacccatctcggggcggggagcctgatcgtgctgtgcagtctaaaggg
gaccacggggcaagaagaggtgactttggtggtgtccagagcaggcttctcaacctcatg
ggtttgatcctggccaacgtcttcctctacctgtgcgccatcgctgtgggcatcatgtcc
tactacatggctgaccgcaagcaccgcaaggccttcctggaggcccgccagtcgctggag
gtgaagatgaacctggaagagcagagccagcagcagcctctcgaccctggtcgcaaggga
aaagtggtggagtgcattgggctcacttaa
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_4|116_aa
MGFCHVGQAGLELPASRSKSNVPFTAQLSTPPPEGLFSPLSQFVRHRGPRGGSRTAFPSP
WGEPHPRTEKKGREWTYCLRDGSRDTDCVGMHLCRAVLGERGFVYFTQCFMKGQSH
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_4|351_bp
atggggttttgccatgttggccaggctggtctcgaactcccggcctcaagatcgaaatcc
aacgtgcccttcacggcccagctcagtactcctcctcctgaggggcttttctctccactc
tcccagttcgtgcgccacagaggcccccggggaggctcgaggaccgcctttccatcaccg
tggggtgagccccaccctcgcaccgagaagaaaggaagggagtggacctactgtctgcgg
gatgggagtcgggacactgactgtgtgggaatgcacctatgcagggctgtcctcggggag
aggggcttcgtgtacttcacacagtgcttcatgaaggggcagagccactag
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_5|407_aa
MPRNQGFSEPEYSAEYSAEYSVSLPSDPDRGVGRTHEISVRNSGSCLCLPRFMRLTFVPE
SLENLYQTYFKRQRHETLLVLVVFAALFDCYVVVMCAVVFSSDKLASLAVAGIGLVLDII
LFVLCKKGLLPDRVTRRVLPYVLWLLITAQIFSYLGLNFARAHAASDTVGWQVFFVFSFF
ITLPLSLSPIVIISVVSCVVHTLVLGVTVAQQQQEELKGMQLLREVDKELILRLLELPGK
RTEKHEEALPVSKVFFLCSPYPPMPMRSSVGKDRRPAGLHNYGVVKGPCESWSTLTPGGF
LLWTLGQSTGVTVIRLEPPGAPPLTNCEIWEKKVEEEQRVKKAPRLPPGSADPAGLRAKT
SRSKCKSEVVFSEEIETDGCGSGYMTVCQNIELYTKKGNCTVHEKGE
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_5|1224_bp
atgccgaggaaccagggcttctccgagcccgaatactcggccgagtactcagccgagtac
tccgtcagcctgccctccgaccctgaccgcggggtgggccggacccatgaaatctcggtc
cggaactcgggctcctgcctgtgcctgcctcgcttcatgcggctgactttcgtgccggag
tccttggagaacctctaccagacctacttcaaaaggcagcgccacgagaccctgctggtg
ctggtggtctttgcagccctctttgactgctacgtggtggtcatgtgtgctgtggtcttc
tccagcgacaagctggcttccctcgccgtggctggaattggactggtgttggacatcatc
ctcttcgtgctctgcaaaaaggggctgctcccggaccgggtcacccgcagagtgctgccc
tacgtgctgtggctgctcataaccgcccagatcttctcctacctgggcctgaacttcgcg
cgtgcccacgcggctagtgacacggtgggctggcaggtcttctttgtcttctccttcttc
atcacgctgcccctcagcctcagccccatcgtgatcatctccgtggtctcctgtgtggtg
cacacgttggtcctgggggtcaccgtggcccagcagcagcaggaggagctcaaggggatg
cagctgctgcgggaggtggacaaggagcttattttgaggctcctggagcttccagggaaa
aggactgagaagcatgaagaagcattgccagtgtcaaaagtcttcttcctctgcagccct
tatccccccatgcccatgcgcagctctgtggggaaggacagaagacctgctggccttcac
aattacggcgtagtgaagggaccttgtgagtcttggtcgaccctcaccccgggtggcttt
ctgctttggactctgggtcaaagcactggcgtcacagtcattcgacttgaaccaccagga
gctcccccacttaccaactgtgagatctgggaaaagaaagtggaggaggaacagagggtg
aagaaggcgcccaggctccccccgggaagcgcagacccggctggtctcagggccaagact
tccagatccaaatgcaagtcagaagtcgtcttctctgaggagatcgagacagacggttgt
ggaagtggttacatgactgtgtgtcagaacatagaactgtacaccaaaaaagggaattgt
actgtacatgaaaaaggggaataa
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_6|111_aa
MARTGETWTLRGEQGEGALNDQAFLEQGIWPVAAEGNLEGGPRNSTRGQPAEPTSCVDGT
DRGEEREEDVKQLSCPGIFTQMLLAPRESSQGPTAVPLRFPYPQGKAVPEP
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_6|336_bp
atggcaagaactggggagacctggacactgagaggggaacaaggagagggtgccctaaat
gaccaagccttcctggaacaagggatctggcctgttgcagctgagggtaacttggaaggg
gggcctagaaacagcacccgagggcagccagcagagcccacgtcctgtgtggatggcacc
gatcgaggagaggagagggaagaggatgtcaagcagctctcttgcccaggaatattcacc
cagatgcttctggctccccgggagtcctcgcaagggcctactgctgtcccactgcgtttc
ccatatccacagggaaaggccgtgccagaaccctga
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_7|269_aa
MPKRKEPGRSLRIKVISMGNAEVGKSCIIKRYCEKRFVSKYLATIGIDYGVTKVHVRDRE
IKVNIFDMAGHPFFYEVRNEFYKDTQGVILVYDVGQKDSFDALDAWLAEMKQELGPHGNM
ENIIFVVCANKIDCTKHRCVDESEGRLWAESKGFLYFETSAQTGEGINEMFQTFYISIVD
LCENGGKRPTTNSSASFTKEQADAIRRIRNSKDSWDMLGVKPGASRDEVNKAYRKLAVLL
HPDKCVAPGSEDAFKAVVNARTALLKNIK
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_7|810_bp
atgccgaagcggaaggagcccggcaggtctctccgcatcaaagtcatctccatgggcaac
gccgaagtggggaaaagctgtattataaagcgatactgtgagaaaagattcgtgtctaaa
tacctggcaacaattggaattgactatggagtcacaaaggtacacgtcagagacagagaa
atcaaagttaacatctttgatatggctggacatcccttcttctatgaggttcgaaatgag
ttttacaaggacacacagggtgtgatactggtctatgatgttgggcagaaagactccttt
gacgcccttgatgcgtggctggcagaaatgaagcaagagcttggacctcatggaaacatg
gaaaatattatatttgtagtttgtgccaacaagattgattgtaccaaacatcgctgtgta
gatgaaagtgaaggacgtctttgggctgaaagcaaagggttcctgtactttgaaacttca
gcacaaactggagaaggcattaatgagatgttccagaccttttatatatccatagttgat
ttatgtgaaaatggcgggaaacgccctaccaccaatagcagtgctagtttcaccaaagaa
caagcagatgccattcgcagaattcgaaatagtaaagacagttgggacatgctgggagtc
aaacctggggcctcaagggatgaagtcaataaagcgtatcggaaacttgctgtgcttctt
caccctgacaaatgtgtagcacctggcagtgaagatgccttcaaagcagttgtgaatgct
cggacagccctcctgaaaaacatcaagtag
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_8|233_aa
MQAALPLGPYDPVDPMVLEVSVADKDAVWSLWQACIGELQWRPLGFWSKALPSSADKYSP
FERRLLACYWSLVETEHLTMDRQVTMQPELPIINWVLSDPSSHKVGYAQQHSIIKWKWYI
WDQAQAGPEGATRIHGSRNQGVEVEVVPLTITPSDPLAKFLLPVAVTLRSAGLESPCENR
DTCPVEKGDAKAENARKDMASLNCGTCSCSCGVDHGNRFEERSSVLLHLPSLF
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_8|702_bp
atgcaagctgctctgccacttgggccatatgacccagtggatccaatggtgcttgaggtg
tcagtggcagataaggatgctgtttggagtctttggcaggcctgcatcggtgaattacag
tggaggcctctaggattttggagcaaggctctgccatcttctgcagataaatactctcct
tttgagagacggctcttggcctgttactggtctttggtggaaactgaacatttgactatg
gatcgtcaagtcaccatgcaacctgaactgcctatcataaactgggtgctttctgaccca
tctagccataaagtgggttatgcacagcagcattccatcatcaaatggaagtggtatata
tgggatcaggctcaagcaggtcctgaaggtgcaaccaggattcatgggtccaggaatcaa
ggggtggaagtggaagtggtaccactcaccatcacccctagtgacccactagcaaaattt
ttgcttcctgttgcagttacattacgttctgctggcctagagagtccgtgtgagaaccgg
gacacctgtcctgttgaaaagggggacgcaaaagctgaaaatgcccggaaggatatggct
tccttgaactgcgggacgtgtagctgcagctgtggggtggaccatggaaacaggtttgaa
gagagaagctctgttcttcttcatctcccgtcattgttctga
>gi568815596r:24847619_25071904|GENSCAN_predicted_peptide_9|140_aa
MMIGSEDLEVPAMSTSSLYLWNPGFFHGSGVELKNLPCSKLPRGCLVQAEGKPSVNHLLQ
FPSELPRRVTLARLICYFICCHITNLQLSYLQSDQQEEEMEATPWTFTSFRVWVGLGFAV
RASITQDGDGRGSSLKLQII
>gi568815596r:24847619_25071904|GENSCAN_predicted_CDS_9|423_bp
atgatgataggatctgaagacctggaagtaccagccatgtccaccagttccctctacctg
tggaacccaggcttcttccatgggtctggagtggagctcaagaatctgccttgcagcaag
ctccccaggggctgcttggtgcaagctgaagggaagccctctgtgaatcacctccttcag
ttccccagcgagcttcctagaagagtgacattggccagactcatctgctacttcatctgt
tgccacatcacaaacctgcagcttagctacttgcaatctgatcagcaggaagaagagatg
gaagcaacgccatggaccttcacgtccttcagagtgtgggtgggcttgggctttgcagtg
agagcaagtatcacccaggatggagatgggagaggtagcagcctgaagctgcagatcatt
tga