GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:23:13 Sequence gi568815596r:70153858_70354010 : 200153 bp : 43.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 6930 6768 163 0 1 84 111 127 0.832 14.15 1.06 Intr - 11335 11235 101 2 2 41 78 17 0.820 -4.17 1.05 Intr - 11778 11671 108 1 0 109 92 27 0.943 5.46 1.04 Intr - 15804 15700 105 1 0 60 115 56 0.941 5.69 1.03 Intr - 21920 21816 105 0 0 70 110 27 0.892 3.29 1.02 Intr - 25785 25675 111 1 0 102 99 13 0.916 4.15 1.01 Init - 28128 27306 823 0 1 59 78 641 0.998 55.08 1.00 Prom - 31881 31842 40 -4.76 2.00 Prom + 47170 47209 40 -2.16 2.01 Init + 48116 48346 231 1 0 60 110 198 0.515 15.78 2.02 Intr + 48773 48913 141 1 0 54 76 56 0.308 1.55 2.03 Intr + 49490 49666 177 1 0 16 32 158 0.745 3.12 2.04 Term + 50354 50755 402 1 0 16 51 212 0.981 5.45 2.05 PlyA + 51850 51855 6 1.05 3.07 PlyA - 52954 52949 6 1.05 3.06 Term - 65144 65124 21 2 0 114 36 2 0.017 -4.09 3.05 Intr - 70772 70697 76 1 1 73 80 60 0.155 3.22 3.04 Intr - 73965 73878 88 1 1 62 97 60 0.311 3.33 3.03 Intr - 75461 75407 55 2 1 65 92 2 0.395 -3.15 3.02 Intr - 76997 76899 99 2 0 115 121 55 0.871 11.81 3.01 Init - 94530 94375 156 0 0 91 57 95 0.415 4.61 3.00 Prom - 99214 99175 40 -6.76 4.00 Prom + 102505 102544 40 -7.66 4.01 Init + 104308 104419 112 0 1 95 74 243 0.998 21.97 4.02 Intr + 105503 105709 207 0 0 94 71 189 0.981 16.85 4.03 Intr + 107355 107529 175 1 1 126 76 89 0.988 10.50 4.04 Intr + 121102 121313 212 1 2 85 94 323 0.999 31.16 4.05 Intr + 121657 121809 153 2 0 69 115 73 0.771 8.14 4.06 Term + 122877 123535 659 1 2 94 45 465 0.999 36.92 4.07 PlyA + 123648 123653 6 1.05 5.05 PlyA - 125459 125454 6 1.05 5.04 Term - 127827 127777 51 0 0 106 36 61 0.584 0.13 5.03 Intr - 134335 134211 125 2 2 83 103 61 0.770 7.40 5.02 Intr - 135515 135493 23 1 2 122 103 28 0.129 4.89 5.01 Init - 139792 139761 32 1 2 78 106 0 0.092 0.06 5.00 Prom - 140712 140673 40 -2.86 6.04 PlyA - 142145 142140 6 1.05 6.03 Term - 143620 143432 189 1 0 117 42 222 0.999 17.95 6.02 Intr - 147123 146983 141 0 0 131 94 181 0.996 23.55 6.01 Init - 148154 148068 87 2 0 108 74 146 0.735 15.87 6.00 Prom - 181646 181607 40 -3.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 66609 66776 168 2 0 81 54 148 0.879 5.46 S.002 Term + 139375 139665 291 0 0 96 48 224 0.842 14.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:70153858_70354010|GENSCAN_predicted_peptide_1|506_aa MEPNSLRTKVPAFLSDLGKATLRGIRKCPRCGTYNGTRGLSCKNKTCGTIFRYGARKQPS VEAVKIITGSDLQVYSVRQRDRGPDYRCFVELGVSETTIQTVDGTIITQLSSGRCYVPSC LKAATQGVVENQCQHIKLAVNCQAEATPLTLKSSVLNAMQASPETKQTIWQLATEPTGPL VQRITKNILVVKCKASQKHSLGYLHTSFVQKVSGKSLPERRFFCSCQTLKSHKSNASKDE TAQRCIHFFACICAFASDETLAQEFSDFLNFDSSGLKEIIVPQLGCHSESTVSACESTAS KSKKRRKDEVSGAQMNSSLLPQDAVSSNLRKSGLKKPVVASSLKRQACGQLLDEAQVTLS FQDWLASVTERIHQTMHYQFDGKPEPLVFHIPQSFFDALQQRISIGSAKKRLPNSTTAFV RKDALPLGTFSKYTWHITNILQVKQILDTPEMPLEITRSFIQNRDGTYELFKCPKVEVES IAETYGRIEKQPVLRPLELKTFLKVX >gi568815596r:70153858_70354010|GENSCAN_predicted_CDS_1|1518_bp atggaaccaaattctctgaggactaaagtcccagctttcttatctgatttggggaaggcc acattgaggggaatcagaaagtgtccccgatgtggcacatacaatggaacccggggactg agctgtaagaacaagacatgtggaaccatattccgctacggtgcacgcaagcagcctagt gttgaagctgtcaaaatcattacaggctctgatcttcaggtctactcagtgcggcaaaga gaccggggccctgattaccgatgctttgtggagctcggggtttcagagacaacaatccag acagtggatgggacgatcatcactcagctgagctctggacggtgttatgtcccctcatgc ctgaaagctgccactcaaggcgttgtggaaaaccagtgccagcacatcaagctggcggtg aactgccaggcagaggccacccctctgaccctgaagagctcggtcctgaatgcaatgcag gcctccccggaaaccaaacagaccatctggcagttggccacggaacccacaggtcctctg gtgcagagaattactaaaaacatcttggtggtgaaatgcaaggcaagccagaagcacagt ttggggtatttgcatacatcttttgtgcagaaagtcagtggcaaaagcttgcctgagcgc cgcttcttctgctcctgtcagactctgaaatcgcacaagtcaaatgcctccaaggatgag acagcccagagatgcattcatttctttgcttgcatctgtgcctttgccagtgatgagaca ctggctcaggaattctcagacttcctaaattttgattccagcggtcttaaagagattatt gtaccccagttaggttgccattcagaatcaacagtatctgcttgtgagtctactgcctct aagtcaaagaagaggagaaaggatgaagtatctggtgcacagatgaacagttcactactg cctcaagatgcagtgagcagtaatctaaggaaaagtggcctgaaaaagcctgtggttgct tcctcgttaaaaaggcaggcctgtggtcagctgttagatgaggcacaagtgactttatcc ttccaagactggctggccagtgtcacagaacgcatccatcaaaccatgcactatcagttt gatggcaaaccagaaccattggtgttccacattcctcagtcattttttgatgccctgcaa caaagaatatctataggaagtgcaaaaaaacggctccccaactccaccacagcttttgtt cggaaagatgccttgccactgggaaccttttccaagtatacttggcatatcactaatatc ctgcaagttaaacaaatcttagataccccagagatgcccttggaaatcacccgtagcttt atccagaaccgagatgggacttatgagctatttaaatgccctaaagtggaagtagaaagc atagcagaaacctacggtcgtatagaaaaacaaccagtgctgcgacccttggaactaaaa acttttctcaaagttgnn >gi568815596r:70153858_70354010|GENSCAN_predicted_peptide_2|316_aa MWPQHGLRYLVAAVLSRWALVETWAEVDRSPMSMEKALKHLEAYNTEKEGAFASRVGWAF LTMLWKVHAQSLRDTAQVSRQLAQGKGDHILTEWLMAAMWTGWNDARELSKTVSKWQSYA ELVEDTRDQRPHVELAIHWYPTNVQQVLVLVGTDADYSLVYGKRDKFLGKAAYTDGYRGQ SVKMTVDYQELNKVTAPLHAAVLSITDLMDHLTMELGQYHYAVDLANASLSDDIAPESQE QFGFTWEGRQWTFTVLPQGYVHSPTICHGLVATDLATWKFPKGIRLFHYIDDIMLTSDSP ADLEAVVPLLQQHLAA >gi568815596r:70153858_70354010|GENSCAN_predicted_CDS_2|951_bp atgtggccccaacatgggttgcggtacttggtggcagctgtgctgtccagatgggccctg gtggaaacatgggcagaggtagacaggtcccccatgagcatggagaaggccctgaagcac ctggaagcatacaacaccgagaaggagggtgcctttgccagcagagttggatgggcattt ttgactatgctatggaaagtgcatgcccagtccctgcgggatacagcacaggtgagcagg cagttggcacaagggaaaggtgaccatatcctgactgaatggctgatggcagccatgtgg acagggtggaatgatgccagagaattatcaaaaactgtgagtaaatggcaatcatatgca gagctggtggaggacaccagagatcagaggccacatgtggaattggcaatccactggtac cccaccaatgtacagcaggtgttggtgctggtaggtactgatgcagattatagcctcgtc tatgggaaacgggataagtttttaggcaaggctgcatacacagatggttatagaggccag tcagtgaaaatgacagtggactatcaagaactaaataaagtaacagcccctttacatgca gcagtcctgtctatcacggatttgatggaccacctgacgatggaattgggacagtaccat tatgcagtggatttggctaatgcatccctttcagatgatatcgctccagagagccaggaa cagtttggcttcacatgggaagggcgacaatggactttcacagtgttgccacagggctac gtgcatagtcccaccatatgtcatggtctcgttgctacggatttagccacctggaaattt ccaaaggggatccgcctattccattacattgatgatattatgttaacctctgattctcct gcagatttagaagctgtggtgcccctcttgcaacaacatttggcagcatga >gi568815596r:70153858_70354010|GENSCAN_predicted_peptide_3|164_aa MREAGWWSSRREGLGPAPLPAYCSPWTPRYRGFCFLSMHLDAYNQDPICEDETAGNDPYC FVEFHEHRHAAAALAAMNGRKIMGKEVKVNWATTPSSQKKDTSNHFHVFVGDLSPEITTE DIKAAFAPFGRISDARVVKDMATGKSKGYGFVSFFNKWSFSAFQ >gi568815596r:70153858_70354010|GENSCAN_predicted_CDS_3|495_bp atgagggaggcgggatggtggtcgtcccggagggaaggcctcggccctgcgccgctccca gcctattgttctccgtggacaccgcgatatcgtggtttttgtttccttagcatgcacctg gatgcttacaaccaagacccaatctgtgaggacgagacagctggaaatgatccctattgt tttgtggagtttcatgagcatcgtcatgcagctgcagcattagctgctatgaatggacgg aagataatgggtaaggaagtcaaagtgaattgggcaacaacccctagcagtcaaaagaaa gatacaagcaatcatttccatgtctttgttggtgatctcagcccagaaattacaactgaa gatataaaagctgcttttgcaccatttggaagaatatcagatgcccgagtggtaaaagac atggcaacaggaaagtctaagggatatggctttgtctcctttttcaacaaatggtctttc tccgcatttcaataa >gi568815596r:70153858_70354010|GENSCAN_predicted_peptide_4|505_aa MGRVVAELVSSLLGLWLLLCSCGCPEGAELRAPPDKIAIIGAGIGGTSAAYYLRQKFGKD VKIDLFEREEVGGRLATMMVQGQEYEAGGSVIHPLNLHMKRFVKDLGLSAVQASGGLLGI YNGETLVFEESNWFIINVIKLVWRYGFQSLRMHMWVEDVLDKFMRIYRYQSHDYAFSSVE KLLHALGGDDFLGMLNRTLLETLQKAGFSEKFLNEMIAPVMRVNYGQSTDINAFVGAVSL SCSDSGLWAVEGGNKLVCSGLLQASKSNLISGSVMYIEEKTKTKYTGNPTKMYEVVYQIG TETRSDFYDIVLVATPLNRKMSNITFLNFDPPIEEFHQYYQHIVTTLVKGELNTSIFSSR PIDKFGLNTVLTTDNSDLFINSIGIVPSVREKEDPEPSTDGTYVWKIFSQETLTKAQILK LFLSYDYAVKKPWLAYPHYKPPEKCPSIILHDRLYYLNGIECAASAMEMSAIAAHNAALL AYHRWNGHTDMIDQDGLYEKLKTEL >gi568815596r:70153858_70354010|GENSCAN_predicted_CDS_4|1518_bp atggggcgcgtcgtcgcggagctcgtctcctcgctgctggggttgtggctgttgctgtgc agctgcggatgccccgagggcgccgagctgcgtgctccgccagataaaatcgcgattatt ggagccggaattggtggcacttcagcagcctattacctgcggcagaaatttgggaaagat gtgaagatagacctgtttgaaagagaagaggtcgggggccgcctggctaccatgatggtg caggggcaagaatacgaggcaggaggttctgtcatccatcctttaaatctgcacatgaaa cgttttgtcaaagacctgggtctctctgctgttcaggcctctggtggcctactggggata tataatggagagactctggtatttgaggagagcaactggttcataattaacgtgattaaa ttagtttggcgctatggatttcaatccctccgtatgcacatgtgggtagaggacgtgtta gacaagttcatgaggatctaccgctaccagtctcatgactatgccttcagtagtgtcgaa aaattacttcatgctctaggaggagatgacttccttggaatgcttaatcgaacacttctt gaaaccttgcaaaaggccggcttttctgagaagttcctcaatgaaatgattgctcctgtt atgagggtcaattatggccaaagcacggacatcaatgcctttgtgggggcggtgtcactg tcctgttctgattctggcctttgggcagtagaaggtggcaataaacttgtttgctcaggg cttctgcaggcatccaaaagcaatcttatatctggctcagtaatgtacatcgaggagaaa acaaagaccaagtacacaggaaatccaacaaagatgtatgaagtggtctaccaaattgga actgagactcgttcagacttctatgacatcgtcttggtggccactccgttgaatcgaaaa atgtcgaatattacttttctcaactttgatcctccaattgaggaattccatcaatattat caacatatagtgacaactttagttaagggggaattgaatacatctatctttagctctaga cccatagataaatttggccttaatacagttttaaccactgataattcagatttgttcatt aacagtattgggattgtgccctctgtgagagaaaaggaagatcctgagccatcaacagat ggaacatatgtttggaagatcttttcccaagaaactcttactaaagcacaaattttaaag ctctttctgtcctatgattatgctgtgaagaagccatggcttgcatatcctcactataag cccccggagaaatgcccctctatcattctccatgatcgactttattacctcaatggcata gagtgtgcagcaagtgccatggagatgagtgccattgcagcccacaacgctgcactcctt gcctatcaccgctggaacgggcacacagacatgattgatcaggatggcttatatgagaaa cttaaaactgaactatga >gi568815596r:70153858_70354010|GENSCAN_predicted_peptide_5|76_aa MSKAHPPELKKFMDKKLSLKLNGGRHVQGILRGFDPFMNLVIDECVEMATSGQQNNIGMV VIRGNSIIMLEALERV >gi568815596r:70153858_70354010|GENSCAN_predicted_CDS_5|231_bp atgagcaaagctcaccctcccgagttgaaaaaatttatggacaagaagttatcattgaaa ttaaatggtggcagacatgtccaaggaatattgcggggatttgatccctttatgaacctt gtgatagatgaatgtgtggagatggcgactagtggacaacagaacaatattggaatggtg gtaatacgaggaaatagtatcatcatgttagaagccttggaacgagtataa >gi568815596r:70153858_70354010|GENSCAN_predicted_peptide_6|138_aa MAELQQLRVQEAVESMVKSLERENIRKMQGLMFRCSASCCEDSQASMKQVHQCIERCHVP LAQAQALVTSELEKFQDRLARCTMHCNDKAKDSIDAGSKELQVKQQLDSCVTKCVDDHMH LIPTMTKKMKEALLSIGK >gi568815596r:70153858_70354010|GENSCAN_predicted_CDS_6|417_bp atggctgagctgcagcagctccgggtgcaggaggcggtggagtccatggtgaagagtctg gaaagagagaacatccggaagatgcagggtctcatgttccggtgcagcgccagctgttgt gaggacagccaggcctccatgaagcaggtgcaccagtgcatcgagcgctgccatgtgcct ctggctcaagcccaggctttggtcaccagtgagctggagaagttccaggaccgcctggcc cggtgcaccatgcattgcaacgacaaagccaaagattcaatagatgctgggagtaaggag cttcaggtgaagcagcagctggacagttgtgtgaccaagtgtgtggatgaccacatgcac ctcatcccaactatgaccaagaagatgaaggaggctctcttatcaattggaaaataa