GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:39:04 Sequence gi568815575r:54346145_54595432 : 249288 bp : 45.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11711 11795 85 1 1 37 71 124 0.699 4.62 1.02 Intr + 12211 12321 111 2 0 110 25 61 0.097 2.35 1.03 Intr + 23426 23609 184 0 1 84 82 63 0.620 4.15 1.04 Intr + 24337 24498 162 1 0 80 69 38 0.352 0.19 1.05 Intr + 62197 62340 144 1 0 63 37 87 0.017 0.50 1.06 Intr + 81377 81485 109 2 1 138 66 72 0.541 10.29 1.07 Intr + 94546 94636 91 0 1 48 101 100 0.711 6.87 1.08 Intr + 97256 97347 92 0 2 79 99 136 0.999 13.51 1.09 Intr + 97864 98040 177 0 0 46 57 127 0.724 5.52 1.10 Term + 98272 98406 135 0 0 110 48 99 0.996 6.12 1.11 PlyA + 99130 99135 6 1.05 2.19 PlyA - 99785 99780 6 -0.45 2.18 Term - 100270 99965 306 1 0 91 44 177 0.998 8.72 2.17 Intr - 101310 101167 144 0 0 87 85 167 0.997 16.78 2.16 Intr - 102823 102662 162 1 0 82 94 137 0.996 13.87 2.15 Intr - 103124 102999 126 2 0 102 84 178 0.999 19.68 2.14 Intr - 103616 103515 102 2 0 125 63 86 0.999 10.17 2.13 Intr - 104157 104127 31 2 1 52 92 31 0.773 -1.97 2.12 Intr - 109383 109304 80 0 2 98 80 83 0.971 6.85 2.11 Intr - 109640 109455 186 2 0 74 22 193 0.502 11.19 2.10 Intr - 110222 110076 147 2 0 75 94 223 0.999 22.03 2.09 Intr - 110423 110365 59 0 2 97 101 142 0.934 15.00 2.08 Intr - 119445 119307 139 0 1 103 87 163 0.995 17.74 2.07 Intr - 119708 119552 157 1 1 64 111 271 0.996 27.01 2.06 Intr - 121788 121640 149 0 2 119 77 269 0.999 27.93 2.05 Intr - 122732 122643 90 2 0 72 109 91 0.962 9.79 2.04 Intr - 124313 123872 442 1 1 56 69 515 0.906 39.86 2.03 Intr - 124616 124439 178 0 1 117 113 131 0.999 17.48 2.02 Intr - 125343 125170 174 1 0 123 110 95 0.997 15.11 2.01 Init - 149288 148982 307 2 1 80 105 257 0.721 24.06 2.00 Prom - 170854 170815 40 -3.96 3.00 Prom + 173303 173342 40 -5.06 3.01 Init + 184971 185020 50 2 2 84 46 66 0.586 2.42 3.02 Intr + 192219 192244 26 0 2 83 83 23 0.572 -1.03 3.03 Intr + 192896 192957 62 0 2 125 110 32 0.989 7.55 3.04 Intr + 193991 194098 108 1 0 44 94 79 0.957 4.58 3.05 Intr + 195129 195245 117 2 0 40 116 135 0.997 12.16 3.06 Intr + 196811 196894 84 1 0 139 87 87 0.999 13.72 3.07 Intr + 197063 197198 136 1 1 143 60 143 0.994 17.14 3.08 Intr + 198079 198182 104 2 2 70 107 82 0.780 8.19 3.09 Intr + 202085 202229 145 1 1 104 86 103 0.998 11.56 3.10 Intr + 204819 204906 88 1 1 115 82 62 0.999 7.33 3.11 Intr + 205424 205598 175 2 1 85 46 346 0.999 30.04 3.12 Intr + 205688 205830 143 1 2 120 78 90 0.997 10.35 3.13 Intr + 206148 206284 137 0 2 87 101 177 0.905 19.11 3.14 Intr + 208421 208548 128 0 2 56 77 147 0.980 10.80 3.15 Intr + 212292 212511 220 2 1 113 75 211 0.992 20.17 3.16 Term + 214376 214458 83 0 2 79 37 152 0.804 7.06 3.17 PlyA + 215016 215021 6 1.05 4.00 Prom + 215613 215652 40 -2.06 4.01 Init + 237237 237285 49 2 1 86 58 40 0.310 -0.09 4.02 Term + 240872 241074 203 0 2 53 41 173 0.366 6.65 4.03 PlyA + 242654 242659 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 94278 94358 81 2 0 77 82 125 0.988 9.77 S.002 Term - 230979 230791 189 0 0 70 42 125 0.877 3.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:54346145_54595432|GENSCAN_predicted_peptide_1|429_aa MSRAPLRVVCACARERLPASPPGQLRPVEKKKKKKSPSTHYFLAPHCPPLGLREPLNAVA PLLGGVTLFLTLSNYYSEEGVPLPIVGHYGLYSVPWAEEMMDGHRNPCNIFCPGSSIALW AFAYSTKAVPHVGIPLIGHISIALLPEYITRPLDTAQESVKIQTQWLSNQLAFFSSFPAN RMTIQTNKQTKQKATIVGDDVEKLEPLYVAGGNVKWRSHCGKQFGFLNRYSYGSILLACK ELKLMALKKQEAAMLLCTTYGDGHMIAVENGFGGVHSQEKAKWLGGAVEDYFMRNADLEL DEVEDFLGELLTNEFDTVVEDGSLPQVSQQLQTMFHHFQRGDGAALREMASCITQRKCKV TATALKTARETDEDEDDVDSVEEMEVTATNDGAATDGVCPQPEPSDPDAQTIKEEDIVED GWTIVRRKK >gi568815575r:54346145_54595432|GENSCAN_predicted_CDS_1|1290_bp atgtctcgagctccgctccgcgtcgtgtgcgcttgcgcgcgcgagcgtctccccgcgagc ccaccgggccagctacggccggtagagaagaagaagaagaagaaaagtccaagcacacac tacttcctagccccccactgcccacccttgggtctccgcgagcctctgaatgctgtagcg ccgctgttaggaggtgtgaccttgtttctgactctatcaaactactactcagaggagggt gtccctctgcccattgttgggcattatgggctgtacagtgttccttgggctgaagaaatg atggatggccatcgaaatccatgcaatatcttctgtcctggttcttccatagcactctgg gcatttgcatattctaccaaggctgtgccccatgtgggcatccccttaataggccacatt tccattgccctcctgcctgagtatataaccaggccattggacactgcccaggagtcagta aaaatccaaacacagtggttgtctaatcagctggcattcttctcctcctttcctgctaac aggatgaccatacaaacaaacaaacaaacaaaacagaaagcaacaattgttggtgatgat gtggagaaactggaacccttgtacgttgctggtgggaacgtaaaatggcgcagccactgt ggaaaacagtttgggttcctcaacagatatagctatggctccatcttgctagcatgcaaa gaactcaaactgatggccttgaagaagcaagaagctgctatgctattgtgtactacctat ggagatggccacatgatcgctgtggagaatggcttcgggggtgtgcacagccaggagaag gccaagtggctggggggtgcagtggaggattacttcatgcgcaatgctgacttggagcta gatgaggtggaagacttccttggagagctgttgaccaacgagtttgatacagttgtggaa gacgggagtctgccccaggtgagccagcaactgcagaccatgttccaccacttccagagg ggtgatggggctgctctgagggagatggcctcctgcatcactcagagaaaatgcaaggtc acagccactgcacttaagacagctagagagactgatgaggatgaagatgatgtggacagt gtggaagagatggaggtcacagctacgaatgatggggctgctacagatggggtctgcccc cagcctgaaccctctgatccagacgctcagactattaaggaagaggatatagtggaagat ggctggaccattgtccggagaaaaaaatga >gi568815575r:54346145_54595432|GENSCAN_predicted_peptide_2|992_aa MHGHRAPGGAGPSEPEHPATNPPGAAPPACADSDPGASEPGLLARRGSGSALGGPLDPQF VGPSDTSLGAAPGHRVLPCGPSPQHHRALRFSYHLEGSQPRPGLHQGNRILVKSLSLDPG QSLEPHPEGPQRLRSDPGPPTETPSQRPSPLKRAPGPKPQVPPKPSYLQMPRMPPPLEPI PPPPSRPLPADPRVAKGLAPRAEASPSSAAVSSLIEKFEREPVIVASDRPVPGPSPGPPE PVMLPQPTSQPPVPQLPEGEASRCLFLLAPGPRDGEKVPNRDSGIDSISSPSNSEETCFV SDDGPPSHSLCPGPPALASVPVALADPHRPGSQEVDSDLEEEDDEEEEEEKDREIPVPLM ERQESVELTVQQKVFHIANELLQTEKAYVSRLHLLDQVFCARLLEEARNRSSFPADVVHG IFSNICSIYCFHQQFLLPELEKRMEEWDRYPRIGDILQKLAPFLKMYGEYVKNFDRAVEL VNTWTERSTQFKVIIHEVQKEEACGNLTLQHHMLEPVQRIPRYELLLKDYLLKLPHGSPD SKDAQKSLELIATAAEHSNAAIRKMERMHKLLKVYELLGGEEDIVSPTKELIKEGHILKL SAKNGTTQDRYLILFNDRLLYCVPRLRLLGQKFSVRARIDVDGMEVSIQEVGNGNLEWGI GVGRNKDAGFDPESVCLKESSNLNLPRTFLVSGKQRSLELQARTEEEKKDWVQAINSTLL KHEQTLETFKLLNSTNREDEDTPPNSPNVDLGKRAPTPIREKEVTMCMRCQEPFNSITKR RHHCKACGHVVCGKCSEFRARLVYDNNRSNRVCTDCYVALHGVPGSSPACSQHTPQRRRS ILEKQASVAAENSVICSFLHYMEKGGKGWHKAWFVVPENEPLVLYIYGAPQDVKAQRSLP LIGFEVGPPEAGERPDRRHVFKITQSHLSWYFSPETEELQRRWMAVLGRAGRGDTFCPGP TLSEDREMEEAPVAALGATAEPPESPQTRDKT >gi568815575r:54346145_54595432|GENSCAN_predicted_CDS_2|2979_bp atgcatggccaccgagccccggggggcgccgggccttcggagcccgaacacccggccacg aacccgccgggcgccgctccgccggcctgtgccgactcggaccctggagcctcggaaccc ggactgctggcgcgcaggggctcaggttcggctcttggcggcccactggatccccagttt gtcggaccctcggacaccagcctgggcgctgctccaggccaccgggtcttgccctgcggt cccagtccacagcaccaccgggccctgcgcttctcttaccacctggagggctcgcagcct cggcctgggctgcaccagggaaaccggatcctggttaaaagtttgtcccttgaccctggc caaagcctagagcctcatccagaaggtccccagcggcttcgctcagacccaggtcccccg actgaaacccctagccagcgtccttcaccactgaagcgggcaccgggcccgaagccacag gtgcccccaaagcccagctacctgcagatgccccggatgccccccccactggagcccatc ccccctccaccatcacgcccactgcctgccgacccccgagtggccaagggcctggctccc agggcagaggccagccccagttctgcagcagtatcctcactgattgagaagtttgaaaga gagcctgtgattgtcgcctcggatagaccagtccctggccccagcccaggtcccccagag ccagtcatgttgccacagccaacctcgcagccaccagtgccccagctccccgagggtgag gcctcccgctgcctgtttctgctggctcctgggccccgggacggtgagaaggtgcccaac cgggacagcggcattgatagcatcagctcgccatccaacagcgaggagacctgcttcgtc agtgatgacgggccccccagccacagcctctgccctgggccccctgccctggctagtgtg cctgttgccttggccgacccccaccggcctggctcccaagaggttgacagtgacctggag gaggaggacgacgaggaggaggaggaagagaaggacagagaaatcccagtgcccctgatg gagagacaggagtctgtggagttgactgtgcagcaaaaggtgtttcacattgccaatgag ctcctgcaaactgagaaggcctacgtttccaggctccatctcctggatcaggtgttctgt gcccggctgctggaagaagctcggaaccgcagttccttcccggccgacgttgtccacggc atcttctctaacatctgctccatctattgcttccaccagcagttcctgctgcctgagcta gagaagcgcatggaggaatgggaccgctatccacgcattggagacatcctgcagaaactg gcccccttcctcaagatgtatggtgagtatgtgaagaactttgaccgggccgtggagctg gtcaacacctggacagagcgctccacccagtttaaagtcatcatccatgaggtgcagaag gaggaagcctgtggcaacctgacattgcagcaccacatgctggagcctgtgcagcgcatc ccccgctatgagcttcttctcaaggactatctgttaaagctgccccatggctccccggac agcaaggatgcccaaaagtctctggagctgatcgccacagcagcagagcactcgaatgct gccatccgcaaaatggagcgaatgcataagctgctgaaggtatatgagctgttagggggc gaggaggacattgtcagccccaccaaagagctcataaaagaaggccacatccttaagctg tcagcaaagaatgggaccactcaagaccgatacctcatactattcaacgaccgcctcctt tactgcgtgcccaggctgcggctccttggccagaagtttagcgtgcgggcacgcattgat gtagatggcatggaggtaagcatccaagaggtgggaaatgggaacttggagtggggcatt ggtgttgggaggaacaaagatgctggctttgaccccgagtctgtgtgtctaaaggagagc tccaacctcaatctgcctcgaaccttcctggtgtcaggaaagcagcgctccctcgagctc caggccaggactgaggaggagaagaaagactgggtccaggccatcaactccaccctcctg aagcatgaacagacgctggagactttcaaactgttgaactcaacaaacagggaagatgaa gacaccccacccaactctccaaacgtggatcttgggaagcgggcacctacgcccatccgg gaaaaggaagtcaccatgtgcatgcgctgccaggagcccttcaattctatcaccaaacgc aggcaccactgcaaggcctgcgggcatgtggtttgtgggaagtgctccgagttccgggcc cgcctcgtctatgacaacaaccgctccaaccgtgtgtgcactgattgctatgtggccttg cacggggtgcctgggagcagtccagcctgcagccagcatacaccccagcgccggaggtcc atcctggagaaacaggcctcagtggctgcagagaacagcgtcatctgcagcttcctgcac tacatggagaagggtggcaaaggatggcacaaggcatggttcgtggtccctgaaaatgaa cccttggtgctgtatatctacggagcccctcaggatgtgaaagcccagcgcagcctgccc ctcattggcttcgaggtgggaccgcccgaggcaggggagcggcctgacagaaggcatgtc ttcaagatcacccagagccacctcagctggtacttcagccctgagacagaggaactacag cgacgctggatggctgtgcttggccgggcgggccgaggggacacgttctgcccggggccc acactgtctgaggacagggagatggaggaggcaccggtggctgctttaggagccactgct gaaccccccgaatccccccagacccgagacaagacctag >gi568815575r:54346145_54595432|GENSCAN_predicted_peptide_3|601_aa MLRSKQEQTLIHEETVRNSLSPYEGENKKPGEGSKGHKKISWPYPQPAKQNGKKATSKVP SAPHFVHPNDHANREAELKKKWVEEMREKQQAAREQERQKRRTIESYCQDVLRRQEEFEH KEEVLQELNMFPQLDDEATRKAYYKEFRKVVEYSDVILEVLDARDPLGCRCFQMEEAVLR AQGNKKLVLVLNKIDLVPKEVVEKWLDYLRNELPTVAFKASTQHQVKNLNRCSVPVDQAS ESLLKSKACFGAENLMRVLGNYCRLGEVRTHIRVGVVGLPNVGKSSLINSLKRSRACSVG AVPGITKFMQEVYLDKFIRLLDAPGIVPGPNSEVGTILRNCVHVQKLADPVTPVETILQR CNLEEISNYYGVSGFQTTEHFLTAVAHRLGKKKKGGLYSQEQAAKAVLADWVSGKISFYI PPPATHTLPTHLSAEIVKEMTEVFDIEDTEQANEDTMECLATGESDELLGDTDPLEMEIK LLHSPMTKIADAIENKTTVYKIGDLTGYCTNPNRHQMGWAKRNVDHRPKSNSMVDVCSVD RRSVLQRIMETDPLQQGQALASALKNKKKMQKRADKIASKLSDSMMSALDLSGNADDGVG D >gi568815575r:54346145_54595432|GENSCAN_predicted_CDS_3|1806_bp atgcttcgaagcaaacaggaacagaccttaatccatgaagagacagtcaggaattccttg agcccctatgaaggtgaaaataaaaagccaggtgaaggttccaagggccacaagaagata agttggccctaccctcagcctgcaaagcaaaatgggaagaaagcaacctccaaagtgccc tctgcacctcattttgttcaccccaatgatcatgccaatcgagaggctgaattaaagaag aagtgggttgaggagatgagggagaagcagcaagccgcccgggagcaagaaagacaaaaa cgcaggaccattgagagctactgtcaggatgtcctaagacgccaggaggagtttgagcat aaggaggaagttttgcaggaattaaatatgtttcctcagctggatgacgaggccacgagg aaggcttattacaaggagttccgtaaggtggtggaatactctgatgtgattctggaagtc ctggatgccagagacccattaggctgccgctgcttccaaatggaggaggctgtcctgcga gcacaaggcaacaagaagctggtcctggtcttgaacaagattgacctggtccccaaggag gttgtggagaaatggctggattaccttcggaatgagttgccaaccgtggctttcaaggcc agtacccagcatcaggtcaaaaacctgaatcgttgcagtgtgccagtagatcaggcctct gagtcactgctgaaaagcaaagcctgctttggagctgaaaacctcatgagggttctgggg aactattgccgccttggtgaagtgcgcacccacattcgtgtgggtgttgtgggtcttccc aatgttgggaagagcagcctgatcaatagcctgaagcgcagccgcgcatgcagcgtggga gctgttcctggaattaccaaattcatgcaggaggtctacctggacaagttcatccggctc ttggatgctccaggcattgtcccagggcccaactcagaggtgggcaccatcctgcgtaac tgcgtccacgtgcagaagctggcagaccctgtgaccccagtggagaccatcctgcagcgc tgcaacctggaggagatttccaactattatggcgtctctgggttccagaccactgagcac tttctgacggcagtggcccaccgtttggggaagaagaagaagggaggcttatatagtcag gaacaggcggccaaagctgtcctagctgactgggtgagcgggaagatcagcttctatata ccaccaccagccactcacactctgcccacccatctcagtgctgagatcgttaaggaaatg accgaggtctttgacatcgaggatactgagcaggccaatgaagacaccatggaatgcttg gccaccggagaatctgatgagctgttgggtgacacggacccacttgaaatggagatcaag ttgctccattctccgatgacgaaaatagcagatgccattgaaaataaaaccaccgtgtat aagattggagatctcactgggtattgcaccaatccgaaccgtcatcagatggggtgggct aaacgcaatgtggaccaccgccctaagagcaacagtatggtggatgtctgctcagtggac cgccgctcagtgctgcagaggatcatggagacggaccccctgcaacagggccaggctctg gcatctgccctgaaaaataagaagaagatgcagaaacgtgcagataaaatcgccagcaag ctgtctgattccatgatgtctgctctcgacctctctggcaatgctgatgatggtgttggt gactaa >gi568815575r:54346145_54595432|GENSCAN_predicted_peptide_4|83_aa MGFHHVSQACLELLTSEGFPTTSIAITHAMNTAQRFENLPTYLVHCCHSWYLSKPSGGQE SACLDTLTLVAAYAPMEPKDRHA >gi568815575r:54346145_54595432|GENSCAN_predicted_CDS_4|252_bp atggggtttcaccatgttagccaggcttgtctcgaactgctgacctcagagggttttccc accacttcgattgctatcacccatgccatgaacactgcccaacggtttgagaacctgccc acctacctggtccactgctgccattcctggtacctgagcaagccatctggaggtcaagag tcagcctgcctggacacactaacactagtggcagcatatgctcccatggagcccaaggac aggcatgcttag