GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:46:49 Sequence gi568815597f:112973407_113224098 : 250692 bp : 43.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 25652 25703 52 1 1 70 92 49 0.026 4.91 1.02 Intr + 40744 40795 52 2 1 49 116 35 0.004 0.37 1.03 Intr + 67240 67471 232 1 1 2 81 245 0.010 12.98 1.04 Term + 83765 83872 108 1 0 116 38 43 0.251 0.61 1.05 PlyA + 84138 84143 6 1.05 2.00 Prom + 92033 92072 40 -5.36 2.01 Init + 100001 100239 239 1 2 95 76 177 0.269 12.80 2.02 Intr + 117912 117977 66 0 0 104 84 26 0.764 1.82 2.03 Intr + 119800 119874 75 1 0 108 87 43 0.944 4.83 2.04 Intr + 122468 122611 144 2 0 89 93 106 0.999 10.60 2.05 Intr + 122816 122959 144 2 0 97 87 64 0.952 6.60 2.06 Intr + 127014 127082 69 0 0 112 93 -5 0.531 0.70 2.07 Intr + 133772 133835 64 2 1 50 85 77 0.569 2.32 2.08 Intr + 136856 137156 301 1 1 30 91 155 0.262 6.31 2.09 Intr + 139073 139354 282 0 0 63 84 204 0.955 14.89 2.10 Intr + 141180 141470 291 1 0 65 111 170 0.397 14.11 2.11 Intr + 145827 146117 291 1 0 70 79 195 0.967 13.91 2.12 Term + 153455 153543 89 0 2 74 44 58 0.117 -2.18 2.13 PlyA + 154652 154657 6 1.05 3.00 Prom + 166231 166270 40 -4.16 3.01 Sngl + 195797 195997 201 1 0 52 42 225 0.673 9.48 3.02 PlyA + 196016 196021 6 1.05 4.00 Prom + 205513 205552 40 -4.76 4.01 Init + 225504 225797 294 2 0 66 -19 724 0.894 56.69 4.02 Term + 228792 228878 87 2 0 113 48 73 0.858 3.46 4.03 PlyA + 229274 229279 6 1.05 5.05 PlyA - 230164 230159 6 1.05 5.04 Term - 236862 236773 90 0 0 109 43 56 0.916 0.92 5.03 Intr - 237234 236907 328 2 1 95 115 86 0.972 7.70 5.02 Intr - 239839 239779 61 2 1 58 115 43 0.072 1.79 5.01 Intr - 250391 250291 101 1 2 97 40 70 0.432 2.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 117715 117728 14 0 2 82 109 12 0.887 2.50 S.002 Term + 215213 215275 63 1 0 107 54 81 0.887 4.49 S.003 Init - 239813 239779 35 2 2 84 115 30 0.821 4.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:112973407_113224098|GENSCAN_predicted_peptide_1|147_aa MCTDSPGAGTTRASAELYHIGKRMNIQKLMFNSGRIVNSMEDNPDRNIMTVWKHYTIEDI IIVTEKAMKAIKPKTINSCWRKLCPDVVHDLSGYMTEAIKEIMKEIVDVPKKFITSCLSS RFPVAGQLELLQEDKQEGCSCQCRLPC >gi568815597f:112973407_113224098|GENSCAN_predicted_CDS_1|444_bp atgtgcaccgactcccctggagctggcaccaccagggcgtctgcagagctatatcacatt gggaaacgcatgaacatccaaaagttaatgtttaactctggaaggattgtcaacagtatg gaagataaccctgataggaacatcatgacagtctggaagcattacaccattgaagacatc atcattgttacagaaaaagctatgaaagccatcaagcccaaaacaataaattcctgctgg agaaaactgtgtccagatgttgtgcatgacttatcaggatatatgacagaagcaatcaag gaaatcatgaaagagattgtggatgtcccaaaaaagttcattaccagctgcttaagcagc aggttcccagtggctggtcagctggaactgcttcaggaggacaaacaggaaggatgcagc tgccagtgcagattaccctgttag >gi568815597f:112973407_113224098|GENSCAN_predicted_peptide_2|684_aa MAPAPLGVPEEQLLGCRSRVLSRLLFIAQTALLLLPAAGAGLCPAPCSCRIPLLDCSRRK LPAPSWRALSGLLPPDTAILDFSHNRLSNWNISLESQTLQEVKMNYNELTEIPYFGEPTS NITLLSLELEHNNLTRVNKGWLYGLRMLQQLYVSQNAIERISPDAWEFCQRLSELDLSYN QLTRLDESAFVGLSLLERLNLGDNRVTHIADGVFRFLSNLQTLDLNNNAIMSIQENAFSQ THLKELTSTFSNEDITITTPKEINIDKIRTHPETIIALRGMNVTLTCTAVSSSDSPMSTV WRKDSEILYDVDTENFVRYWQQAGEALEYTSILHLFNVNFTDEGKYQCIVTNHFGSNYSQ KAKLTVNEMPSFLKTPMDLTIRTGAMARLECAAEGHPAPQISWQKDGGTDFPAARERRMH VMPEDDVFFIANVKIEDMGIYSCMAQNTAGGLSANASLTVLANQLLIIVDAGLEDAGKYT CIMSNTLGTERGHIYLNVISSPNCDSSQSSIGHEDDGWTTVGIVIIVVVCCVVGTSLIWV IVIYHMRRKNEDYSITNTGGTGTRVICSDCYDNANIYSRTREYCPYTYIAEEDVLDQTLS SLMVQMPKETYLVHPPQDTTALESLIPSANREPSAFPTNHERISEKKLPSTQMSGGYLQV ESFWNHLSIRLLHWATSIFFIVRI >gi568815597f:112973407_113224098|GENSCAN_predicted_CDS_2|2055_bp atggcgccggcgcccctaggcgtcccggaggagcagttgctggggtgtcgatctagagtg ctttctcggttactcttcattgcccagaccgctctcctcctgttgcccgccgccggagca ggtctctgccccgcgccctgctcctgccgcattcctctcctggactgcagtcgcaggaaa ttgcccgcaccgagctggagggcgctgtcgggcttgctgccccccgacaccgctatcctg gatttcagtcataatcggttgtctaactggaacatcagcttggaatcacaaacattacag gaagtgaaaatgaattacaatgaactaacagaaatcccgtattttggagaacctacatct aatattactctactttcattagaactggaacacaacaaccttacacgagtaaacaagggg tggttgtatggcttgcgaatgttacagcagctctatgtgagccagaatgctattgaaaga atcagccctgatgcatgggagttctgccaaagactatccgaacttgatttgtcctataac cagctgacccgcctggatgaatctgcctttgtgggtctgagcttattggagagattgaat ttaggagacaacagagtcactcatattgctgatggtgtatttagatttctttccaatctt cagacattagatttgaacaataatgctataatgtctatccaagaaaatgctttttctcag actcatttaaaagaactaacaagtacattttctaacgaagacatcaccattaccacaccc aaagaaattaacatcgataagataaggacacatcctgaaaccataattgctctaagaggc atgaatgtgactctgacgtgcactgcagtgagcagcagtgattcacccatgtccactgtg tggcgcaaagacagtgaaatcctgtatgacgtggatactgagaattttgttcgttattgg cagcaagctggagaagctctggaatatactagtatcttacatcttttcaatgtgaatttc acagatgaaggaaaatatcagtgtattgttactaatcactttggttctaattattctcag aaagccaaactgactgtaaatgagatgccatcttttctgaaaacgccaatggatctgact attcgcactggtgccatggccagattagaatgtgctgcagagggacaccctgcacctcag atttcctggcagaaagatggtggtactgactttcctgcggctcgagaaagacgcatgcac gtcatgcccgaggatgacgtcttctttattgccaatgtgaaaatagaagatatgggaatc tatagctgcatggcacaaaatacagcaggaggtctctcagcaaatgcttccctaacagtg ttagccaatcagcttctcatcattgtagatgccgggctagaagatgctgggaaatatacc tgcattatgtctaacacccttgggacagaacgtggccacatttacctaaatgtcatttca tcccccaattgtgactcttcccagagtagcattgggcatgaagatgatggctggaccaca gttggcattgtcatcattgttgtggtctgctgtgttgttggcacttctttgatctgggtc attgttatttaccacatgagaaggaaaaatgaagactatagtatcacaaacacaggtggc actggtacccgggtgatttgctcagattgttatgacaatgccaacatctactccaggacc cgagaatactgtccatacacctatattgctgaggaggacgttcttgatcagacactgtcc agcctcatggtccaaatgcctaaagagacatatttagtacatcctccccaggatactact gccctagagagcctgataccgtcagccaacagagagccatctgcctttcccaccaaccat gagaggataagtgagaagaaacttccctccacacagatgagcggtggttatttgcaagta gaaagcttctggaaccatttgtccatccgacttctgcactgggccacatccatcttcttc attgtcagaatctga >gi568815597f:112973407_113224098|GENSCAN_predicted_peptide_3|66_aa MGDIKDGIMPSHFSRGSKSVARQVLQALEGLKMVEKGQDGGHKLIPQGQRDVDRITGEVA ADNKKH >gi568815597f:112973407_113224098|GENSCAN_predicted_CDS_3|201_bp atgggggacatcaaagatggcatcatgcccagccacttcagtcgaggctccaagagtgtg gcccgccaggttctccaagccctggaggggctgaaaatggtggaaaagggccaagatggg ggccacaaactaatacctcagggacagagagatgtggacagaatcaccggagaggtggca gctgacaacaagaagcattag >gi568815597f:112973407_113224098|GENSCAN_predicted_peptide_4|126_aa MSYEQLIQLYSAHQWCRLNWGLRRKQHSLLKRLRKAKTAVPPMEKPEVVKTHLRDMIIVP EMVGSMVGVYNGKTFNQVEVKPEMISHYLGEFSITYKPMKSTPGEDAVTIVKMTTKVLEY YRSLKQ >gi568815597f:112973407_113224098|GENSCAN_predicted_CDS_4|381_bp atgtcctacgagcagctgatacagctgtacagtgcgcaccagtggtgccggctgaactgg ggcctgcggcggaagcagcactctctgctgaagcgcctgcgcaaggccaagacggcggtg ccgcccatggagaagccggaagtggtgaagactcacctgcgggacatgatcatcgtgccc gagatggtgggcagcatggtgggcgtctacaacggcaagaccttcaaccaggtggaggtc aagccggagatgatcagccactacctgggcgagttctccatcacctacaagcccatgaaa tccactcctggtgaagatgctgtgaccattgttaaaatgaccacaaaggttttagaatat tacagaagcctaaagcagtga >gi568815597f:112973407_113224098|GENSCAN_predicted_peptide_5|193_aa XDQGWMQLASRSLRKLRSPWCGLEVTRSTWALQCLHDQLTLAVMGWTHTAVKLDRKKRTL SDFQDETHLWAKAKKKGNHCCSMGKDKEVFPEDMPHPQFCSLAAVSLIPHHAVPGESNQL KQMLMELKAAPISSSHASKSSPLADSHIFTLANDCTWAPALPYQVPVSDSDINSVGENEK WVPQCYKTLKTEK >gi568815597f:112973407_113224098|GENSCAN_predicted_CDS_5|582_bp nnggatcagggctggatgcagttagcttctaggagcttgcgcaagctcagatctccctgg tgtgggctggaggtgacccgctccacgtgggccctgcagtgcttgcatgaccagctgact cttgcagtgatgggctggacacatactgctgtgaagctggacaggaaaaagagaacccta agtgacttccaagatgagacccacctttgggctaaagcaaagaaaaaggggaaccattgt tgctctatggggaaagataaggaggtctttccggaagacatgccacacccacagttttgc agtctggctgctgttagtctcattcctcaccacgctgtacctggagagtctaatcagttg aagcaaatgcttatggagctaaaggcggcacccatctcctcctcccatgcttcaaagtca tcaccactagctgactcgcatatatttacactggccaatgattgtacctgggcaccagcc ctgccctaccaggttcctgtttctgactctgatataaattctgtgggcgaaaatgagaaa tgggttccacaatgctacaagactctcaaaactgaaaaataa