GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:30:35 Sequence gi568815588r:102576897_102805490 : 228594 bp : 48.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15686 15828 143 0 2 92 94 108 0.969 11.87 1.02 Intr + 16740 16825 86 2 2 40 73 139 0.997 6.22 1.03 Intr + 17097 17169 73 0 1 118 55 35 0.907 2.61 1.04 Intr + 20244 20397 154 2 1 44 81 278 0.855 22.35 1.05 Intr + 22537 22648 112 2 1 46 105 138 0.936 10.74 1.06 Intr + 38372 38506 135 2 0 85 105 189 0.995 19.98 1.07 Intr + 40394 40532 139 2 1 97 52 98 0.940 7.57 1.08 Intr + 41891 41978 88 1 1 64 94 7 0.395 -1.56 1.09 Intr + 49080 49181 102 1 0 97 80 18 0.696 2.05 1.10 Intr + 50279 50347 69 0 0 112 74 15 0.701 1.65 1.11 Intr + 50586 50761 176 1 2 93 56 93 0.907 6.26 1.12 Term + 53170 53259 90 0 0 92 38 120 0.886 5.12 1.13 PlyA + 53349 53354 6 1.05 2.00 Prom + 55909 55948 40 -0.06 2.01 Init + 56838 56937 100 2 1 81 94 39 0.210 2.26 2.02 Intr + 67063 67247 185 2 2 79 64 -9 0.140 -4.69 2.03 Intr + 67500 68295 796 2 1 48 19 1295 0.714 109.64 2.04 Intr + 72271 72382 112 2 1 -15 58 122 0.042 -1.66 2.05 Intr + 73969 74044 76 1 1 112 68 -25 0.066 -2.58 2.06 Intr + 77757 77909 153 2 0 80 92 234 0.227 23.27 2.07 Intr + 78184 78417 234 0 0 106 86 555 0.999 54.99 2.08 Intr + 79210 79241 32 0 2 136 115 48 0.999 9.23 2.09 Intr + 79374 79489 116 0 2 77 100 195 0.974 19.69 2.10 Term + 79851 80458 608 1 2 79 42 533 0.999 42.48 2.11 PlyA + 81183 81188 6 1.05 3.14 PlyA - 81359 81354 6 1.05 3.13 Term - 88611 88543 69 0 0 135 50 12 0.245 0.04 3.12 Intr - 91938 91913 26 1 2 72 96 30 0.217 -0.06 3.11 Intr - 93148 93076 73 1 1 111 34 44 0.106 0.28 3.10 Intr - 99300 99245 56 1 2 75 73 39 0.150 -0.20 3.09 Intr - 100702 100545 158 0 2 75 70 61 0.797 2.65 3.08 Intr - 101330 101184 147 1 0 87 66 72 0.887 4.25 3.07 Intr - 102710 102637 74 2 2 88 107 34 0.816 3.50 3.06 Intr - 109105 108920 186 1 0 73 94 162 0.954 15.19 3.05 Intr - 113047 112997 51 1 0 130 99 13 0.979 5.70 3.04 Intr - 122593 122477 117 1 0 114 89 54 0.977 8.76 3.03 Intr - 128593 128450 144 1 0 45 99 129 0.942 10.18 3.02 Intr - 137606 137432 175 1 1 46 57 85 0.230 1.14 3.01 Init - 145150 145101 50 1 2 61 75 71 0.290 3.52 3.00 Prom - 147139 147100 40 -8.56 4.00 Prom + 148148 148187 40 -5.66 4.01 Init + 149741 149901 161 1 2 93 94 154 0.196 15.81 4.02 Intr + 150091 150261 171 1 0 75 75 285 0.991 24.96 4.03 Intr + 151535 151633 99 2 0 100 91 123 0.992 13.03 4.04 Intr + 152423 152498 76 2 1 107 86 89 0.999 10.12 4.05 Intr + 152827 152912 86 0 2 108 73 100 0.980 9.12 4.06 Intr + 154827 154887 61 0 1 61 88 50 0.955 1.04 4.07 Intr + 155256 155322 67 2 1 101 43 45 0.829 -0.22 4.08 Intr + 155963 156012 50 0 2 113 91 33 0.884 4.50 4.09 Term + 158966 159151 186 1 0 118 43 67 0.808 2.69 4.10 PlyA + 160259 160264 6 1.05 5.00 Prom + 160441 160480 40 -4.36 5.01 Init + 167185 167247 63 0 0 86 110 123 0.578 13.46 5.02 Intr + 186745 186878 134 0 2 50 106 25 0.067 -0.06 5.03 Intr + 188697 188793 97 0 1 11 97 77 0.131 0.91 5.04 Term + 196348 196371 24 0 0 106 40 28 0.086 -1.98 5.05 PlyA + 196828 196833 6 1.05 6.05 PlyA - 197032 197027 6 1.05 6.04 Term - 205856 205824 33 2 0 115 43 32 0.518 -1.01 6.03 Intr - 209042 208907 136 1 1 53 89 75 0.617 4.67 6.02 Intr - 218537 218439 99 1 0 124 57 65 0.649 6.23 6.01 Init - 226036 226023 14 1 2 59 115 22 0.684 1.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 14871 14886 16 2 1 77 85 8 0.883 -0.39 S.002 Intr + 149716 149901 186 1 0 120 94 150 0.803 17.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:102576897_102805490|GENSCAN_predicted_peptide_1|455_aa XNTFCSGDHVSWHSPLDNSESRIQHMLLTEDPQMQPVQTPFGVVTFLQIVGVCTEELHSA QQWNGQGILELLRTVPIAGGPWLITDMRRGETIFEIDPHLQERVDKGIETDGSNLSGVSA KCAWDDLSRPPEDDEDSRSICIGTQPRRLSGKDTEQIRETLRRGLEINSKPVLPPINPQR QNGLAHDRAPSRKDSLESDSSTAIIPHELIRTRQLESVHLKFNQESGALIPLCLRGRLLH GRHFTYKSITGDMAITFVSTGVEGAFATEEHPYAAHGPWLQPCPGLLRLPLWRTGREQAC LLGGAAEEAGGRVSPQRSPSPGSSPHYLSSFVPYLGAQGVREKTDSVDRRVCRENVGGFR RFDFSRGTLVRTPSKSSELPAPESLIEEEARAFRLVRVLPSPCCSLADSEQNAHVCCQRT FRWGVLFKLPKEYSWPEKKLKVSILPDVVFDSPLH >gi568815588r:102576897_102805490|GENSCAN_predicted_CDS_1|1368_bp nagaacaccttctgcagtggggaccatgtgtcctggcacagccctttggataacagtgag tcaagaattcagcacatgctgctgacagaggacccacagatgcagcccgtgcagacaccc tttggggtagttaccttcctccagatcgttggtgtctgcactgaagagctacactcagcc cagcagtggaacgggcagggcatcctggagctgctgcggacagtgcctattgctggcggc ccctggctgataactgacatgcggaggggagagaccatatttgagatcgatccacacctg caagagagagttgacaaaggcatcgagacagatggctccaacctgagtggtgtcagtgcc aagtgtgcctgggatgacctgagccggccccccgaggatgacgaggacagccggagcatc tgcatcggcacacagccccggcgactctctggcaaagacacagagcagatccgggagacc ctgaggagaggactcgagatcaacagcaaacctgtccttccaccaatcaaccctcagcgg cagaatggcctcgcccacgaccgggccccgagccgcaaagacagcctggaaagtgacagc tccacggccatcattccccatgagctgattcgcacgcggcagcttgagagcgtacatctg aaattcaaccaggagtccggagccctcattcctctctgcctaaggggcaggctcctgcat ggacggcactttacatataaaagtatcacaggtgacatggccatcacgtttgtctccacg ggagtggaaggcgcctttgccactgaggagcatccttacgcggctcatggaccctggtta caaccatgtcccgggctgctgcgtctgcccctgtggagaacagggagggagcaggcctgt ctcctgggcggagctgctgaggaggctggagggagggtttcccctcagaggagcccctca cctggatcttcaccccattatctttcttcctttgtgccctacctgggtgctcaaggtgta cgggagaagacagattctgttgaccgaagagtttgtagagaaaatgttggaggatttaga agatttgacttctccagaggaacccttgtaaggacacccagcaaatcgtctgaattgcca gctcctgagagcctcattgaagaggaagccagggccttcaggctagtacgagttctcccc agcccctgctgttccctggcagattctgaacagaacgcccatgtctgttgccagagaaca ttccgctggggagttctgttcaaacttcccaaagagtacagctggcctgaaaagaagctg aaggtctccatcctgcctgacgtggtgttcgacagtccgctacactag >gi568815588r:102576897_102805490|GENSCAN_predicted_peptide_2|803_aa MRRWEKTLSLPLHPSSILLSPKTPFALGRGAQTAPRSRPARAPQPAPGQPAGGRLHRGPG TAGLPLPPRPDLAALGAAALASWPRRPRAPAPAREALGRRRRRSPLRARREDRGSLLRGA GTRKALLTERPSGPAGAGARGSVGLQRLRTDPRGWGVGEACPGPLPAAAMAENWKNCFEE ELICPICLHVFVEPVQLPCKHNFCRGCIGEAWAKDSGLVRCPECNQAYNQKPGLEKNLKL TNIVEKFNALHVEKPPAALHCVFCRRGPPLPAQKVCLRCEAPCCQSHVQTHLQQPSTARG HLLVEADDVRAWSCPQHNAYRLYHCEAEQVAVCQYCCYYSGAHQGHSVCDVEIRRNEIRA IAQPCLPAGLAERLRVEVECAFLTVGSTTVLLCECPLSIFQCLGNLICSFGELGWGREGD GRQKMLMKQQDRLEEREQDIEDQLYKLESDKRLVEVAKPKAMLRGGGGLSAGLWEKVNQL KEEVRLQYEKLHQLLDEDLRQTVEVLDKAQAKFCSENAAQALHLGERMQEAKKLLGSLQL LFDKTEDVSFMKNTKSVKILMDRTQTCTSSSLSPTKIGHLNSKLFLNEVAKKEKQLRKML EGPFSTPVPFLQSVPLYPCGVSSSGAEKRKHSTAFPEASFLETSSGPVGGQYGAAGTASG EGQSGQPLGPCSSTQHLVALPGGAQPVHSSPVFPPSQYPNGSAAQQPMLPQYGGRKILVC SVDNCYCSSVANHGGHQPYPRSGHFPWTVPSQEYSHPLPPTPSVPQSLPSLAVRDWLDAS QQPGHQDFYRVYGQPSTKHYVTS >gi568815588r:102576897_102805490|GENSCAN_predicted_CDS_2|2412_bp atgaggcggtgggaaaagaccctctccctgccactacatccctcttctatcctcctctca ccaaaaaccccttttgccttaggaagaggggcccagacagcgccccggagtcggcccgcc cgcgccccgcagcccgcgcccggccagccagccgggggacgcctgcaccgtggcccgggg accgccggcctgcccctcccgccccgtccggatctagcagccctcggcgcggccgccctc gcctcctggccccgcagaccccgggctccggcccctgcgagggaggcgctcgggaggagg aggagacgcagcccgctgcgcgcgcggcgtgaggaccgcggctccctcctccggggggcg ggcacgcggaaggcgctgctgactgagcgaccgtcggggccggctggggccggagctcgg ggctcggtgggcctacagcggctccggacggacccccggggctggggagtcggggaggcc tgccccggccccctgcccgcggccgccatggcggagaattggaagaactgcttcgaggag gagctcatctgccctatctgcctgcacgttttcgtggagccagtgcagctgccgtgcaaa cacaacttctgccggggctgcatcggcgaggcgtgggccaaggacagcggcctcgtacgc tgcccagagtgcaaccaggcctacaaccagaagccgggcctggagaagaacctgaagctc accaacatcgtggagaagttcaatgccctgcacgtggagaagccgccggcggcgctgcac tgcgtgttctgccgccgcggccccccgctgcccgcgcagaaggtctgcctgcgctgcgag gcgccctgctgccagtcccacgtgcagacgcacctgcagcagccctccaccgcccgcggg cacctcctggtggaggcggacgacgtgcgggcctggagctgcccgcagcacaacgcctac cgcctctaccactgcgaggccgagcaggtggccgtgtgccagtactgctgctactacagc ggcgcgcatcagggacactcggtgtgcgacgtggagatccgaaggaatgaaatccgggca attgcgcagccgtgtctgcctgccgggcttgctgagcgcctgcgagttgaagttgagtgt gccttcctgacagttggcagcactactgtgctgctgtgtgagtgcccgctcagcattttc cagtgtcttggcaacctgatttgctcattcggtgaactggggtggggtcgtgaaggggac ggcaggcagaagatgctcatgaagcagcaggaccggctggaggagcgagagcaggacatt gaggaccagctgtacaaactcgagtcagacaagcgcctggtggaggtagctaagcccaag gccatgctgcggggtgggggtggcttgagcgcagggctttgggagaaagtgaaccaactg aaggaggaagttcggctgcagtacgagaagctgcaccagctgctggacgaggacctgcgg cagacagtggaggtcctagacaaggcccaggccaagttctgcagcgagaacgcagcgcag gcgctgcacctcggggagcgcatgcaggaggccaagaagctgctgggctccctgcagctg ctctttgataagacggaggatgtcagcttcatgaagaacaccaagtctgtgaaaatcctg atggacaggacccagacctgcacgagcagcagcctttcccccactaagatcggccacctg aactccaagctcttcctgaacgaagtggccaagaaggagaagcagctgcggaaaatgcta gaaggccccttcagcacgccggtgcccttcctgcagagtgtccccctgtacccttgcggc gtgagcagctctggggcggaaaagcgcaagcactcaacggccttcccagaggccagtttc ctagagacgtcgtcgggccctgtgggcggccagtacggggcggcgggcacagccagcggt gagggccagtctgggcagcccctggggccctgcagctccacgcagcacttggtggccctg ccgggcggcgcccaaccagtgcactcaagccccgtgttccccccatcgcagtatcccaat ggctccgccgcccagcagcccatgctcccccagtatggcggccgcaagattctcgtctgt tctgtggacaactgttactgttcttccgtggccaaccatggcggccaccagccctacccc cgctccggccactttccctggacagtgccctcgcaggagtactcacacccgctcccgccc acaccctccgtcccccagtcccttcccagcctggcggtcagagactggcttgacgcctcc cagcagcccggccaccaggatttctacagggtgtatgggcagccgtccaccaaacactac gtgacgagctaa >gi568815588r:102576897_102805490|GENSCAN_predicted_peptide_3|441_aa MQGDQLDENPSNPGKKRRAGGGLCSSRSPPTAWAGALAHLRTLGKAELHPAPYCCSAPQL CVRGTSGAAPQQLPLGLLSILRKLKSAPDQEVRILLLGLDNAGKTTLLKQLASEDISHIT PTQGFNIKSVQSQGFKLNVWDIGGQRKIRPYWKNYFENTDILIYVIDSADRKRFEETGQE LAELLEEEKLSCVPVLIFANKQDLLTAAPASEIAEGLNLHTIRDRVWQIQSCSALTGEGV QGASQLLFKLGAEPFTRQVPTSVENWCCPPSGFRRHRPHLGKGQMSALGQPSDYLVLLPK CTDSNWQSCNHCKFSLLREGAFKVIVPLREPSPQSLWMLVLKAIRSGPDPFKPIWSAAGR EVNAHRPASHTEKQILMTFDCNDIWNDPEDPALGSQSQLLLWMPPVEGAGAAAIPYARRV DQKFIPPLHPVLGTPLPAFYF >gi568815588r:102576897_102805490|GENSCAN_predicted_CDS_3|1326_bp atgcagggagaccagctggacgagaaccccagtaatccaggcaagaagaggcgagccggc ggcggcctctgcagctcccgtagcccgcccactgcgtgggcgggcgccttagcgcacttg cgcactctggggaaagcggagctgcaccccgccccgtattgctgctcagctcctcagctg tgcgtgcgagggacgtcgggggcggcgccgcagcagttgcccctgggcttgctctcaatt ttgcgcaagttgaaaagtgcaccagaccaggaggtgagaatacttctcctgggcttggat aatgctggcaagaccactcttctgaagcagcttgcatctgaagacatcagccacatcaca cctacacagggtttcaacatcaaaagtgtacaatcacaaggttttaaactgaatgtatgg gacattggtggacagaggaaaatcagaccatactggaagaattattttgaaaataccgat attcttatatatgtaatcgacagtgcagacagaaaaagatttgaagagacgggtcaggaa ctagcggaattactggaggaagaaaaactaagttgtgtgccagtgctcatctttgctaat aagcaggatttgctcacagcagcccctgcctctgaaattgcagaaggactgaacctgcat accatccgcgaccgagtctggcagatccagtcttgctcagctctcacaggagagggcgtt cagggtgcttcccagctcctctttaagcttggggcagaacccttcacccggcaggtaccc acatccgtggagaactggtgctgtccaccttcgggctttcggcgccaccggcctcatctt ggcaaaggtcagatgtcggctctgggacagccgtcagattatctagttctgctgcccaag tgcactgacagcaactggcagagttgtaatcactgtaaattcagccttctaagagagggc gctttcaaggtcattgtgcctttaagggaaccttctccccagtccctctggatgctggtg cttaaagctataagaagcgggccagacccttttaaacctatctggagtgctgcaggaaga gaagtaaatgcacatcgccctgccagccacacagagaagcaaattcttatgacttttgac tgtaatgacatctggaatgatccagaagaccctgccctggggtcccagagtcagctgctg ctctggatgccaccagtagagggggcaggagctgctgccatcccctatgccaggagggtg gaccagaagttcataccacctttacacccagttcttggaacacccctccctgcattctac ttctga >gi568815588r:102576897_102805490|GENSCAN_predicted_peptide_4|318_aa MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSERELDWAKVMVEKSRMGVVPP GTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMIITGFMLQFYRTMPAVIFWQ WVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNMLTKKAPPLVGRWVP FAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMSAPGM ILLPVIMERLEKLHFMQPHLHGASGVWAFPTEMVSAVHSLLGLIDHAASALPDLPAVQKE TGKGQEGQHLSEDKCPSA >gi568815588r:102576897_102805490|GENSCAN_predicted_CDS_4|957_bp atggaggctgacctgtctggctttaacatcgatgccccccgttgggaccagcgcaccttc ctggggagagtgaagcacttcctaaacatcacggacccccgcactgtctttgtatctgag cgggagctggactgggccaaggtgatggtggagaagagcaggatgggggttgtgccccca ggcacccaagtggagcagctgctgtatgccaagaagctgtatgactcggccttccacccc gacactggggagaagatgaatgtcatcgggcgcatgtctttccagcttcctggcggcatg atcatcacgggcttcatgctccagttctacaggacgatgccggcggtgatcttctggcag tgggtgaaccagtccttcaatgccttagtcaactacaccaacaggaatgcggcttccccc acatcagtcaggcagatggccctttcctacttcacagccacaaccactgctgtggccacg gctgtgggcatgaacatgttgacaaagaaagcgccgcccttggtgggccgctgggtgccc tttgccgctgtggctgcggctaactgtgtcaatatccccatgatgcgacagcaggagctc ataaagggaatctgcgtgaaggacaggaatgaaaatgagattggtcattcccggagagct gcggccataggcatcacccaagtagttatttctcggatcaccatgtcagctcctgggatg atcttgctgccagtcatcatggaaaggcttgagaaattgcacttcatgcagcctcatctt catggtgccagtggcgtgtgggcttttcccacagaaatggtatctgctgttcattcctta cttggtttaattgaccatgctgccagcgcgctccctgatctgcctgctgtccagaaggag acagggaaggggcaggaagggcagcacctgtctgaggacaagtgtccatcagcctaa >gi568815588r:102576897_102805490|GENSCAN_predicted_peptide_5|105_aa MALLLLQALPSPLSARAEPPQPETLSPIQGHFTFSDLQYIHLGSRDSNILLPAVFQLIEG AAKMDGACFVQGIVLGPVRDMEVNVSHLTPKLLTLEEKISALSER >gi568815588r:102576897_102805490|GENSCAN_predicted_CDS_5|318_bp atggcgctcctgctcctccaggcgctgcccagccccttgtcagccagggctgaacccccg cagccagaaacccttagtccaatccaaggtcactttacattttcagacctccagtatatt catctgggaagcagggatagcaatatcctgttaccagctgtattccagctgatagaagga gcagctaaaatggatggggcctgctttgtgcaaggcatcgtgcttggtcctgtgagggat atggaagtgaacgtttcacatcttacccctaagttgcttactttagaggagaagatatca gcgcttagtgaacgttag >gi568815588r:102576897_102805490|GENSCAN_predicted_peptide_6|93_aa MPDVWESRTVEASFNPVFLKQFLTQYGSFSAYDGVMFWACRPVESGRLQRHWTLVHGPLF HPWTLRRKSSPQEFLITGLFCYRDFITISSNQQ >gi568815588r:102576897_102805490|GENSCAN_predicted_CDS_6|282_bp atgcctgatgtatgggagagcagaacagtggaagcatcatttaatccagtgttcctcaaa cagtttttaacacagtacggaagtttctcggcttacgatggggtcatgttctgggcatgc aggcctgtggagtctggcaggctacagaggcactggaccctggttcatggaccactcttt cacccctggacacttagaaggaaaagcagtccccaggaattcctcatcactggcctcttc tgctacagagattttatcaccatttccagcaaccagcaatag