GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:12:04 Sequence gi568815590f:8903064_9130031 : 226968 bp : 41.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 433 493 61 1 1 122 111 26 0.607 6.12 1.02 Intr + 2268 2393 126 2 0 25 92 61 0.156 0.16 1.03 Intr + 17050 17121 72 0 0 78 78 39 0.002 0.58 1.04 Intr + 21298 21355 58 0 1 88 92 66 0.442 4.54 1.05 Term + 26852 26961 110 0 2 96 41 101 0.468 3.89 1.06 PlyA + 28910 28915 6 1.05 2.04 PlyA - 30635 30630 6 1.05 2.03 Term - 30990 30735 256 2 1 5 39 232 0.768 4.07 2.02 Intr - 32713 32603 111 0 0 69 55 126 0.576 5.98 2.01 Init - 45749 45679 71 2 2 68 15 84 0.083 -0.33 2.00 Prom - 49022 48983 40 -3.85 3.00 Prom + 50402 50441 40 -4.95 3.01 Init + 58131 58191 61 2 1 86 97 86 0.978 10.76 3.02 Intr + 58697 58981 285 0 0 47 52 344 0.564 23.09 3.03 Intr + 60835 61075 241 2 1 50 27 101 0.010 -4.11 3.04 Intr + 70997 71204 208 2 1 45 54 175 0.063 7.96 3.05 Intr + 75934 76142 209 0 2 68 27 122 0.064 1.05 3.06 Intr + 92178 92379 202 0 1 100 64 189 0.342 16.07 3.07 Intr + 99974 100267 294 1 0 -26 63 256 0.005 7.98 3.08 Intr + 104907 105085 179 2 2 94 59 155 0.995 11.00 3.09 Intr + 108479 108689 211 2 1 67 92 152 0.984 11.49 3.10 Intr + 113259 113342 84 2 0 140 29 48 0.786 3.20 3.11 Intr + 115234 115343 110 0 2 101 106 7 0.944 1.96 3.12 Intr + 117287 117401 115 2 1 85 93 36 0.957 3.33 3.13 Intr + 118032 118163 132 2 0 83 93 62 0.978 6.12 3.14 Intr + 141241 141411 171 0 0 14 96 159 0.982 8.52 3.15 Term + 143249 143335 87 1 0 102 53 65 0.795 1.08 3.16 PlyA + 143982 143987 6 1.05 4.04 PlyA - 144723 144718 6 1.05 4.03 Term - 149443 149357 87 1 0 38 49 153 0.394 3.08 4.02 Intr - 153192 153031 162 0 0 79 39 83 0.640 1.75 4.01 Init - 155273 155130 144 2 0 107 66 108 0.404 10.67 4.00 Prom - 155990 155951 40 -5.85 5.02 PlyA - 156650 156645 6 1.05 5.01 Sngl - 157309 156698 612 1 0 90 40 186 0.670 9.95 5.00 Prom - 160216 160177 40 -3.75 6.12 PlyA - 161793 161788 6 1.05 6.11 Term - 163231 162924 308 2 2 3 43 238 0.297 5.09 6.10 Intr - 167692 167615 78 2 0 78 85 48 0.765 2.10 6.09 Intr - 168023 167884 140 1 2 76 89 105 0.703 8.59 6.08 Intr - 175953 175917 37 1 1 83 92 30 0.013 -0.70 6.07 Intr - 187165 187072 94 1 1 129 81 -1 0.069 1.92 6.06 Intr - 193474 193321 154 0 1 93 73 127 0.349 10.85 6.05 Intr - 199803 199652 152 0 2 -9 94 109 0.508 -0.16 6.04 Intr - 200634 200445 190 2 1 99 97 58 0.766 6.47 6.03 Intr - 205216 205092 125 1 2 20 62 129 0.683 1.96 6.02 Intr - 207288 207185 104 1 2 34 113 67 0.772 2.77 6.01 Init - 208829 208502 328 2 1 103 33 279 0.841 21.43 6.00 Prom - 210931 210892 40 -4.75 7.00 Prom + 210958 210997 40 -7.45 7.01 Init + 215327 215380 54 1 0 52 93 -12 0.326 -2.97 7.02 Term + 217900 218142 243 0 0 121 47 269 0.799 21.02 7.03 PlyA + 218375 218380 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 17115 16928 188 0 2 62 77 203 0.820 15.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_1|142_aa XPFSGSRSESTKKEPSTLLSPLFRICVNSQWLFNCSLSPPVVGCQEPFVHVTPHPCLRII RPEPPQQPAITNRSKGRSLSNFIRHCEAQYRVEAAKAWGFHPLKPQQLALTCCITQVLLV GHTKGVANYRKKIQKKEFPERI >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_1|429_bp nngcctttttctggatccaggagcgaatcaactaagaaagagcctagcacccttttaagt ccgctttttcgtatctgtgtaaattcacaatggctcttcaactgctccctgtctccccca gtagtgggctgtcaggagccatttgtccatgtgaccccacatccttgccttcgaataatt agacctgagcctccgcagcagccagccatcactaacagaagtaaggggagaagtcttagc aacttcattagacattgtgaggctcaataccgtgtagaagctgccaaggcttggggcttc caccctctgaagccacaacagttggccctcacatgctgtatcacccaggtccttctggtt gggcataccaagggagtagcgaattatcggaaaaaaattcagaagaaagaattcccagaa agaatttaa >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_2|145_aa MRMAVEKYDWTQPYDLMAVNWRETNSRHRSFQLPFSSAKDSVSAALDGVFGLPGSIRGTM RSWIWDERKWKKCGTTAAMVAVCGGLGRKKLTHLVTAAVSLTHSGTHTVLWRRGCSQYKQ VSSNEDLVFVGRNRKKSRKQLRELK >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_2|438_bp atgaggatggccgtggaaaagtatgattggacacagccgtatgatctaatggcagtaaac tggagagaaacgaactcccggcaccgctcctttcagcttcctttcagctctgccaaagat tctgtctcagcagctttggatggagttttcgggcttcctggcagcatccgcgggaccatg aggtcctggatttgggatgaaaggaagtggaagaaatgcggaaccacggccgctatggtt gctgtttgcggtggtctagggaggaagaagttgacacacctggtaacggctgctgtcagc cttacacattccgggactcacacggtgctttggagaagaggttgttcacaatataaacag gtatccagcaacgaggacctggtctttgtgggaagaaacagaaagaaatcacgaaagcaa ttaagagagctcaaataa >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_3|862_aa MVEGKEEQVTSYVDVQRACAGIRGAFEKPQGAVARVHIGQVIMSICTKLQNKEHVIEALC KANFKFPGRQNIHFSEKWDFTKFSVDEFEDMMAEKQLIPITVGSSIPPIVTLRTSGMESP CNTGCSYGNCYKTRSGFTRVGRRNLPRKPGARVMGVDVLLGPRKVGSDEEALKKSQYDDD AFMVKFCASSTEHFLNNPQTDAVLGQRYGRESKSPKCLNHKHRDLTPVTVETNALASDPY RSAFLPLNGAVRGELGCRTGGEKNVGKMSRCETGRSHPTSGSSGSGTSDCIQQGAREASE APAETQDWLPLPSSGDVSRHSAAPARGLLQLLPSGFDMTARQFHNAYLPYQRPLMMEMLW PVYRECTEDFRSSASPSDVRRLKTPHSDYANAAICVHVTHEEQLSSGVTAGMEDPQSKEP AGEAVALALLESPRPEGGEEPPRPSPEVRSRPAPGCWRQLPRPQSALGTPFFSGVPQKVQ LCALGPQPLPRCPAPWLYPRETQQCKFDGQETKGSKFITSSASDFSDPVYKEIAITNGCI NRMSKEELRAKLSEFKLETRGVKDVLKKRLKNYYKKQKLMLKESNFADSYYDYICIIDFE ATCEEGNPPEFVHEIIEFPVVLLNTHTLEIEDTFQQYVRPEINTQLSDFCISLTGITQDQ VDRADTFPQVLKKVIDWMKLKELGTKYKYSLLTDGSWDMSKFLNIQCQLSRLKYPPFAKK WINIRKSYGNFYKLYYLSVRALSVHTPVTMESVHNFPLDVIIYFRGCLLIPLLSIQTRIS DPGSWNGAEIVERSEVTLVWESGGCEEQKGSQRPGQQTHHCVSSVGEFTLAPIQPGSKSQ SAERDMSSTYARLPRGDEAEQP >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_3|2589_bp atggtggaaggcaaggaggagcaagtcacatcttacgtggatgtccagagagcttgtgca ggtattcgaggtgcctttgaaaagccccagggtgctgtggccagggttcacattggccaa gtcatcatgtccatctgcaccaagctgcagaacaaggagcatgtgattgaggccctatgc aaggccaactttaagttccctggccgccagaatatccacttctcagagaagtgggacttt accaagttcagtgtggatgaatttgaagacatgatggctgagaagcagcttatcccaata actgtggggtcaagtatacccccaatcgtgaccctccggacaagcgggatggagtcgccc tgcaacacgggctgctcctatggcaactgctacaaaacaagatcaggcttcaccagggtc gggagaagaaacctcccaagaaagccaggcgctagagtcatgggtgtggatgtcctttta gggcctaggaaagtagggtctgatgaggaggcattgaagaaaagccagtatgacgacgat gctttcatggtgaagttttgtgcatcttctactgaacattttttaaacaacccccaaaca gatgcagttttgggccagcgttatggaagagaaagcaaatcccccaaatgtctgaatcac aaacacagagacctcacccccgtgacagtggaaacaaatgccctggcctctgacccctac agatccgcgttcctgcccctaaacggtgcagtgaggggagagctgggctgcagaacaggt ggagagaagaacgtggggaagatgagcaggtgcgagacagggagaagccatcctacttct ggctccagcggcagcggtaccagtgactgtatacaacagggggcccgagaagcatctgag gctccagcggagacacaggattggcttccgctccccagcagtggggacgtctccagacat tcagcagctccagcccggggactgctgcagcttctaccctctggtttcgacatgacagct cgccagtttcacaatgcataccttccttatcagagaccgttgatgatggagatgctctgg ccagtctatagagaatgcacagaggattttcggtcctctgcttcaccttctgatgtcaga aggctgaaaactccacactcagattatgctaatgctgccatttgtgtacatgtgacccat gaagagcaactctcctctggcgtgacagccggcatggaggatccacagagtaaagagcct gccggcgaggccgtggctctcgcgctgctggagtcgccgcggccggagggcggggaggag ccgccgcgtcccagtcccgaggtgaggagtcgacccgcgcctggctgttggcgccagctg ccccgtccccaaagcgccctcggcacccctttcttctcaggtgtcccgcagaaggtgcag ctgtgcgcccttggaccccagcctcttcctcggtgcccagctccctggctttatcctcgg gaaactcaacagtgtaaatttgatggccaggagacaaaaggatccaagttcattacctcc agtgcgagtgacttcagtgacccggtttacaaagagattgccattacgaatggctgtatt aatagaatgagtaaggaagaactcagagctaagctttcagaattcaagcttgaaactaga ggagtaaaggatgttctaaagaagagactgaaaaactattataagaagcagaagctgatg ctgaaagagagcaattttgctgacagttattatgactacatttgtattattgactttgaa gccacttgtgaagaaggaaacccacctgagtttgtacatgaaataattgaatttccggtt gttttactgaatacgcatactttagaaatagaagacacgtttcagcagtatgtaagacca gagattaacacacagctgtctgatttctgcatcagtctaactggaattactcaggatcag gtagacagagctgataccttccctcaggtactaaaaaaagtaattgactggatgaaattg aaggaattaggaacaaagtataaatactcacttttaacagatggttcttgggatatgagt aagttcttgaacattcagtgtcaactcagcaggctcaaataccctccttttgcgaaaaag tggatcaatattcggaagtcatatggaaatttttacaagctctattacctttcagtcaga gcactcagtgttcatactcccgtgactatggagagtgtccacaactttcccctggatgta ataatttacttccgcggttgtttacttatccccttactctccattcaaactcggataagt gatcccggatcctggaatggtgctgagattgttgaaagaagcgaggtgactttggtgtgg gaatcaggaggatgtgaggaacagaagggaagccagcgaccaggtcagcagacacatcac tgcgtcagttcagtaggagagttcacactggcccccatacagcctggcagtaaaagccag tcagcagagagagacatgagcagcacatatgcgagactgcctagaggagatgaggcagag cagccctga >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_4|130_aa MGHIATYEVLNRNCPWLHPFSQRTFTPSGNPISSAGFLDLFTHQLQGHGVPHPADPKSAE TVSSTSSGAGDHSSKAQKPFHFQTLLHDVSSKLQEPIAHRHPNAAGVSDGTLPESQRLSA LGEFNQSAPE >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_4|393_bp atggggcacatagcaacctacgaggtcttaaatagaaactgcccgtggctccaccctttc tctcagcgcacattcactccatcaggaaatcccatcagctctgccggattcctggatctt ttcactcaccagctccagggccacggagtgccccatccagctgatcccaaatccgcagag acagtgtcttccacttccagcggagctggggatcacagctccaaggcccagaaacctttc catttccaaactcttcttcacgatgtctcctcgaagctccaggaacccatagcacatagg caccctaatgctgctggtgtcagtgacggtaccttgccagaatcccagagactcagtgct ctaggcgagttcaaccaatcagcaccagaatag >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_5|203_aa MPQVRKLKYLLVWVDTFTGWVEVFLTGSEKATAVISSLLSDIIPRFGLPTSIQSNNRPAF ISQITQAVSQALGIQWKLRTPYHPQYSGKVERTNGLLKTHLTKLSLQLKKDWTILLPLSL LRIQACPRNATGYSPFELLYGRSFLLSPSLIPDTRPTWTVPQKTCHPYYLLSSHTPIHHS QLLIHALLLFTLPVHTVSPSHRS >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_5|612_bp atgccccaagtcagaaaactaaaatatctcttagtctgggtagacactttcactggatgg gtagaggtctttctcacagggtctgagaaggccaccgcggtcatttcttcccttctgtca gacataattcctcggtttggccttcccacctctatacagtccaataacagaccagccttt attagtcaaatcacccaagcagtttctcaggctcttggtattcagtggaaacttcgtacc ccttaccatcctcaatattcaggaaaggtagaacggactaatggtcttttaaagacacat cttaccaagctcagcctccaacttaaaaaggactggacaatacttttaccactttccctt ctcagaattcaggcctgtcctcggaatgctacagggtacagcccatttgagctcctgtat ggacgctcctttttattaagccccagtctcattccagacaccagaccaacttggactgtg ccccagaaaacttgtcatccctactatcttctgtctagtcatactcctattcaccattct caactactcatacatgccctgctcttgtttacactgccggttcacactgtttctccaagc catcgcagctga >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_6|569_aa MDDFERVRTLVEEVNAEVVGIAREFELEVEPEDMTELLQSHHKTSTNEALLLTDEQRKWF LEMESTPVEDAVNIPETTTKDLEYYLSLVDTAAAGFERTDSNFERGSTMVESLAYSSDLY EQDIPMSLDVLYYDFYNNLGIIWKQGFEPDKTMELVTEIPALSWNPARAASLVIRSTRKI YPSVPRSSSCPSKPLFWPGIPQPAVKVRADPTVFREVGNASLNLLQCSQESEILASTCFL STEKRDQGLSQAQVKQGASCNMYEESTKSKHKCRASKNKRAKCKTRKSKYAQALAPRYAS EPFHSPAVTLSKFFHGASPVTTSMGKKALLLGYLALVQGLREGQSLDEEPRHIRRRKKQW SFCIIGSSKCPGSLDKVCLQGEGKTTPLRAHNLARPRKICCFGKLEINRPKQPEEMGGEK IEPVHSAVSAGVQSDGSHGCPYMRLEPDSFRRQMFSFCVHLQGWTPLREETSCAGGERGR DAPASGNPNEENGEMRLTIRRTKKRKKVGRKRRRKKVEARKRVEMNEEARAPGGKQGTAD REDSGVDIAKQKMRRIARQQKSQRKKRRP >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_6|1710_bp atggatgactttgagagggtcaggactttagtggaggaagtcaatgcagaggtggtggga atagcaagagaattcgaattagaagtggaacctgaagatatgactgaattgctgcaatct catcataaaacttcaacaaatgaggcgttgcttcttacggatgagcaacgaaagtggttt cttgagatggaatctactcctgttgaagatgctgtgaacattcctgaaacaacaacaaag gatttagaatattacctaagcctagttgacacagcagcggcaggatttgagaggactgac tctaattttgaaagaggttctaccatggtagaaagtctggcttattcctctgacctttat gaacaagacattcctatgtccttagacgttctttactatgacttctacaacaacctgggc atcatatggaagcaaggctttgagcctgataaaaccatggagctggtaactgaaattcct gcattaagctggaaccctgctagagctgcctccttggttataagatcaactagaaaaatc tacccgtcagtaccaaggagctccagctgcccctccaagcccctcttctggcctggcatt ccccagcctgcagtgaaggtcagagcagatcctaccgtattcagagaagtaggaaacgcg agcttgaatcttcttcagtgttctcaagagagtgagatattggcaagcacctgttttctc agcactgagaaaagggatcagggattgtcccaagctcaagttaaacagggggcctcctgc aatatgtatgaagagagcaccaagagcaaacacaagtgcagggccagtaaaaataaaaga gcaaagtgcaagactcgaaagagtaaatatgcacaggctctggccccaagatatgccagt gagccctttcacagtccagcagtgacgttgtccaagttctttcatggtgcctctcctgtg actacctccatgggaaagaaagcattgcttctggggtacttggccctggttcaaggcttg agagaaggccaatcccttgatgaagaaccaaggcatatcaggaggaggaagaagcagtgg agtttctgcataattgggtcaagcaagtgtcctgggtccttggataaggtttgcttacaa ggagaaggtaagacaacaccccttagagctcataatcttgccaggccaagaaagatatgt tgctttgggaaactggaaatcaacagaccaaaacagccagaagaaatgggtggagagaag atagagcccgttcactctgcagtctccgcaggggtacagagtgatggcagccatgggtgc ccttacatgcgacttgaaccagattctttcagacgccaaatgttctctttctgtgtgcat ttgcaaggatggactccactgcgagaagaaacaagttgtgcaggtggagaaaggggaaga gatgcccctgctagtgggaaccccaatgaagaaaatggggagatgaggctgacgataagg aggacaaagaagaggaagaaggtggggaggaagaggaggaggaagaaggtggaggcaagg aagagggtggagatgaatgaagaagctagagcccctggtggcaaacagggcactgcagat agagaagatagtggtgttgatatcgcaaagcagaagatgaggaggatagccaggcagcag aaaagtcaaagaaaaaaaaggcggccgtga >gi568815590f:8903064_9130031|GENSCAN_predicted_peptide_7|98_aa MANQMSNFHSKNFPEGEQGSVQVYVTHICRFSADETVSGDPGQTMLAGRANPGQSAPCLA ALPGATEELVQQGRLIRVGQSRGVDVLPECAAVADNIG >gi568815590f:8903064_9130031|GENSCAN_predicted_CDS_7|297_bp atggctaatcaaatgtcaaattttcacagcaagaattttccagaaggggaacagggctct gtccaggtgtatgttactcatatttgcagattcagtgctgatgagacagtttctggagac ccaggacaaacaatgctggctggaagagccaaccctggccagtcagccccttgtttggca gcacttccaggtgctacagaagaactggtgcagcaaggacgattgattcgtgtgggtcaa agtaggggtgttgatgttcttccggaatgtgcagcagttgctgataatattggctga