GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:45:18 Sequence gi568815591r:123015521_123299946 : 284426 bp : 36.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5852 5998 147 1 0 74 68 78 0.194 3.91 1.02 Term + 18993 19049 57 2 0 87 44 82 0.265 0.41 1.03 PlyA + 21292 21297 6 1.05 2.02 PlyA - 21848 21843 6 1.05 2.01 Sngl - 35383 35228 156 1 0 54 43 194 0.786 5.65 2.00 Prom - 40308 40269 40 -3.05 3.04 PlyA - 40869 40864 6 1.05 3.03 Term - 53862 53473 390 0 0 34 49 283 0.304 13.00 3.02 Intr - 57175 57119 57 1 0 108 103 -16 0.168 0.06 3.01 Init - 61060 60980 81 1 0 89 110 3 0.386 3.62 3.00 Prom - 64934 64895 40 -4.85 4.02 PlyA - 65051 65046 6 1.05 4.01 Sngl - 65648 65184 465 2 0 42 43 268 0.920 13.59 4.00 Prom - 65896 65857 40 -11.74 5.02 PlyA - 65997 65992 6 1.05 5.01 Sngl - 67626 66535 1092 0 0 41 45 466 0.659 34.36 5.00 Prom - 67813 67774 40 -11.84 6.09 PlyA - 68051 68046 6 1.05 6.08 Term - 68702 68058 645 2 0 -44 49 438 0.006 19.83 6.07 Intr - 69110 68949 162 2 0 64 71 82 0.005 3.35 6.06 Intr - 78687 78632 56 1 2 64 98 38 0.025 0.18 6.05 Intr - 90019 89979 41 0 2 127 75 32 0.130 2.75 6.04 Intr - 113980 113863 118 2 1 48 116 74 0.813 4.80 6.03 Intr - 116737 116605 133 1 1 118 80 39 0.841 5.40 6.02 Intr - 118950 118890 61 2 1 96 108 23 0.554 2.92 6.01 Init - 131781 131639 143 0 2 56 102 108 0.577 8.65 6.00 Prom - 143644 143605 40 -5.05 7.08 PlyA - 145153 145148 6 1.05 7.07 Term - 146091 145987 105 0 0 113 43 74 0.300 2.83 7.06 Intr - 152902 152854 49 0 1 87 83 30 0.277 0.16 7.05 Intr - 153041 152984 58 0 1 93 86 19 0.306 -0.78 7.04 Intr - 153815 153628 188 1 2 60 24 234 0.368 12.61 7.03 Intr - 156384 156248 137 0 2 76 115 88 0.916 8.75 7.02 Intr - 165581 165453 129 2 0 97 115 21 0.191 5.67 7.01 Init - 184426 184328 99 1 0 58 113 90 0.919 8.82 7.00 Prom - 195817 195778 40 -4.05 8.00 Prom + 202443 202482 40 -6.15 8.01 Init + 209796 209949 154 2 1 62 63 98 0.744 4.89 8.02 Intr + 214981 215287 307 2 1 69 81 197 0.326 11.68 8.03 Intr + 260659 260717 59 1 2 44 116 34 0.081 -0.59 8.04 Term + 261998 262221 224 0 2 43 44 241 0.831 11.30 8.05 PlyA + 262388 262393 6 1.05 9.03 PlyA - 266990 266985 6 1.05 9.02 Term - 276372 276307 66 0 0 73 39 98 0.757 0.36 9.01 Init - 277617 277573 45 0 0 53 92 55 0.681 3.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 110155 110044 112 0 1 114 48 37 0.826 -0.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_1|67_aa HFDYIGYCLPASKVSDKKSADNLIKDLLYVMRCFYFAAFKILFEFILLEHLSQHCDLGTY GLPDPQA >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_1|204_bp cactttgactatattggctactgccttcctgcctccaaagtttctgataagaaatctgct gataatcttattaaggatctcttatatgtgatgaggtgcttctattttgctgcttttaag attctctttgagttcatcttacttgagcacctcagccagcactgtgatctaggaacttac ggacttcctgatccacaagcatga >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_2|51_aa MWESLELHRHLMNGFDQNADSDMDNKVQVDMVSDGDEELVDTGVNVTLVMF >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_2|156_bp atgtgggaaagtttagaacttcatagacacttgatgaatggctttgatcaaaatgctgat agcgatatggacaataaagtccaggttgacatggtctcagatggagatgaggaacttgtt gacactggagtaaatgtgactcttgttatgttttag >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_3|175_aa MQSISVKAKIPPKLNRQGRLYSRLLQQKHKLKQVQKKQNKTKKTKVDLEEKIKKISKSDK GTLQLYHLSSEHTPAHDHQPQHLQIGKASELILTPSQHQPHPNSLCREKGHPDHPATMSF FFNFEKLEGAKMANWMQPGGTSPTKGPGHQEDWHTEQTFGEKALKVDGGRMQTLD >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_3|528_bp atgcagtccataagtgtcaaagcaaaaattccaccaaagttaaacagacaaggaagactt tattcaaggctattgcaacagaaacacaaactaaaacaagttcaaaagaaacaaaacaaa acaaaaaaaactaaagtggatttggaagaaaaaattaaaaaaatttccaagagtgataag gggactctccaactgtaccacctgtcatctgagcatactccagcccatgaccatcagccc cagcacctccagattggcaaagcatcagaactgatcctaacaccttcacagcaccaacct cacccaaacagcctttgcagagaaaaaggccacccagaccatcctgctactatgagtttt ttctttaattttgaaaagctggagggagccaagatggccaactggatgcagccaggagga acatctcccaccaagggaccaggacatcaggaagactggcacactgagcagacctttgga gagaaggcattgaaagtggatggagggaggatgcagacactggactga >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_4|154_aa MCKNHKHSSTPIADKQESQIMSELPFTIASKRIKRLGIQLTRDMKGLFKENYKPLLSEIK QDTNKWNNIPCSWIGRINIVKMVILPKVIYRFNTIPIKLPMTFFTELEKTTLKFIWNQKR ARIAKTILSQKNKAGGIMLPDFKLYYKATVTKTA >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_4|465_bp atgtgcaaaaatcacaagcattcttctacaccaatagcagacaaacaagagagccaaatc atgagtgaactcccattcacaattgcttcaaagagaataaaacgcctaggaatccaactt acaagggatatgaagggcctcttcaaggagaactacaaaccattgctcagcgaaataaaa caggacacaaacaaatggaataacattccatgctcatggataggaagaatcaatatcgtg aaaatggtcatactgcccaaggtaatttatagattcaataccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgttacct gacttcaaactatactacaaggctacagtcaccaaaacagcatga >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_5|363_aa MLVSDKTDFKPTKVKRDKEGHYIMVKESTQQEELTILNIYALNTGAPRFVKQVLRDLQRD IDSHTIITGDFNTPLSTLDRSMRQKVNKDIQELNSALHQVDLIDIYRTLHPKSTEYTFFS APHHTYSKTDHIVGSKALLSKCKRTEIITNCISDHSAIKLELRIKKLTQNRSTTWKLNNL LLNDYWVHNEMKAEIKMFFETNESKDTTYQNLWDTFKAVRREKFIALNAHKRKQERSKID TLISQLKELEKQEQTHSKASRRQEITKIRAELKEIETQNALQKINESRSWFFEKINKIDR PLVRLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIGEYYKHLYANKLENLEEMDKFLNT YTL >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_5|1092_bp atgctagtctctgataaaacagactttaaaccaacaaaggtcaaaagagacaaagaaggc cattacataatggtaaaggaatcaacgcaacaagaagagctaactatcctaaatatatat gcactcaatacaggagcacccagattcgtaaagcaagtccttagagacctacaaagagac atagactcccacacgataataactggagactttaacaccccactgtcaacattagacaga tcaatgagacagaaagttaacaaggatatccaggaattgaactcagctctgcaccaagtg gacctaatagacatctatagaactctccaccccaaatcaacagaatatacatttttctca gcaccacatcacacttattccaaaactgaccacatagttggaagtaaagcactcctcagc aaatgtaaaagaacagaaattataacaaactgtatctcagaccacagtgcaatcaaacta gaactcaggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctg ctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaa accaatgagagcaaagacacaacataccagaatctctgggacacatttaaagcagtgcgt agagagaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgac accctaatatcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagc agaaggcaagaaataactaagatcagagcagaactgaaggagatagagacacaaaacgca cttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaacaaaattgataga ccactagtaagactaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaat gacaaaggggatatcaccactgatcccacagaaatacaaactaccattggagaatactat aaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctcaacaca tacaccctctga >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_6|452_aa MRTKYRTKKGHVTRKLTCLCIAYSSTIGGLTTITGTSTNLIFAEYFNTCPYHSTLILDLA SVAFPRIQNCHGFISSEKMVLMIKKIILSRLWEQHVAVLLLKQMCKMSLISAGLIILCSF KEMFKCGKTKTVQQKACAEVIKQEYQKLGPISVHPSAVDLTYEAPKTEQRTRDTEMEKQH HCLQPPLLIPRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKGHHTKTSSVWHHHQRPKLT RITSVEKFLNDLMELKTMARELRDECTSFSSRFDQLEERVSVMEDEMNEMKQEEKFREKR VKRNEQSLQKIWDYVKRPNLRLIGVPESDGENGTKLENTLKDIIQENLPNLARQANIQIQ EIQRTLQRYSSRRATPRRIIVRFTKVEMKEKILRAAREKGWVTHKGKPIRLTADLSAETL QARREWGPIFNVLKEKEFSTQNFISSQTKLHK >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_6|1359_bp atgagaaccaaatatcgaacaaagaagggccacgtgacacgtaaacttacgtgtttgtgc attgcctactcttctaccattggtggactgacaacaatcactggtacctccaccaacttg atctttgcagagtatttcaatacctgcccttatcattctactcttatcctggatctggct tcagtggcttttcctaggattcaaaactgccatggttttattagtagtgagaagatggta ctgatgattaaaaagataattctatccagattgtgggagcagcatgtggcagtcctcctg ctgaagcaaatgtgtaaaatgtcactgatttcagcaggcttaattattctttgcagtttt aaggagatgttcaaatgtggcaaaaccaaaacagtccaacaaaaagcttgtgctgaggtg attaagcaagaataccaaaagcttgggccaataagtgttcatccaagtgctgtagacctt acatatgaagctcctaaaactgagcaaagaacaagagatacagaaatggaaaaacagcat cattgccttcagcctccgctgctgatacccaggcaaacggggtctggagtggacctccag caaactccaacagacctgcagctgagggtcctgactgttagaaggaaaactaacaaacag aaaggacatcacaccaaaacctcatctgtatggcaccatcatcaaagaccaaagctaact agaataactagtgtagagaagttcttaaatgacctgatggagctgaaaacaatggcacga gaactacgtgacgaatgcacaagcttcagtagccgatttgatcaactggaagaaagggta tcagtgatggaagatgaaatgaatgaaatgaagcaggaagagaagtttagagaaaaaaga gtgaaaagaaacgaacaaagcctccaaaaaatatgggactatgtgaaaagaccaaatcta cgtctgattggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctg aaggatattatccaggagaacttacccaacctcgcaaggcaggccaacattcaaattcag gaaatacagagaacgctacaaagatactcctcgagaagagcaactccaagacgcataatt gtcagattcaccaaagttgaaatgaaggaaaaaatattaagggcagccagagagaaaggt tgggttacccacaaagggaagcccatcagattaacagcggatctctcagcagaaactcta caagccagaagagagtgggggccaatattcaacgttcttaaagaaaaagaattttcaacc cagaatttcatatccagccaaactaagcttcataagtga >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_7|254_aa MKFFSYILVYRRFLFVVFTVLVLLPLPIVLHTKEAECAYTLFVVATFWLTEALPLSVTAL LPSLMLPMFGIMPSKKVASAYFKDFHLLLIGVICLATSIEKWNLHKRIALKMVMMVGVNP AWLTLGFMSSTAFLSMWLSNTSTAAMVMPIAEAVVQQIINAEAEVEATQMTYFNGSTNHG LEIDESVNGHEINERKEKTKPVPGYNNDTGKISSKVELEKHWKLAVQDGSPSPSVHSVSQ LAAQGKEKVEGICT >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_7|765_bp atgaaattcttcagttacattctggtttatcgccgatttctcttcgtggttttcactgtg ttggttttactacctctgcccatcgtcctccacaccaaggaagcagaatgtgcctacaca ctctttgtggtcgccacattttggctcacagaagcattgcctctgtcggtaacagctttg ctacctagtttaatgttacccatgtttgggatcatgccttctaagaaggtggcatctgct tatttcaaggattttcacttactgctaattggagttatctgtttagcaacatccatagaa aaatggaatttgcacaagagaattgctctgaaaatggtgatgatggttggtgtaaatcct gcatggctgacgctggggttcatgagcagcactgcctttttgtctatgtggctcagcaac acctcgacggctgccatggtgatgcccattgcggaggctgtagtgcagcagatcatcaat gcagaagcagaggtcgaggccactcagatgacttacttcaacggatcaaccaaccacgga ctagaaattgatgaaagtgttaatggacatgaaataaatgagaggaaagagaaaacaaaa ccagttccaggatacaataatgatacagggaaaatttcaagcaaggtggagttggaaaag cactggaaacttgcagttcaagatggctccccatctccctctgtccattctgtatcgcag ctagctgctcaaggaaaggagaaagtggaaggcatatgtacttag >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_8|247_aa MLTTKEIIETTLLTEELFPGEHLEEGLCRQEVETSDPLRFCGILKKDLNRLGSSSWESGE SPKTSNQEDIAMFIFNVMGIISTCGQIHLICELLIPAKSLPICARYLPIHEEKSLRLWPS GQWTPWQAAEQTRIAAYPPTRWTVQVEGAWAKGHAMYHSYHMKVTEETIAAVKDCQSLDV DVTGACCSVAALETAEIPLREGHVGNVHWLSYWMINVLLTALELISSVTLTASNVKVMKG LDGLPLE >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_8|744_bp atgcttacgacaaaagagatcattgaaacaaccttgttaactgaggagctgtttccagga gagcatctggaagaaggactttgcaggcaagaagtagaaacgtcagatcccctgagattt tgtggaatcttaaaaaaggacttgaatagactaggttcctcctcctgggaatctggtgaa agcccaaaaacatcaaaccaagaagatatagccatgttcatatttaatgtcatgggcata attagcacatgtgggcagatacatttgatatgtgaacttctgatacctgcaaagtctctg cccatctgtgcccgatatcttcccatccatgaagaaaaatcactgcggctgtggccttct gggcagtggaccccgtggcaggcagcagagcagacacgcattgctgcatatccacccact aggtggactgtgcaagtggagggggcgtgggccaagggccatgctatgtatcatagttat cacatgaaggtaactgaggagacaatagctgctgttaaagactgtcagtccctcgatgtg gacgtaaccggcgcctgctgctctgtggctgctttggaaactgcagagatcccactgaga gagggacatgttggaaatgtgcactggctttcatattggatgatcaatgttcttctcaca gctctagagctgatcagctctgtgaccttgactgcttccaacgtcaaagtaatgaaggga ttggatggtttgcctttagagtga >gi568815591r:123015521_123299946|GENSCAN_predicted_peptide_9|36_aa MNPDSAQAFRSTINIPIQHEDNGDEDLCDDPLSLSE >gi568815591r:123015521_123299946|GENSCAN_predicted_CDS_9|111_bp atgaatcctgattctgctcaagcttttagatcaactatcaatatacctattcaacatgaa gacaatggggatgaagatctttgtgatgatccactttcacttagtgaatag