GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:25:01 Sequence gi568815575f:15690496_15923239 : 232744 bp : 40.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 291 467 177 2 0 88 24 110 0.360 3.91 1.02 Intr + 2468 2563 96 1 0 75 93 80 0.820 6.49 1.03 Intr + 4275 4420 146 2 2 19 54 150 0.694 2.86 1.04 Intr + 7168 7270 103 1 1 45 98 103 0.730 6.26 1.05 Term + 9857 10150 294 1 0 6 28 170 0.463 -2.88 1.06 PlyA + 11071 11076 6 1.05 2.02 PlyA - 11351 11346 6 1.05 2.01 Sngl - 12748 12233 516 1 0 57 39 500 0.985 35.79 2.00 Prom - 37989 37950 40 -2.05 3.05 PlyA - 38104 38099 6 1.05 3.04 Term - 47512 47388 125 2 2 85 42 104 0.676 3.07 3.03 Intr - 48252 48061 192 1 0 62 23 155 0.644 4.94 3.02 Intr - 50145 50039 107 2 2 0 94 112 0.076 1.94 3.01 Init - 54233 54013 221 2 2 68 26 179 0.030 7.85 3.00 Prom - 57514 57475 40 -4.95 4.00 Prom + 58083 58122 40 -9.05 4.01 Init + 58371 58420 50 2 2 98 52 60 0.058 3.87 4.02 Intr + 59575 59670 96 1 0 13 76 147 0.023 4.21 4.03 Intr + 65858 65979 122 2 2 3 60 147 0.334 2.62 4.04 Intr + 74083 74280 198 2 0 78 94 201 0.775 18.10 4.05 Intr + 82001 82119 119 0 2 51 87 132 0.612 8.66 4.06 Intr + 83807 83902 96 1 0 75 93 72 0.701 5.69 4.07 Intr + 84751 84813 63 0 0 103 74 45 0.858 2.60 4.08 Intr + 86219 86374 156 1 0 89 91 120 0.997 11.69 4.09 Term + 91990 92169 180 0 0 94 41 192 0.990 11.73 4.10 PlyA + 93138 93143 6 1.05 5.00 Prom + 97122 97161 40 -3.75 5.01 Init + 100001 100041 41 1 2 102 121 39 0.468 8.31 5.02 Intr + 100439 100518 80 2 2 51 64 172 0.917 9.38 5.03 Intr + 109377 109458 82 1 1 66 78 145 0.992 9.08 5.04 Intr + 113193 113301 109 0 1 36 64 138 0.993 5.57 5.05 Intr + 113616 113702 87 2 0 65 45 135 0.102 6.15 5.06 Intr + 119706 119780 75 2 0 24 95 78 0.050 0.89 5.07 Intr + 122586 122707 122 2 2 29 85 73 0.569 -0.53 5.08 Intr + 125182 125395 214 1 1 73 86 178 0.601 13.90 5.09 Term + 132236 132847 612 1 0 81 46 407 0.853 29.29 5.10 PlyA + 133627 133632 6 1.05 6.08 PlyA - 133718 133713 6 1.05 6.07 Term - 141567 141530 38 1 2 107 38 41 0.356 -2.48 6.06 Intr - 154753 154570 184 1 1 32 92 108 0.749 4.04 6.05 Intr - 155021 154884 138 2 0 142 99 86 0.999 14.84 6.04 Intr - 155516 155408 109 1 1 86 65 74 0.948 4.27 6.03 Intr - 162029 161851 179 2 2 78 70 77 0.369 2.70 6.02 Intr - 164682 164460 223 2 1 34 12 172 0.386 1.41 6.01 Init - 169611 169532 80 0 2 83 75 40 0.409 2.78 6.00 Prom - 169788 169749 40 -3.65 7.03 PlyA - 171613 171608 6 1.05 7.02 Term - 183493 183354 140 2 2 76 38 105 0.852 1.44 7.01 Init - 185375 185252 124 2 1 86 70 70 0.698 5.38 7.00 Prom - 185968 185929 40 -3.45 8.07 PlyA - 186145 186140 6 -0.45 8.06 Term - 186946 186727 220 0 1 55 42 131 0.511 0.73 8.05 Intr - 189266 188983 284 2 2 47 76 162 0.025 6.19 8.04 Intr - 190563 190496 68 1 2 96 91 85 0.013 7.31 8.03 Intr - 201511 201354 158 0 2 99 68 43 0.036 2.13 8.02 Intr - 210772 210580 193 2 1 78 76 51 0.007 0.53 8.01 Init - 211290 210801 490 0 1 53 1 307 0.018 13.91 8.00 Prom - 212146 212107 40 -4.55 9.00 Prom + 217201 217240 40 -7.65 9.01 Init + 220505 220539 35 1 2 79 40 27 0.258 -3.31 9.02 Term + 222432 222579 148 0 1 29 50 259 0.829 12.59 9.03 PlyA + 223052 223057 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 50105 50039 67 2 1 98 94 44 0.851 7.29 S.002 Sngl - 54233 53895 339 2 0 68 47 209 0.869 10.58 S.003 Term + 113616 113732 117 2 0 65 45 168 0.881 7.76 S.004 Intr - 189921 189746 176 2 2 19 55 185 0.960 6.96 S.005 Sngl - 211290 210757 534 0 0 53 54 292 0.966 18.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_1|271_aa MQGHKLRLIQVIDGERLNEGNRNIENEQLLKSISEIKATGLGGEILFQRTESKVNQDLEL HLVHWNAVKFENFEDAALEENGLAVIGVFLKSGTVRHADMGMTVMKEEVDPVSRSLEAGG MAHCAEPHEEAPGLVKRQRILLDPSRSSISLVQPCLESITFIIKKQPVEVDDDQLFFAPL FANNLNTSHLYGIPPPKFVFPHLNHEKRQNQTEKGLLQNTSPAVFKSIKVMKNKAEKLQS GGDQGGVGSGEDKGHLGNLKSGVHLTVSTQC >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_1|816_bp atgcagggccacaagttgcgattaatccaagtgatagatggtgagagattaaatgaaggc aatagaaatatagaaaatgagcaacttttaaaaagcatttctgaaataaaagcaacagga ctaggaggtgagatcctcttccagagaacagaaagcaaagttaaccaagaccttgagctg cacttagtgcattggaacgcagtcaaatttgaaaactttgaggatgcagcactggaagaa aatggtttggctgtgataggagtatttttaaagtcgggtacagtaagacatgcagacatg ggaatgactgtcatgaaggaagaagttgatcctgtttccagatccctagaagctggaggc atggcacactgtgcagagccacatgaggaagcaccagggttggtcaagaggcagagaata ttactggacccaagtaggtccagtatctccctggtccagccctgtcttgagtccatcacc tttatcattaagaagcagccagtagaggttgatgatgatcagttattctttgcaccattg tttgcaaacaacctaaataccagtcacctctatggcattcctcctccaaaatttgtattc cctcatctaaatcatgagaaaagacaaaaccaaactgaaaaaggacttctacaaaataca tcaccagccgtcttcaaaagcatcaaggtcatgaaaaacaaggcagaaaaactccagagt ggaggagaccaaggaggagtaggttctggagaagacaaaggacacttgggcaatctaaag tctggagttcacttaacagtatcaacccaatgttaa >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_2|171_aa MAWASGFLCSLAGRFSRSPGCGQAVGVAGGPKPPLHVGQGSLGEIRPDVAQQRQPVGKAP PGLGDHLSQDADAGGDHLSLDADAGVVPAGRSGAEGPDADPVLPVEVGDLAVLLDLGAEE GPGLQNPVLLGGAQLSGKRLGPSSRHWAARSFRNLRSRPEIKCQGSVTSDG >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_2|516_bp atggcctgggcgtcgggcttcctctgctctctggcggggcgattttcacggtcaccaggc tgcggacaggctgtgggggtggcgggcggtccaaagccaccgcttcacgtaggccagggg agccttggagaaatcaggcctgatgttgctcagcagcggcagcctgttgggaaagcacct ccaggcctcggggaccatctctctcaggacgcggatgcgggcggggaccatctctctctg gatgcagatgcgggtgtagtcccagctgggaggagtggtgcggaggggccggatgcggac ccagtgcttcctgtggaagtaggtgaccttgcagttctcctggacctgggagctgaggaa gggcctggactgcagaacccagtgcttttgggcggggcgcagctttctggcaagcggctt gggccttccagtagacactgggctgccagaagtttccgaaatctgcggtcaaggccagag atcaaatgtcagggctcagtgactagcgacggttag >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_3|214_aa MDRGTFRDDEEVAGDDEAPMLKRTIGVKMGRTLGLLNKQFQYQGDVGKQVEKASFLAHNR VRMRHSNFWEDDGNSYRYIKGDSQKGAMKDAETENGQRQSRKAQSLEDPAISQKAGLSGV IWIQPTLATLPTKELLSRAPTPFSLEQLLSRPVQRTLKDDSRLVTPIVEEEGWAWATPLS PALAPPPPDQSSLLLLAAGRISGLKLDLGLQSLI >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_3|645_bp atggacaggggaaccttccgcgatgatgaggaggtggcaggggatgatgaggccccaatg ctgaagagaacaattggagtcaaaatgggaagaactttaggtttactcaataaacaattc cagtatcagggggatgtgggcaaacaagtggaaaaggcctcctttcttgctcacaatcga gttagaatgagacatagtaatttttgggaagatgatggcaacagttacaggtatatcaag ggagactcacaaaaaggagccatgaaggatgctgagactgagaacggtcaaaggcaatca aggaaggcgcagagcttggaagatccagctatttcacagaaggctggactttctggagtt atttggattcaaccaaccttggcaaccctacctaccaaggaactgctctcccgcgccccc acccccttttctctggaacagctgttatcgcgtcctgtgcaaagaaccttaaaggatgac agccgtttggtaactcccatagtggaggaagagggctgggcctgggccacgcctctgagc cctgctctggccccgcccccgccggaccaatcctccctcctcctgctcgccgctggccgc attagtggattgaaactagacctggggcttcagagccttatctaa >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_4|359_aa MVKPYLYEKYKKEPLHRQIAVEKVPDSEIHASEALQPLYLYLQNPEPSLKALETIEKVED TDLMGLELTQALWADKANTDYLRNGWMLNVHPLWESVDLVPGGDRQSPINIRWRDSVYDP GLKPLTISYDPATCLHVWNNGYSFLVEFEDSTDKSVIKGGPLEHNYRLKQFHFHWGAIDA WGSEHTVDSKCFPAELHLVHWNAVRFENFEDAALEENGLAVIGVFLKLGKHHKELQKLVD TLPSIKHKDALVEFGSFDPSCLMPTCPDYWTYSGSLTTPPLSESVTWIIKKQPVEVDHDQ LEQFRTLLFTSEGEKEKRMVDNFRPLQPLMNRTVRSSFRHDYVLNVQAKPKPATSQATP >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_4|1080_bp atggtaaaaccctatctctatgaaaaatacaaaaaagagcctctgcacaggcaaattgct gtggagaaagttccagattccgagattcatgccagcgaggccctgcagcctctatacttg tacttacaaaacccggaaccgagccttaaagccctggagactatagaaaaagtggaggac acagacctcatgggcttggagttgacacaggccctttgggcagataaagcaaacacagat tacctgaggaatggttggatgctaaatgtgcatccactctgggagagcgtggacctggtt cctgggggcgatcgccagtcacccatcaacattcggtggagggacagtgtttatgatccc ggcttaaaaccactgaccatctcttatgacccagccacctgcctccacgtctggaataat gggtactctttcctcgtggaatttgaagattctacagataaatcagtgatcaagggagga cccctggaacacaactaccgattgaagcagttccattttcactggggggccatcgatgcc tggggttctgagcacaccgtggacagcaaatgcttcccagcagagctgcacttagtgcat tggaacgcagtcagatttgaaaactttgaggatgcagcactggaagaaaatggtttggct gtgataggagtatttttaaagctaggcaaacatcataaggagctacagaaattagtggat actttgccgtcaattaagcataaggacgcccttgtggaatttgggtcatttgacccttcc tgcctgatgcctacctgcccagattactggacctactcagggtctctgactaccccaccc ctctccgagtctgtcacctggatcattaagaagcaaccagtagaggttgatcatgatcag cttgagcaatttcggaccctgcttttcacttccgaaggggagaaagagaaaagaatggtg gacaacttccgcccccttcagccactgatgaatcgcactgttcgttcatccttccggcat gattatgtgctgaatgtacaagcgaaacccaagccggccaccagccaagcaaccccctaa >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_5|473_aa MAAPEKMTFPEKPSHKKYRAALKKEKRKKRRQELARLRDSGLSQKEEEEDTFIEEQQLEE EKLLERERQRLHEEWLLREQKAQEEFRIKKEKEEAAKKRQEEQERKLKEQWEEQQRKERE EEEQKRQEKKEKEKTFMWKPSVDLFSHLASLRTPIICQDKGNAYQEHFPEKKKKRSSQLA IPEGAFGYPHHLLAILGLRCSRKHNFPTSSPTLLIKSMFTTFGMEQCRRDDYDPDASLEY SEEETYQQFLDFYEDVLPEFKNVGKVIQFKVYLKYNNVQEESTATFFMCSEIPTMNSGKL IETSTCLQIGLAPPLGRTPKGGRGWATTTTTTAGCGEGETLVQTTPTKEMGNPRGKVVVT GGRNLTNAHQRVGRGTIHEAEEEIGTAAGTAAGAGAAGAGAGAGAAGAAAAGAKVPLGPE VVAGGGRVIETELFRVPNPNKLVLFLNDCISYLLWLLHTRIEGYERRWVRKSG >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_5|1422_bp atggctgcgcccgagaagatgacgtttcccgagaaaccaagccacaaaaagtacagggcc gccctgaagaaggagaaacgaaagaaacgtcggcaggaacttgctcgactgagagactca ggactctcacagaaggaggaagaggaggacacttttattgaagaacaacaactagaagaa gagaagctattggaaagagagaggcaaagattacatgaggagtggttgctaagagagcag aaggcacaagaagaattcagaataaagaaggaaaaggaagaggcggctaaaaaacggcaa gaagaacaagagagaaagttaaaggaacaatgggaagaacagcagaggaaagagagagaa gaggaggagcagaaacgacaggagaagaaagaaaaagagaagacgttcatgtggaaacct agtgttgatctcttcagtcatttagcaagtcttcgaacacctattatatgccaggacaaa ggtaatgcgtaccaggagcacttccctgagaagaagaaaaagaggagtagtcagttagct atacccgagggggcatttggatacccacaccatttattggccatccttggcctaagatgt tcacgtaaacataatttcccaacatccagtcctacccttcttattaagagcatgtttacg acgtttggaatggagcagtgcaggagggatgactatgaccctgacgcaagcctggagtac agcgaggaagaaacctaccaacagttcctagacttctatgaggatgtgttgcccgagttc aagaacgtggggaaagtgattcagttcaaggtttatttgaaatacaacaatgtccaagag gaaagcactgcaactttcttcatgtgttcagaaatcccaacaatgaattctgggaagcta atagagacatctacttgtctccagatcggactggctcctcctttgggaagaactccgaaa ggagggagaggatgggccaccacgacgactactacagcaggctgcggggaaggagaaacc ctagtccagaccactcctacaaaagaaatggggaatccgagaggaaaagtagtcgtcaca gggggaagaaatctcacaaacgcacatcaaagagtcgggagaggcacaattcacgaagca gaggaagaaatagggaccgcagcagggaccgcagccggggccggggcagccggagccgga gccggagccggagccgcaggagccgccgcagccggagccaaagttcctctaggtcccgaa gtcgtggcaggaggaggtcgggtaatagagacagaactgttcagagtcccaaatccaaat aaactagttttgttcttaaatgattgtatatcttatttattatggttgctacatactcgg atagaaggctatgaacgcagatgggttcgcaagtctggctga >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_6|316_aa MDEAGNHDSQQTIARTKNQTLHVLTHRLRTLQNPVLLSLNKQDSRPRASRRHLQERAGAR ALGPGKRRFRVTWPGFRFAPPLSEETSNRLPWTQRLPALPPMQFMLLFSRQGKLRLQKWY VPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYKRYASLYFCCAIEDQDNELIT LEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLGGEVQETSKKNVLKAIEQADL LQEFIDWNMCQSRQLSYLWMDGSVWRWSIGQASSQWPFLDPKTLESKPLEDRLFEEGKRL AESSSISKRLKTFIAT >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_6|951_bp atggatgaagctggaaaccatgattctcagcagactattgcaaggacaaaaaaccaaaca ctgcatgttctcactcacaggctaaggacgctgcagaatcccgtcctcctatcattaaac aagcaggacagccgcccgcgcgcgtcccggaggcacctgcaagagcgcgcgggcgcgcgg gcgctcgggcccggcaaacgccgcttccgggtcacgtggcccggcttccggttcgcacct cccctcagcgaagaaacctccaatcggctgccttggactcagcggttacccgcgctgcct ccgatgcagtttatgttgctttttagtcgtcagggaaagcttcgactgcaaaaatggtat gtcccactatcagacaaagagaagaaaaagatcacaagagaacttgttcagaccgtttta gcacggaaacctaaaatgtgcagcttccttgagtggcgagatctgaagattgtttacaaa agatatgctagtctgtatttttgctgtgctattgaggatcaggacaatgaactaattacc ctggaaataattcatcgttatgtggaattacttgacaagtatttcggcagtgtctgtgaa ctagatatcatctttaattttgagaaggcttattttattttggatgagtttcttttggga ggggaagttcaggaaacatccaagaaaaatgtccttaaagcaattgagcaggctgatcta ctgcaggagtttattgactggaatatgtgtcagtcacgtcaactcagttatctatggatg gatggttcagtttggagatggtctatcggtcaggcctcttcccaatggccctttttggac cctaaaacactggagtctaaacctttagaggacagactatttgaagaagggaaaagattg gctgaaagtagcagtataagtaaacggttaaaaaccttcattgctacttaa >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_7|87_aa MQSMPVHPVHKDTKSLPNALEKTAHCLYHITLINKPNIISEGDGVCGQEDKKGENKAGSS AAMPFVQSLHYFCLQMATVLKKAAKSD >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_7|264_bp atgcagagcatgcctgtccatcctgtccacaaggataccaagtccttgccaaatgcgttg gagaaaactgcacactgtttgtaccacattactttgatcaacaagcctaacatcatcagt gaaggtgatggagtatgtggtcaagaagacaaaaagggggaaaacaaggcagggagctca gcagctatgccgtttgtacaaagccttcattatttctgcttgcagatggccacagtactt aagaaagctgccaagtcagattaa >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_8|470_aa MWKTLELPRDLLNGFAQNANSDNSDMDNKVQAEVVSDENEECVGNWSKGDSCYVLAKRLL AFCSCPRDLWNCGLERDDLGYLVEEISKQQSIQEVTWMLLKAFSFIREAKHKSSENLQPV NAIEKKIPFSEERFKLAAEICINNEELNVNPQDNGKMSPGHVRGLEAYEAKWFHGPSPAL PAVCSLKTWCPASQPVQPWLKGANVELGLWLQRVEAPSLGSFHMVLSLHVKCQLPFSFCH DYKFPETSPEAKQMPESCFLYSLRNHEPIKPPFFTNGPVSGKNAEQLLRSLESLSSPKDD KDVPMDLVLCVPASAAVAKKGHGTAWAMASEGASPEPWQLPCGIGPVGAWKSRIEVWELP PRFQRMYGNAWMSRQKFSARAGPSWRTSVRPVQKGNVGLPIHTVSFLISDFTVETPLKVQ LTSAPERKLVLNAPGRRSASSSGRMDAGKPTLLAGFRQQDVPLTKRPFFL >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_8|1413_bp atgtggaaaactttggaacttcctagagacttgttgaatggctttgcccaaaatgctaac agtgataacagtgatatggacaataaagtccaggctgaagtggtctcagatgaaaatgag gaatgtgttgggaactggagcaaaggtgactcttgttatgttttagcaaagagactgctg gcattttgctcctgccctagagatttgtggaactgtggacttgagagagatgatttaggg tatctggtggaagaaatttccaagcagcaaagcattcaagaggtgacttggatgctgtta aaggcattcagttttataagagaagcaaagcataaaagttcagaaaatttgcagcctgtc aatgctatagaaaagaaaatcccattttctgaggagagattcaagctggctgcagaaatt tgcataaataacgaggagctgaatgttaatccccaagacaatggaaaaatgtctccaggg catgtcagaggcctggaggcctatgaggcaaagtggtttcatgggcccagcccagcactc cctgctgtgtgtagcctaaagacttggtgccctgcatcccagccagtccagccatggctg aaaggagctaatgtagagcttgggctgtggcttcagagggtggaagccccaagccttggc agcttccacatggtgttgagcctccatgtgaagtgtcagctccccttctccttctgccat gattataagtttcctgagacctccccagaagccaaacagatgccagaatcatgcttcctg tacagcctgcggaaccatgagccaattaaacctcctttctttacaaatggcccagtctca ggtaagaatgctgagcaactcctcagaagtttggagagcctttcaagtcctaaagatgac aaagatgtgcctatggacttggtgctctgtgtcccagcctctgcagctgtggctaaaaag ggccacggtacagcttgggccatggcttcagagggtgcaagccccgagccttggcagctt ccatgtggtattgggcctgtgggtgcatggaagtcaagaattgaggtttgggaacttcca cctagatttcagaggatgtatggaaatgcctggatgtccaggcagaagttttctgcaagg gcagggccttcatggagaacctctgttaggccagtgcaaaagggaaatgtaggcctacct attcatacagtgtcttttctaatctctgatttcactgtagaaaccccacttaaggtccag ttaacttctgcccctgaaaggaagcttgtgcttaatgcccctggaagaaggtctgccagc tcttcaggccgaatggatgctgggaagccaactctcctggcaggcttccgacagcaagat gttcctctaaccaagaggcccttcttcctctga >gi568815575f:15690496_15923239|GENSCAN_predicted_peptide_9|60_aa MVPWDREAVPQWTFNYLDGDDDDNSGNFGGGDNVNGGDDADVAGDDDGQERDKEESLIYG >gi568815575f:15690496_15923239|GENSCAN_predicted_CDS_9|183_bp atggttccctgggaccgggaagctgtgccacagtggacgtttaactatctggatggagat gatgatgataatagtggtaattttggaggaggtgacaatgttaatggtggtgatgatgct gatgttgctggtgatgatgatggccaagaaagagataaagaagaaagtcttatttatggt tga