GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:47:58 Sequence gi568815590r:132938527_133160160 : 221634 bp : 45.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2825 3016 192 0 0 107 80 41 0.066 4.66 1.02 Intr + 10250 10417 168 0 0 108 89 91 0.645 11.12 1.03 Intr + 22482 22547 66 1 0 102 105 35 0.889 5.48 1.04 Intr + 24468 24548 81 1 0 97 84 4 0.512 0.51 1.05 Intr + 28034 28171 138 0 0 93 99 88 0.994 10.84 1.06 Intr + 29268 29444 177 1 0 111 105 129 0.994 16.79 1.07 Intr + 30932 31043 112 0 1 38 94 52 0.934 0.24 1.08 Intr + 33268 33347 80 1 2 87 85 74 0.848 6.19 1.09 Intr + 34072 34215 144 2 0 106 100 88 0.953 12.05 1.10 Intr + 34524 34638 115 1 1 79 68 24 0.402 -0.99 1.11 Intr + 45171 45369 199 0 1 55 49 123 0.148 4.45 1.12 Intr + 61442 61631 190 1 1 72 64 58 0.012 0.96 1.13 Intr + 63247 63335 89 2 2 86 64 37 0.015 0.79 1.14 Intr + 64454 64517 64 1 1 99 82 31 0.013 1.89 1.15 Intr + 73375 73509 135 2 0 99 90 48 0.799 6.64 1.16 Intr + 75074 75238 165 0 0 107 115 89 0.994 13.43 1.17 Intr + 79252 79471 220 2 1 140 86 140 0.974 16.36 1.18 Intr + 81076 81169 94 1 1 42 89 35 0.938 -1.03 1.19 Intr + 83465 83624 160 1 1 85 105 275 0.948 28.46 1.20 Intr + 91295 91497 203 0 2 19 115 248 0.588 19.60 1.21 Intr + 91855 92004 150 0 0 -14 28 162 0.195 0.36 1.22 Term + 95163 95219 57 2 0 91 48 59 0.657 -0.11 1.23 PlyA + 95802 95807 6 1.05 2.15 PlyA - 96040 96035 6 1.05 2.14 Term - 100211 99998 214 1 1 104 42 232 0.995 16.90 2.13 Intr - 101604 101472 133 1 1 111 109 35 0.818 7.60 2.12 Intr - 106589 106458 132 0 0 116 99 144 0.994 18.92 2.11 Intr - 109407 109304 104 2 2 106 113 142 0.999 18.32 2.10 Intr - 111462 111376 87 2 0 109 92 35 0.971 5.09 2.09 Intr - 112389 112290 100 1 1 121 61 108 0.961 10.57 2.08 Intr - 113465 113402 64 2 1 69 47 28 0.069 -4.91 2.07 Intr - 117357 117299 59 1 2 57 24 231 0.217 12.20 2.06 Intr - 121674 121574 101 2 2 115 110 50 0.650 9.65 2.05 Intr - 123194 123170 25 0 1 35 96 6 0.584 -6.82 2.04 Intr - 123620 123465 156 0 0 52 92 108 0.701 7.58 2.03 Intr - 132871 132744 128 0 2 115 45 16 0.056 0.32 2.02 Intr - 134726 134516 211 0 1 72 47 78 0.076 0.07 2.01 Init - 145397 145274 124 2 1 72 84 33 0.343 1.66 2.00 Prom - 147242 147203 40 -6.36 3.00 Prom + 149147 149186 40 0.04 3.01 Init + 152105 152218 114 1 0 56 66 26 0.426 -2.59 3.02 Intr + 156518 156682 165 1 0 101 105 187 0.995 21.86 3.03 Intr + 157680 157847 168 2 0 52 121 154 0.999 15.24 3.04 Intr + 174896 175077 182 1 2 89 119 188 0.998 20.67 3.05 Intr + 178083 178190 108 0 0 91 110 135 0.980 15.40 3.06 Intr + 193286 193420 135 2 0 100 53 147 0.998 12.08 3.07 Intr + 194944 195134 191 1 2 70 69 222 0.958 17.73 3.08 Term + 196150 196268 119 2 2 109 47 129 0.995 9.60 3.09 PlyA + 196350 196355 6 1.05 4.00 Prom + 203382 203421 40 -5.86 4.01 Init + 205697 205899 203 1 2 53 113 87 0.433 6.05 4.02 Intr + 212537 212644 108 2 0 102 101 41 0.744 6.20 4.03 Term + 216662 216800 139 2 1 129 38 14 0.376 -2.06 4.04 PlyA + 217018 217023 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 106091 105901 191 0 2 82 48 103 0.868 3.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:132938527_133160160|GENSCAN_predicted_peptide_1|999_aa XQGSTTTLQKRFEPTGFQNMLSGLYNPIVFSASGANLTDAHLFCLLACDRDLCCDGFVLT QVQGGAIICGLLSSPSVLLCNVKDWMDPSEAWANATCPGVTYDQESHQVILRLGDQEFIK SLTPLEGTQDTFTNFQQVYLWKDSDMGSRPESMGCRKDTVPRPASPTEAGLTTELFSPVD LNQVIVNGNQSLSSQKHWLFKHLFSAQQANLWCLSRCVQEHSFCQLAEITESASLYFTCT LYPEAQVCDDIMESNAQGCRLILPQMPKALFRKKVILEDKVKNFYTRLPFQKLMGISIRN KVPMSEKSISNGFFECERRCDADPCCTGFGFLNVSQLKGGEVTCLTLNSLGIQMCSEENG GAWRILDCGSPDIEVHTYPFGWYQKPRFSGYIQPLNSRRRVWSHLGLVSTLPEGEWDSLT VLTRWIISPYRVMTRHTQSGGNVKNGVIYGQKGKEVSSLLVQKSWSLQLSATLGSTVTSN DETLALSPGAQEDEGVATVTSLLLTFTLNIKFWLKKPCFTSDSKDAIMHSTMQTVVRAQA LEVDVWEVKSLTNVRIMKVCWGPERLFLPERFQNVTVYPSQPVQIHMYLAKRTTPCFPKG LENIPVSLDSWQSLALSSVVVDPSIRHFDVAHVSTAATSNFSAVRDLCLSECSQHEACLI TTLQTQPGAVRCMFYADTQSCTHSLQGQNCRLLLREEATHIYRKPGISLLSYEASVPSVP ISTHGRLLGRSQAIQVGTSWKQVDQFLGVPYAAPPLAERRFQAPEPLNWTGSWDASKPRA SCWQPGTRTSTSPGVSEDCLYLNVFIPQNVAPNASVLVFFHNTMDREESEGWPAIDGSFL AAVGNLIVVTASYRVGVFGFLSSGSGEVSGNWGLLDQVAALTWVQTHIRGFGGDPRRVSL AADRGGADVASIHLLTARATNSQLFRRAVLMCRLREADDDGDEEEDDGGNDDNDVGDDTM MLIMTIIMMEKITWLATAYALHIQYPPGKFLLSFRAQLK >gi568815590r:132938527_133160160|GENSCAN_predicted_CDS_1|3000_bp ngccaaggatccaccacaacacttcagaaacgctttgaacccactggtttccaaaacatg ctttctggattgtacaaccccattgtgttctcagcctcaggagccaatctaaccgatgct cacctcttctgtcttcttgcatgcgaccgtgatctgtgttgcgatggcttcgtcctcaca caggttcaaggaggtgccatcatctgtgggttgctgagctcacccagtgtcctgctttgt aatgtcaaagactggatggatccctctgaagcctgggctaatgctacatgtcctggtgtg acatatgaccaggagagccaccaggtgatattgcgtcttggagaccaggagttcatcaag agtctgacacccttagaaggaactcaagacacctttaccaattttcagcaggtttatctc tggaaagattctgacatggggtctcggcctgagtctatgggatgtagaaaagacacagtg ccaaggccagcatctccaacagaagcaggtttgacaacagaacttttctcccctgtggac ctcaaccaggtcattgtcaatggaaatcaatcactatccagccagaagcactggcttttc aagcacctgttttcagcccagcaggcaaacctatggtgcctttctcgttgtgtgcaggag cactctttctgtcagctcgcagagataacagagagtgcatccttgtacttcacctgcacc ctctacccagaggcacaggtgtgtgatgacatcatggagtccaatgcccagggctgcaga ctgatcctgcctcagatgccaaaggccctgttccggaagaaagttatactggaagataaa gtgaagaacttttacactcgcctgccgttccaaaaactgatggggatatccattagaaat aaagtgcccatgtctgaaaaatctatttctaatgggttctttgaatgtgaacgacggtgc gatgcggacccatgctgcactggctttggatttctaaatgtttcccagttaaaaggagga gaggtgacatgtctcactctgaacagcttgggaattcagatgtgcagtgaggagaatgga ggagcctggcgcattttggactgtggctctcctgacattgaagtccacacctatcccttc ggatggtaccagaagcccaggtttagtggttatattcaacctttaaattcaaggaggaga gtttggtcccacctgggcttagtcagcacgctgccagagggagagtgggacagcttgacg gtcctaacaagatggattatttcgccctatcgagtaatgactagacacacacaatccgga ggcaatgtcaagaatggtgttatttatggccaaaaagggaaagaagtcagctccttatta gtgcaaaagagttggagcttacagttatcggccacactgggcagcactgtgacatctaat gatgaaacactggccctgagtccaggagcccaggaagatgagggggtagcaactgtaaca tcactactattaacattcactcttaatattaaattttggttgaaaaagccctgttttaca agtgacagcaaggatgccatcatgcattcaacaatgcagacagtggtgagagctcaggct ctggaggttgatgtatgggaggtcaaatccttaacaaatgttaggatcatgaaggtttgc tgggggcctgaaagacttttcctgcctgaaaggtttcagaatgtcacggtttaccccagc cagcctgtgcagattcacatgtacctggccaaacgaacaactccatgttttccaaaaggc ttggagaacataccagtgtctctggactcgtggcagtccctggccctctcttcagtggtt gttgatccatccattaggcactttgatgttgcccatgtcagcactgctgccaccagcaat ttctctgctgtccgagacctctgtttgtcggaatgttcccaacatgaggcctgtctcatc accactctgcaaacccaacctggggctgtgagatgtatgttctatgctgatactcaaagc tgcacacatagtctgcagggtcagaactgccgacttctgcttcgtgaagaggccacccac atctaccggaagccaggaatctctctgctcagctatgaggcatctgtaccttctgtgccc atttccacccatggccggctgctgggcaggtcccaggccatccaggtgggtacctcatgg aagcaagtggaccagttccttggagttccatatgctgccccgcccctggcagagaggcgc ttccaggcaccagagcccttgaactggacaggctcctgggatgccagcaagccaagggcc agctgctggcagccaggcaccagaacatccacgtctcctggagtcagtgaagattgtttg tatctcaatgtgttcatccctcagaatgtggcccctaacgcgtctgtgctggtgttcttc cacaacaccatggacagggaggagagtgaaggatggccggctatcgacggctccttcttg gctgctgttggcaacctcatcgtggtcactgccagctaccgagtgggtgtcttcggcttc ctgagttctgggtccggagaggtgagtggcaactgggggctgctggaccaggtggcggct ctgacctgggtgcagacccacatccgaggatttggcggggaccctcggcgcgtgtccctg gcagcagaccgtggcggggctgatgtggccagcatccaccttctcacggccagggccacc aactcccaacttttccggagagctgtgctgatgtgccggctacgtgaggctgatgatgat ggtgatgaggaggaggatgatggtggtaatgatgataatgatgttggtgatgatacaatg atgttgataatgacaataattatgatggaaaaaataacctggttggccactgcatacgcc ctgcatatccagtatccacctggaaagttcctgctgtccttcagggctcagctgaaatga >gi568815590r:132938527_133160160|GENSCAN_predicted_peptide_2|545_aa MQREGSQGENWEPLGAEGPSEEEVASLEPGRGSRALDEKGTYLQSRSEQQDGNIERLCRN SCTCVNILGRPPKNSVTLYKSLCPFEYFPCKKKTSFSVCLKEVYMYYLGDNRPKDTVAGS QALEPDAWIPVPPPALSSWANSDTLCECSGLGFPVTKINRIQKGWDLRFTLNMLPVLANP SVVHYEFIAFASLVIVFGILLCISLSSLILSCRHRLWASPAAPGKKKEMGNSMKSTPAPA ERPLPNPEELCALLLLLLLLLLLLLLLLNMSSVKAARMGLLFSPMLLTTGLDSDFLAVLS DYPSPDISPPIFRRGEKLRVISDEGGWWKAISLSTGRESYIPGICVARVYHGWLFEGLGR DKAEELLQLPDTKVGSFMIRESETKKGFYSLSVRHRQVKHYRIFRLPNNWYYISPRLTFQ CLEDLVNHYSEVADGLCCVLTTPCLTQSTAAPAVRASSSPVTLRQKTVDWRRVSRLQEDP EGTENPLGVDESLFSYGLRESIASYLSLTSEDNTSFDRKKKSISLMYGGSKRKSSFFSSP PYFED >gi568815590r:132938527_133160160|GENSCAN_predicted_CDS_2|1638_bp atgcagagagaggggagccagggggaaaactgggaaccactgggtgcagagggaccatct gaggaggaggtggcttcactagaacctggacggggcagcagggcactggatgagaagggc acatatctacaatctagatcagagcagcaagatggcaacatagaaagattgtgccgtaat agctgcacctgtgtcaacattctgggtcggccaccaaagaactctgtgactctgtacaag tcactttgccccttcgagtattttccttgtaaaaagaagaccagtttctctgtatgtttg aaagaagtttacatgtactatctaggagataataggcctaaggacacggtagctgggagc caggctctagagccagatgcctggattccagtaccacctccagcactttccagctgggca aactccgacacattatgtgaatgctctggccttggtttccccgtcacaaagatcaacagg atccaaaaaggatgggatttgcgcttcactttaaacatgctgcccgttctagccaaccct tctgtggtgcactatgagtttatagcttttgcgagcctggtcattgtcttcggaatatta ctatgcattagcttgagcagtttgatcctgagctgcagacacaggctctgggcatcacca gcggccccagggaaaaagaaagaaatgggaaacagcatgaaatccacccctgcgcctgcc gagaggcccctgcccaacccggaggagctgtgcgccctgctgctgctgctgctgctgctg ctgctgctgctgctgctgctgctgaacatgagctccgtgaaagcagcaagaatgggcttg cttttctcacctatgctgcttacgacaggactggatagcgacttccttgccgtgctaagt gactacccgtctcctgacatcagccccccgatattccgccgaggggagaaactgcgtgtg atttctgatgaagggggctggtggaaagctatttctcttagcactggtcgagagagttac atccctggaatatgtgtggccagagtttaccatggctggctgtttgagggcctgggcaga gacaaggccgaggagctgctgcagctgccagacacaaaggtcggctccttcatgatcaga gagagtgagaccaagaaagggttttactcactgtcggtgagacacaggcaggtaaagcat taccgcattttccgtctgcccaacaactggtactacatttccccgaggctcaccttccag tgcctggaggacctggtgaaccactattctgaggtggctgatggcctgtgctgtgtgctc accacgccctgcctgacacaaagcacggctgccccagcagtgagggcctccagctcacct gtcaccttgcgtcagaagactgtggactggaggagagtgtccagactgcaggaggacccc gagggaacagagaacccgcttggggtagacgagtcccttttcagctatggccttcgagag agcattgcctcttacctgtccctgaccagtgaggacaacacctcctttgatcgaaagaag aaaagcatctccctgatgtatggtggcagcaagagaaagagctcattcttctcatcacca ccttactttgaggactag >gi568815590r:132938527_133160160|GENSCAN_predicted_peptide_3|393_aa MNTTNFECKLHIRYFPSIVSFNLHISSVNADHNISILQGGSALSPAAVISHERAQQQAIA LAKEVSCPMSSSQEVVSCLRQKPANVLNDAQTKLLAVSGPFHYWGPVIDGHFLREPPARA LKRSLWVEVDLLIGSSQDDGLINRAKAVKQFEESRGRTSSKTAFYQALQNSLGGEDSDAR VEAAATWYYSLEHSTDDYASFSRALENATRDYFIICPIIDMASAWAKRARGNVFMYHAPE NYGHGSLELLADVQFALGLPFYPAYEGQFSLEEKSLSLKIMQYFSHFIRSGNPNYPYEFS RKVPTFATPWPDFVPRAGGENYKEFSELLPNRQGLKKADCSFWSKYISSLKTSADGAKGG QSAESEEEELTAGSGLREDLLSLQEPGSKTYSK >gi568815590r:132938527_133160160|GENSCAN_predicted_CDS_3|1182_bp atgaatacaaccaactttgagtgcaaattacacatcaggtattttccctccattgtctcc tttaatctgcacatcagctctgtgaacgctgatcataatatctccattttacagggaggc tccgcactctccccggccgccgtcatcagccatgagagggctcagcagcaggcaattgct ttggcaaaggaggtcagttgccccatgtcatccagccaagaagtggtgtcctgcctccgc cagaagcctgccaatgtcctcaatgatgcccagaccaagctcctggccgtgagtggccct ttccactactggggtcctgtgatcgatggccacttcctccgtgagcctccagccagagca ctgaagaggtctttatgggtagaggtcgatctgctcattgggagttctcaggacgacggg ctcatcaacagagcaaaggctgtgaagcaatttgaggaaagtcgaggccggaccagtagc aaaacagccttttaccaggcactgcagaattctctgggtggcgaggactcagatgcccgc gtcgaggctgctgctacatggtattactctctggagcactccacggatgactatgcctcc ttctcccgggctctggagaatgccacccgggactactttatcatctgccctataatcgac atggccagtgcctgggcaaagagggcccgaggaaacgtcttcatgtaccatgctcctgaa aactacggccatggcagcctggagctgctggcggatgttcagtttgccttggggcttccc ttctacccagcctacgaggggcagttttctctggaggagaagagcctgtcgctgaaaatc atgcagtacttttcccacttcatcagatcaggaaatcccaactacccttatgagttctca cggaaagtacccacatttgcaaccccctggcctgactttgtaccccgtgctggtggagag aactacaaggagttcagtgagctgctccccaatcgacagggcctgaagaaagccgactgc tccttctggtccaagtacatctcgtctctgaagacatctgcagatggagccaagggcggg cagtcagcagagagtgaagaggaggagttgacggctggatctgggctaagagaagatctc ctaagcctccaggaaccaggctctaagacctacagcaagtga >gi568815590r:132938527_133160160|GENSCAN_predicted_peptide_4|149_aa MHLDSQVHVNLARGDFRAQSMPVSKVSPGFCRRTLVIQVRDHRPASCSFQTSPETSMSPL EGNKLRPRKLGTRFSTLLQNITGAIYHLEHRLAISYTPVSLSGWIWSLLVGWPGNNSFAQ AAQGGERKTYSPLPESKPLFFGYYKIIYK >gi568815590r:132938527_133160160|GENSCAN_predicted_CDS_4|450_bp atgcacctggacagccaagtacatgtaaatttggccagaggagacttcagggcccagagc atgcccgtcagcaaagtctcccctggcttctgcagaagaactctggtcatccaggtgagg gaccacaggccagccagctgttccttccagacgtctccagaaacctctatgagtcccttg gaaggtaacaaactcagaccaaggaagctgggaactcgcttctcaactctcctacagaat atcactggtgccatttaccacttagaacatcgacttgcaatcagttacacacctgtctcc ctctctggatggatttggtccttgcttgttgggtggccaggaaataacagctttgcacag gctgctcaaggtggtgagagaaagacctatagccccttgcctgaatccaaaccactgttt tttggctattacaagataatttataagtga