GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:57:46

Sequence gi568815584r:74186464_74402912 : 216449 bp : 48.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  11497  11683  187  0  1   58   23   158 0.123   5.56
 1.02 Intr +  16645  16664   20  2  2  113   61    13 0.003  -2.57
 1.03 Intr +  33965  34229  265  1  1   86   71   107 0.613   5.89
 1.04 Term +  35646  35833  188  1  2   95   43    73 0.617   1.15
 1.05 PlyA +  36286  36291    6                               1.05

 2.00 Prom +  52950  52989   40                              -5.86
 2.01 Init +  53099  53468  370  1  1   78   77   379 0.522  32.96
 2.02 Intr +  54719  54803   85  0  1   91   94   114 0.878  11.18
 2.03 Intr +  58702  58825  124  1  1  102   92   220 0.997  24.39
 2.04 Intr +  73139  73319  181  1  1  102   86   254 0.995  26.04
 2.05 Term +  74074  74456  383  2  2  105   42   404 0.803  32.50
 2.06 PlyA +  76255  76260    6                               1.05

 3.21 PlyA -  77652  77647    6                               1.05
 3.20 Term -  91380  91273  108  0  0   48   38   188 0.856   8.31
 3.19 Intr -  92083  91577  507  1  0  -37   39   370 0.257  12.77
 3.18 Intr - 100353 100238  116  1  2  108   75   213 0.169  22.17
 3.17 Intr - 101423 101347   77  1  2  115   94   106 0.995  13.06
 3.16 Intr - 101796 101744   53  0  2   63   99    46 0.887   0.91
 3.15 Intr - 102302 102253   50  0  2   75  119    48 0.900   5.00
 3.14 Intr - 103056 103020   37  0  1   94   99    49 0.848   4.44
 3.13 Intr - 103655 103564   92  0  2   29   82    75 0.909   0.71
 3.12 Intr - 104036 103828  209  1  2   99   64    95 0.921   6.92
 3.11 Intr - 105913 105824   90  0  0  105   94   160 0.998  17.41
 3.10 Intr - 106179 106088   92  0  2   86   94   219 0.884  21.09
 3.09 Intr - 106452 106285  168  0  0   68  100   165 0.211  15.84
 3.08 Intr - 108304 108154  151  0  1   92  -11    51 0.154  -4.24
 3.07 Intr - 108735 108685   51  2  0   86  109    64 0.980   6.32
 3.06 Intr - 109516 109391  126  0  0  120   75   119 0.998  13.59
 3.05 Intr - 109986 109870  117  2  0   68   67   227 0.992  18.18
 3.04 Intr - 111606 111467  140  0  2   62   29   274 0.994  18.16
 3.03 Intr - 113212 113085  128  2  2  142  107   124 0.999  20.10
 3.02 Intr - 113805 113687  119  2  2  116   92    -8 0.972   2.51
 3.01 Init - 116449 116412   38  1  2   81  115    88 0.985   8.32
 3.00 Prom - 121038 120999   40                              -3.46

 4.00 Prom + 124851 124890   40                              -4.46
 4.01 Init + 140704 140710    7  0  1   73   82     0 0.085  -1.02
 4.02 Intr + 151260 151421  162  1  0   85  111    92 0.486  11.15
 4.03 Term + 161725 161738   14  2  2   77   46    15 0.012  -5.44
 4.04 PlyA + 163271 163276    6                               1.05

 5.00 Prom + 167187 167226   40                              -4.16
 5.01 Sngl + 170321 172429 2109  1  0   83   48  2261 0.997 215.88
 5.02 PlyA + 173523 173528    6                               1.05

 6.00 Prom + 189142 189181   40                              -5.96
 6.01 Sngl + 190017 190241  225  2  0   62   42   275 0.999  15.44
 6.02 PlyA + 190957 190962    6                               1.05

 7.02 PlyA - 191013 191008    6                              -0.45
 7.01 Sngl - 191925 191260  666  0  0   77   46   812 0.992  72.08

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  71994  72092   99  2  0   64   46   115 0.883   3.17
S.002 Term - 100353 100211  143  1  2  108   55   241 0.831  20.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_1|219_aa DGDVFLGDHTAYPVPTAWFQKAWLPEPAISDQKLSALLNPFKKSENMALQDEDEDEEGAV TLRLRHHKKCTEHCRWGVQAPVCELHGFFMSVSPVLGTQWLLYHIIEQIKTPPPKSLCDC PSVGYALLSLLPKHTVIMPLKIRQVIAVENIEKDKILGEHSSVLFPKQFVDLIQSFQAPV TQGGTDHPHFTDERGDEALLAKTIWTQPLPPTCYVTLSK >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_1|660_bp gacggagatgtgttcttgggagaccacacagcctaccccgtgcctactgcatggttccag aaggcctggcttccagagccagccatcagcgatcagaaactttctgcactgttaaatcca tttaagaaatctgaaaacatggccttgcaggatgaggatgaggatgaggaaggggctgtt acactacgattaagacatcacaagaagtgcactgagcactgccgatggggtgttcaggcc cccgtttgtgagctccatggcttcttcatgtctgtatcaccagtgttgggcacacagtgg ttgctctaccacatcattgaacaaataaagacaccacctccaaaaagcctctgtgattgc cctagtgtaggttatgcgctcctttctttgctccccaaacacactgtgattatgccttta aaaataaggcaagtcattgctgtggagaatatagagaaggacaaaattttaggagagcac agctctgtactgttcccaaagcagtttgtggacctcatccaatctttccaagcacctgtg acccagggcggaaccgatcacccccattttactgatgaaaggggagatgaggctttattg
gcaaagacaatctggactcaaccactgccaccgacttgctacgtgaccttgagcaagtga >gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_2|380_aa MTGKAGEALSKPKSETVAKSTSGGAPARCTGFGIQEILGLNKEPPSSHPRAALDGLAPGH LLAARSVLSPAGVGGMGLLGPGGLPGFYTQPTFLEVLSDPQSVHLQPLGRASGPLDTSQT ASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQLEELEKAFNEAHYPDVYAREM LAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEYGLYGAMVRHSIPLPESILKSA KDGIMDSCAPWLLVQDGFPRRFSKPEYQQFFLGMHKKSLEAAAESGRKPEGERQALPKLD KMEQDERGPDAQAAISQEELRENSIAVLRAKAQEHSTKVLGTVSGPDSLARSTEKPEEEE AMDEDRPAERLSPPQLEDMA >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_2|1143_bp atgacggggaaagcaggggaagcgctgagcaagcccaaatccgagacagtggccaagagt acctcggggggcgccccggccaggtgcactgggttcggcatccaggagatcctgggcttg aacaaggagcccccgagctcccacccgcgggcagcgctcgacggcctggcccccgggcac ttgctggcggcgcgctcagtgctcagccccgcgggggtgggcggcatggggcttctgggg cccggggggctccctggcttctacacgcagcccaccttcctggaagtgctgtccgacccg cagagcgtccacttgcagccattgggcagagcatcggggccgctggacaccagccagacg gccagctcggattctgaagatgtttcctccagcgatcgaaaaatgtccaaatctgcttta aaccagaccaagaaacggaagaagcggcgacacaggacaatctttacctcctaccagcta gaggagctggagaaggcattcaacgaagcccactacccagacgtctatgcccgggagatg ctggccatgaaaacggagctgccggaagacaggatacaggtctggttccagaaccgtcga gccaagtggaggaagcgggagaagtgctggggccggagcagtgtcatggcggagtatggg ctctacggggccatggtgcggcactccatccccctgcccgagtccatcctcaagtcagcc aaggatggcatcatggactcctgtgccccgtggctactggttcaagatggctttcccagg cgcttttctaaacccgaataccaacaattctttctagggatgcacaaaaagtcgctggag gcagcagccgagtcggggaggaagcccgagggggaacgccaggccctgcccaagctcgac aagatggagcaggacgagcggggccccgacgctcaggcggccatctcccaggaggaactg agggagaacagcattgcggtgctccgggccaaagctcaggagcacagcaccaaagtgctg gggactgtgtctgggccggacagcctggcccggagtaccgagaagccagaggaggaggag gccatggatgaagacaggccggcggagaggctcagtccaccgcagctggaggacatggct tag >gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_3|822_aa MAVAGPAPGAGARPRLDLQFLQRFLQILKVLFPSWSSQNALMFLTLLCLTLLEQFVIYQV GLIPSQYYGVLGNKDLEGFKTLTFLAVMLIVLNSTLKSFDQFTCNLLYVSWRKDLTEHLH RLYFRGRAYYTLNVLRDDIDNPDQRISQDVERFCRQLSSMASKLIISPFTLVYYTYQCFQ 
STGWLGPVSIFGYFILGTVVNKTLMGPIVMKLVHQEKLEGDFRFKHMQIRVNAEPAAFYS VGWNGSFSENQETCSPQCCLMEPSVWSPDGSSRSSSSGMKSHPFPPGPEPGMWMVPGCLT DVFSTVGINTFDYLGSILSYVVIAIPIFSGVYGDLSPAELSTLVSKNAFVCIYLISCFTQ LIDLSTTLSDVAGYTHRIGQLRETLLDMSLKSQDCEILGESEWGLDTPPGWPAAEPADTA FLLERVSISAPSSDKPLIKDLSLKISEGQSLLITGNTGTGKTSLLRVLGGLWTSTRGSVQ MLTDFGPHGVLFLPQKPFFTDGTLREQVIYPLKEVYPDSGSADDERILRFLELAGLSNLV ARTEGLDQQVDWNWYDVLSPGEMQRLSFARLFYLQPKYAVLDEATSALTEEVESELYRIG QQLGMTFISVGHRQSLEKVSELSSSGAGSRAGARAGAGSSFKAAIGADSSSRADTKAEYS SVAGTTTGLSSGTGDGASSRASTGFSSGGGAKAGTSPGLGSGAGARAGSSFSSGAGASDG AGSSCAAGAGAGTSSGPRTSFSASVGAEPASGSGAGTKSGYSSRVDARVGAGVGAELSVA LELELVLLVLKLMVELTPDLALALDLVSELALELEEEKGLPA >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_3|2469_bp atggcggtcgcggggcccgcgcccggagctggcgccaggcccaggttagatctgcaattt ctccagcggttcctgcagatactgaaggttttgtttccttcttggtcatcacaaaatgcc ttgatgttcctgacccttttgtgcctgaccctactggagcaatttgtgatctaccaggtt ggcttgatccccagtcagtactatggggtcctgggaaacaaagacttggaagggtttaag actctgacattcctggctgtcatgctcattgttctgaactccacgctgaagagctttgat cagttcacctgcaacctgctgtatgtgagctggaggaaggacctcactgagcaccttcac cgcctctacttccggggccgtgcgtactacaccctcaacgtgctgcgggatgacatcgat aacccggaccagcgcatcagccaggacgtggagcgattctgccggcagctcagcagcatg gccagcaagctcatcatctccccgttcaccctcgtctactacacttaccagtgcttccaa agcacaggctggctcgggcctgtgagcatcttcgggtatttcatcctggggaccgtggtg aacaaaactttgatgggccccattgtgatgaagctggtgcatcaggagaagctggaggga gattttaggttcaagcacatgcagattcgggtgaatgcggagcctgctgctttctacagc gtgggatggaatgggagcttctcagaaaaccaggagacctgctccccacaatgctgtcta atggagccaagcgtctggtctcctgacggcagcagcaggagcagcagcagcggcatgaaa tcccatcccttccctccagggccagaaccgggcatgtggatggttcctgggtgtctcaca gatgtgttttctacagtcggcatcaacacctttgactatctgggcagcatcctgagttac gttgtcatcgcaatccccattttcagcggggtctatggagacctgagtcccgcagagctt agcaccctggtcagcaagaatgcctttgtgtgcatctacctcatcagctgcttcacccag ctcatcgacctgtccacgacgctctcagatgtggctggctacacgcacagaattgggcag cttcgggagacgcttctggacatgtccctgaagtcacaggactgcgagatcctgggcgag agcgagtggggcttggacacacccccagggtggccagcggcagagccagcagacacagca 
tttctccttgagcgggtctccatctctgccccctcctctgacaaacccctaatcaaggat ctgagcctaaagatctccgagggacagagcctgctcatcacaggcaacacgggcactggc aagacctccttgctccgggttctgggtggcctctggacgagtacacggggctcagtgcag atgctgacggactttgggccccatggggtgctattcctgccacaaaagccattcttcact gacgggacccttcgggagcaggtgatatatcccctgaaggaggtctaccccgactcaggt tctgccgatgatgagaggatcttgaggttcttggaattggcaggcctgtccaacttggtg gcaaggacagagggcctggaccagcaggtggactggaactggtatgatgttctgtccccg ggggagatgcaacggctctcctttgcccgactcttctacctgcagccgaagtacgcagtg cttgatgaagccaccagtgccctgacagaggaagtggagagcgagctctatcgcatcggc cagcagctggggatgacgttcatcagtgtgggacatcggcagagccttgagaaggtgtca gaactgtctagctctggagctggatctagggctggtgctagagcaggagcaggctctagc ttcaaagctgccattggagctgactctagctctagagctgacaccaaagctgaatatagc tctgtggctggcactacaactggcttaagctctggaacaggtgatggagctagttctaga gctagcactggctttagctctggaggtggtgccaaagctggcacttcacctggcctaggc tctggagctggtgctagagctggctctagcttctcatctggtgctggagctagtgatgga gctggctctagctgtgctgctggtgctggagctggcactagctctggacctagaacttcc tttagtgccagcgttggtgctgaacctgcttctggctctggagctggcactaaatctggc tatagctcacgagtggatgccagagttggtgctggagtaggtgcagagctgtctgtagct ctagagctggagctagtgctgcttgtgctgaagctgatggtagagctgacaccggatctg gccctagctctggacctggtgtcagaactggcactggagctggaagaggagaaaggtttg ccagcatga >gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_4|60_aa MVGTFPEDSCSVKPTAPKAPIKDPANKQMAIQVANDIAHPRYLGCLLFPEVGAWLAGLSI >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_4|183_bp atggtaggtacgtttcctgaggacagctgctctgtgaagcccactgccccaaaggcacct atcaaagaccctgcaaacaagcagatggcaatccaggttgccaatgacattgcccatccc aggtacctgggctgcctgcttttccctgaagtgggagcctggcttgcaggtctttcgatc tga >gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_5|702_aa MTSRNQLVQKVLQELQEAVECEGLEGLIGASLEAKQVLSSFTLPTCREGGPGLQVLEVDS VALSLYPEDAPRNMLPLVCKGEGSLLFEAASMLLWGDAGLSLELRARTVVEMLLHRHYYL QGMIDSKVMLQAVRYSLCSEESPEMTSLPPATLEAIFDADVKASCFPSSFSNVWHLYALA SVLQRNIYSIYPMRNLKIRPYFNRVIRPRRCDHVPSTLHIMWAGQPLTSHFFRHQYFAPV VGLEEVEAEGAPGVAPALPALAPLSSPAKTLELLNREPGLSYSHLCERYSVTKSTFYRWR 
RQSQEHRQKVAARFSAKHFLQDSFHRGGVVPLQQFLQRFPEISRSTYYAWKHELLGSGTC PALPPREVLGMEELEKLPEEQVAEEELECSALAVSSPGMVLMQRAKLYLEHCISLNTLVP YRCFKRRFPGISRSTYYNWRRKALRRNPSFKPAPALSAAGTPQLASVGEGAVIPWKSEAE EGAGNATGEDPPAPGELLPLRMPLSRWQRRLRRAARRQVLSGHLPFCRFRLRYPSLSPSA FWVWKSLARGWPRGLSKLQVPVPTLGKGGQEAEEKQEKEAGRDVTAVMAPPVGASSEDVE GGPSREGALQEGATAQGQPHSGPLLSQPVVAAAGGRDGRMLVMDMIATTKFKAQAKLFLQ KRFQSKSFPSYKEFSALFPLTARSTYYMWKRALYDGLTLVDG >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_5|2109_bp atgacttctcggaaccagctggtgcagaaggtgctgcaggagctgcaggaagcagtggag tgcgaaggcctggagggtctcataggtgcttccttggaggccaagcaggtcctgtcttcc ttcactctccccacctgccgggagggaggccctggcctccaggtgctggaagtggactcg gtggccctgagcctgtatccagaagatgctccacggaacatgctgccgctggtgtgcaag ggggagggcagcctgctgttcgaggcggccagcatgctgctgtggggtgacgcaggcctc agcctggagctgcgggcccgcaccgtggtagagatgctgctgcacagacactactacctc cagggcatgatcgactccaaagtgatgctgcaggccgtgcgctactccctatgctctgag gagtcccctgagatgaccagcttgccccccgccacgctggaggccatcttcgatgccgac gtcaaggcctcctgtttccccagcagcttctccaacgtgtggcacttgtatgctctcgcc tctgtcctccagcggaacatctactccatctaccccatgcgcaacctcaagatccggccc tacttcaaccgtgtcatccggccccgccgctgcgaccacgtgccctccacgctgcacatc atgtgggctggccagcccctcaccagccacttcttccgccaccagtactttgcccctgtg gtggggctggaagaggtggaggctgaaggtgcccctggcgtggccccagctcttccagcc ctggccccactctcatcgccggccaagaccctggagctgctcaaccgtgaacctggcctc agctactctcacctctgtgagcgctacagcgtcaccaaaagcaccttctaccgctggcgg cggcagtcccaggagcaccggcagaaggttgctgcccgcttctccgccaagcacttcctg caggacagcttccaccgggggggcgtcgtgccacttcagcagttcctccagcggttcccg gagatctcccgctcaacctactatgcctggaagcatgagctgctgggctctggcacctgc ccggccttgccccccagggaggtgctgggcatggaggagctagagaagctgccggaggag caggtggctgaggaggagctggagtgctccgcactggcggtgtcaagccctggaatggtc ttaatgcagcgggccaagttgtacctggagcattgcatctccctgaacacactggtaccc tatcgctgcttcaaacgcaggttccctggcatctcacggtccacttattataattggcgg cgaaaggccctccggaggaaccccagcttcaagccggcaccagccctctctgctgctggg actccccagctagcatctgttggggaaggggctgtaattccttggaagagtgaggcggaa gagggggcagggaatgccacaggtgaggaccctcccgcccccggggagctcctgccacta 
aggatgcccctgtcccgttggcagaggcgtctgcgcagggctgcccgcaggcaggtgctg agtgggcatctccctttctgccgcttccgcctccgctaccccagcctgtcaccttctgcc ttttgggtctggaagagtcttgctcggggttggcccagaggcctgtccaaacttcaggtg ccggtccccaccttgggcaaaggggggcaggaggctgaggagaagcaggagaaggaggct ggcagggatgtgacagctgtgatggccccacctgtgggggcttcttcagaagatgtagag ggagggccttccagagagggggccctgcaggagggggccacagcccagggccagccccac agtgggcccttgctgagccaacctgtggtggcagcagcgggtggcagggatggccggatg ctggtgatggacatgatcgctaccacgaagttcaaggcccaggccaagctgttcttgcag aagcgcttccagtccaagagctttccctcctacaaggagttcagtgccctctttcccctc actgcccgctccacatactacatgtggaagcgagccctctatgacggcctgaccctggta gatggctga >gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_6|74_aa MANNFNLGVDEDAIEEVLEVVPKEPMNEGLLELEQEHIAEEQTREKETAEEKELPRKFTV KGLAKAFTDLNQIL >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_6|225_bp atggcaaataactttaaccttggtgtggatgaggatgccattgaggaggtcctagaggtg gttcctaaggaaccgatgaatgaggggttgttggaactggaacaggaacacatagctgaa gaacagacaagagaaaaggaaactgcagaagaaaaagaacttccaagaaaattcacagtg aagggcttagcaaaagcttttacagacctcaaccagatcctttaa >gi568815584r:74186464_74402912|GENSCAN_predicted_peptide_7|221_aa MKIKSLKEIYIFSLPIKESEIIDLSLGASLKDEVLKIMPVQKQTRAGQRTRFKAFVAIGD YNGHVGLGIKCSKEVATAICGAIILAKLSIVPISRGCWENKTGKPHTVPCKMTDRCGSVP VHLIPAPRGTGIVSAPVPKKLLMMAGINDCYTSAQGCTATLGNFAKATFDAISKTYSYLT PDLWKETVFTKSLYQEFTDHLVKTHTRVSVQWTQAPAVATT >gi568815584r:74186464_74402912|GENSCAN_predicted_CDS_7|666_bp atgaagatcaagtccctgaaggagatctatatcttctccctgcccattaaggaatctgag atcattgacctttccctgggggcctctctcaaggacgaggttttgaagattatgccggtc cagaagcagacccgtgccggccagcgcaccaggttcaaggcgtttgttgccatcggggac tacaatggccacgtcggtctgggtattaagtgctccaaggaggtggccactgccatctgc ggggccatcatcctggccaagctctccattgtccccataagcagaggctgctgggagaac aagactggcaagccccacaccgtcccttgcaagatgacagaccgctgcggctctgtgccg gtgcacctcatccctgcacccaggggcactggcattgtctcagcgcctgtgcccaagaag ctgctcatgatggctggtatcaatgactgctacacctcagcccagggctgcactgccacc ctgggcaacttcgccaaggccacctttgatgccatctctaaaacctacagctacctgacc 
cccgacctctggaaggagactgtattcactaagtctctctatcaggaattcactgaccac ctcgtcaagacccacactagagtctcggtgcagtggacccaggctccagctgtggctaca acatag