GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:06:45 Sequence gi568815581r:19809181_20068463 : 259283 bp : 42.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 86 24 63 0 0 88 103 10 0.005 0.27 1.06 Intr - 1258 1198 61 1 1 121 75 47 0.008 4.09 1.05 Intr - 8748 8619 130 2 1 -8 68 90 0.002 -2.82 1.04 Intr - 16002 15914 89 0 2 88 100 49 0.042 3.95 1.03 Intr - 34042 33941 102 1 0 89 96 10 0.628 1.35 1.02 Intr - 36197 36124 74 0 2 49 101 73 0.939 2.91 1.01 Init - 37788 37557 232 0 1 51 67 275 0.288 20.17 1.00 Prom - 41291 41252 40 -4.15 2.00 Prom + 50398 50437 40 -4.55 2.01 Init + 58236 58542 307 2 1 79 10 218 0.472 8.60 2.02 Term + 58607 59064 458 0 2 67 41 195 0.312 7.00 2.03 PlyA + 59487 59492 6 1.05 3.23 PlyA - 62604 62599 6 1.05 3.22 Term - 66877 66770 108 1 0 48 46 102 0.066 -0.47 3.21 Intr - 81451 80788 664 0 1 62 65 234 0.435 9.34 3.20 Intr - 82173 81892 282 2 0 90 92 160 0.789 12.21 3.19 Intr - 82548 82469 80 0 2 33 110 93 0.753 3.43 3.18 Intr - 96050 95966 85 1 1 95 85 32 0.013 2.50 3.17 Intr - 96601 96531 71 1 2 46 76 63 0.009 -2.04 3.16 Intr - 100096 100001 96 1 0 68 116 112 0.916 11.29 3.15 Intr - 100798 100746 53 2 2 86 84 27 0.723 -0.19 3.14 Intr - 110938 110856 83 0 2 62 76 107 0.227 5.26 3.13 Intr - 115337 115228 110 2 2 63 102 86 0.252 5.66 3.12 Intr - 126671 126537 135 2 0 16 52 136 0.067 2.64 3.11 Intr - 127250 126956 295 1 1 74 84 210 0.815 15.29 3.10 Intr - 130669 130533 137 1 2 80 89 153 0.999 13.05 3.09 Intr - 131830 131707 124 0 1 91 88 42 0.999 4.17 3.08 Intr - 132730 132646 85 2 1 72 89 101 0.998 6.56 3.07 Intr - 137969 137841 129 0 0 2 38 182 0.195 4.35 3.06 Intr - 138325 138227 99 2 0 130 87 81 0.999 11.36 3.05 Intr - 139449 139222 228 1 0 65 38 110 0.545 0.62 3.04 Intr - 149391 148834 558 1 0 111 87 217 0.945 15.77 3.03 Intr - 153842 153660 183 0 0 28 106 69 0.243 1.54 3.02 Intr - 159281 159234 48 0 0 69 115 55 0.540 4.03 3.01 Init - 161007 160929 79 0 1 86 96 30 0.637 4.87 3.00 Prom - 161991 161952 40 -3.65 4.03 PlyA - 162449 162444 6 1.05 4.02 Term - 169150 169050 101 2 2 43 53 143 0.427 3.61 4.01 Init - 171554 171446 109 2 1 54 78 155 0.868 11.53 4.00 Prom - 172551 172512 40 -6.05 5.00 Prom + 173910 173949 40 -4.25 5.01 Init + 182201 182336 136 1 1 70 77 66 0.709 4.07 5.02 Term + 187411 187547 137 2 2 110 48 107 0.804 6.10 5.03 PlyA + 188490 188495 6 1.05 6.06 PlyA - 191722 191717 6 1.05 6.05 Term - 200594 199816 779 0 2 40 41 398 0.682 22.84 6.04 Intr - 207743 207533 211 2 1 69 59 144 0.790 7.16 6.03 Intr - 208223 208085 139 1 1 23 85 108 0.166 3.55 6.02 Intr - 213609 213527 83 0 2 79 94 21 0.013 -0.68 6.01 Init - 227985 227929 57 0 0 74 86 44 0.849 4.16 6.00 Prom - 228291 228252 40 -2.85 7.02 PlyA - 228662 228657 6 1.05 7.01 Sngl - 238176 237985 192 0 0 72 48 132 0.757 2.49 7.00 Prom - 240549 240510 40 -2.85 8.02 PlyA - 240637 240632 6 1.05 8.01 Term - 247344 247191 154 2 1 79 45 161 0.750 7.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 96050 95910 141 2 0 95 54 133 0.964 7.55 S.002 Term - 126671 126498 174 2 0 16 44 202 0.902 5.38 S.003 Term - 220910 220800 111 2 0 64 40 98 0.861 0.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_1|251_aa MQMVFDFVCLNIYVTPFVPAKGTLSEDTIRVFLHQIAAAMRILHSKGIIHRDLKPQNILL SYANRRKSSVSGIRIKIADFGFARYLHSNMMAATLCGSPMYMAPEVIMSQHYDAKADLWS IGTVIYQCLVGKPPFQLAQFQCPCILVLSLEAPVAALHLVVLLLHQYKVSSKSDKERLCQ VHVTHMKISGISDRYHGTVTFKGAAAVNQVICQWGLLADVLQMNSWCVEGPDSNHILLSV SMNLTTLDTSX >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_1|753_bp atgcagatggtatttgattttgtgtgcttaaatatttacgtgactccttttgtaccagcg aaagggactctcagtgaagacacgatcagagtgtttctgcatcagattgctgctgccatg cgaatcctgcacagcaaaggaatcatccacagagatctcaaaccacagaacatcttgctg tcctatgccaatcgcagaaaatcaagtgtcagtggtattcgcatcaaaatagcggatttt ggttttgctcgttacctacatagtaacatgatggctgcaacactgtgtggatccccgatg tacatggctcctgaggttattatgtctcaacattatgatgctaaggctgacttgtggagc ataggaacagtgatataccaatgcctagttggaaaaccaccttttcagcttgcccagttc cagtgcccatgtattctggttctgtctctggaagctcctgtggcagctctccatcttgtc gttttgcttctccaccaatacaaagtctcttcaaaaagtgataaggagagattgtgtcaa gtgcatgtcacacacatgaaaataagtggtatcagtgaccgataccatggcacagtgaca ttcaaaggagcagcagcagtgaaccaggtgatatgccagtggggactgctggcagacgtg cttcaaatgaattcttggtgtgtggagggccctgatagtaaccacattctgctctctgtt tctatgaatttgactactttagatacctcatnn >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_2|254_aa MAAPPGHTADGRAAQCGAGISTAAPRARSAPACGGSGAAAAPGPGGLSCRETGAETGEAA ARSQGQRGPARPLPLMARPAAGRWLSGEAASALPGQGCRHRHSRRASSALPARGAAATAT AAALEPAPPRPRASPPRGGAQRPRARYPLPRDWLPSATSARLEGRSASRLPPPGKGVRPG LGARSWADPRPLLRRGRRARPAAATAGYGLRAPFWAWGADSSRPRPGLQAAGRRTSEKRV LNPSCRQDGIICAG >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_2|765_bp atggccgcgcccccggggcacacagcggacgggcgggcggcgcagtgcggcgcaggtatc agcaccgcggctccgcgggcccggagcgcgccagcgtgcggcgggtctggggcagccgca gccccgggcccgggcggactctcatgccgagagaccggagcggaaactggggaagctgcc gcgcggagccagggtcagcgaggcccggcccggcccctgccgctcatggcccggcctgcc gccggccgctggctgtctggcgaagcggcctcggcgctgccgggccaggggtgccgtcac cgtcactctcgtcgggcctcctcggccctccccgctcggggcgctgctgcgacggcgacg gccgccgccctagaacccgcaccgccgcggccccgcgcttccccgcctcgcggcggcgcc caacgccctcgcgcgcgctacccgctcccgcgcgactggctgccctccgcgacgtccgcg cgccttgaaggccgcagcgccagccggctccctccgccggggaaaggagtgcggccaggg cttggcgcccgcagctgggcagatcctcggccgctcctgaggcggggacgaagggcgcga cccgctgcggccacagcgggttacgggctgcgggctccgttctgggcctggggagcagat tcgagccgaccacgccccgggctccaggccgctgggaggcggacttcagagaagcgggtc ttgaaccccagctgcaggcaggacgggattatctgcgcagggtag >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_3|1243_aa MKTEVGTGLGEDDMFPFRQVELWKMAVKGKEQEKTSDVKSIKASISVHSPQKSTKNHALL EAAGPSHVAINAISANMDSFSSSRTATLKKQPSHMEAAHFGDLGRSCLDYQTQETKSSLS KTLEQVLHDTIVLPYFIQFMELRRMEHLVKFWLEAESFHSTTWSRIRAHSLNTVKQSSLA EPVSPSKKHETTASFLTDSLDKRLEDSGSAQLFMTHSEGIDLNNRTNSTQNHLLLSQECD SAHSLRLEMARAGTHQVSMETQESSSTLTVASRNSPASPLKELSGKLMKKSQGFRFPILQ HTSCSYVNLCGHDFFFLGLQFLLSESRILGAAVIAAVAALMLPGSRRRERNKTKQPRGFP PLSLIRIEQDAVNTFTKYISPDAAKPIPITEAMRNDIIAGPAGGYANRHRPPDGLAGSPA PVHEVGGTETSGVGSPVRLKAARICGEDGQVDPNCFVLAQSIVFSAMEQEHFSEFLRSHH FCKYQIEVLTSGTVYLADILFCESALFYFSEYMEKEDAVNILQFWLAADNFQSQLAAKKG QYDGQEAQNDAMILYDKYFSLQATHPLGFDDVVRLEIESNICREGGPLPNCFTTPLRQAW TTMEKVTQNFKRIKLQEVLLVELIKYKVGKPSRIAQWKLKQSSKVIKSRGLESGLKTVNC KTASDRSFRRFPEGGIVIKGDDGSMRVTAPEDLPVGQDVESSVKKASIKILKNFDEAIIV DAASLDPESLYQRTYAGKMTFGRVSDLGQFIRESEPEPDVRKSKGSMFSQAMKKWVQGNT DEAQEELAWKIAKMIVSDIMQQAQYDQPLEKSTKVSQPHREPGSFEWQRGASSFSGTGIV RDSLTALLLSWLLHRDHTAISRTAQLWLKTDTARSPWKPTGPSQTLWVTYSGGSKPSSHP PLVSPHLNPQVWDTSTPSLVTDHARLTIPLKPNHPYSNQRQYPIPWHALKGLKPVITRLL QHGLLKPINSPYNSPILPVQKPDKSYSSHNFQNLFSSSHLTHILSAPQLLQLCSVFVESP TITIVPGPDFNPNSHIIMDIRTDPHDCISLIHLTVTTFPHISVFPVSHHDHTWFIDGSST RPNHHSPAKAGYAIVSSTSIIEATALPASTTSQQAKLIALTWALTLAKGLHVNIYTDSKY AFRILHHHAVTWAERNFLTTQESSIIIASLIKTLLKVALLPKEAGVIHCKGHQKASDVIA QGNAYADKQSLNIIISISTDGTEAQRGEGKPKVLQPGCGTIWN >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_3|3732_bp atgaagactgaggtaggaacaggtttgggagaagatgacatgttcccttttagacaagtg gaattatggaagatggcagtgaaaggcaaagaacaagagaagacctcagatgtgaagtcc attaaagcttcaatatccgtacattccccacaaaaaagcactaaaaatcatgccttgctg gaggctgcaggaccaagtcatgttgcaatcaatgccatttctgccaacatggactccttt tcaagtagcaggacagccacacttaagaagcagccaagccacatggaggccgctcatttt ggtgacctgggcagatcttgtctggactaccagactcaagagaccaaatcaagcctttct aagacccttgaacaagtcttgcacgacactattgtcctcccttacttcattcaattcatg gaacttcggcgaatggagcatttggtgaaattttggttagaggctgaaagttttcattca acaacttggtcgcgaataagagcacacagtctaaacacagtgaagcagagctcactggct gagcctgtctctccatctaaaaagcatgaaactacagcgtcttttttaactgattctctt gataagagattggaggattctggctcagcacagttgtttatgactcattcagaaggaatt gacctgaataatagaactaacagcactcagaatcacttgctgctttcccaggaatgtgac agtgcccattctctccgtcttgaaatggccagagcaggaactcaccaagtttccatggaa acccaagaatcttcctctacacttacagtagccagtagaaatagtcccgcttctccacta aaagaattgtcaggaaaactaatgaaaaaaagtcagggctttagattccctatacttcag cacacttcctgtagctatgtcaacctctgtggccacgacttcttcttcttgggactgcag tttctcttgtcagaaagtaggattcttggagctgctgtcattgctgctgtggctgctctg atgctgcctgggagtcgaaggagagaaaggaacaaaacaaaacaacccaggggatttcct ccactctctttgatccgtatagaacaagatgcagtgaatacttttaccaaatatatatct ccagatgctgctaaaccaataccaattacagaagcaatgagaaatgacatcatagccggg ccggcgggcggctacgctaaccggcacagaccaccggatggactggccggcagccccgca ccagtgcacgaagtgggcgggacagaaacttctggggttggaagtccagtgaggctaaaa gccgcaaggatttgtggagaagatggacaggtggatcccaactgtttcgttttggcacag tccatagtctttagtgcaatggagcaagagcactttagtgagtttctgcgaagtcaccat ttctgtaaataccagattgaagtgctgaccagtggaactgtttacctggctgacattctc ttctgtgagtcagccctcttttatttctctgagtacatggaaaaagaggatgcagtgaat atcttacaattctggttggcagcagataacttccagtctcagcttgctgccaaaaagggc caatatgatggacaggaggcacagaatgatgccatgattttatatgacaagtacttctcc ctccaagccacacatcctcttggatttgatgatgttgtacgattagaaattgaatccaat atctgcagggaaggtgggccactccccaactgtttcacaactccattacgtcaggcctgg acaaccatggagaaggtaacccagaacttcaaacgtatcaaactacaagaagttttattg gtagaactcataaaatataaggtgggaaaaccaagcagaatagcacagtggaaattgaag cagtccagcaaagtgattaagagcagaggccttgagtctggcctgaaaacagttaactgt aaaacagcttcagacaggtccttcaggaggtttccagaaggaggcattgttatcaaagga gatgacggctccatgcgtgttactgcccctgaagaccttccagtgggacaagatgtggag tccagtgtgaaaaaagccagtattaaaatactgaaaaattttgatgaagcgataattgtg gatgcggcaagtctggatccagaatctttatatcaacggacatatgccgggaagatgaca tttggaagagtcagtgacttggggcaattcatccgagaatctgagcctgaacctgatgta aggaaatcaaaaggatccatgttctcacaagctatgaagaaatgggtgcaaggaaatact gatgaggcccaggaagagctagcttggaagattgctaaaatgatagtcagtgacattatg cagcaggctcagtatgatcaaccgttagagaaatctacaaaggtgagtcagccccaccga gagccaggcagctttgagtggcagcgtggtgctagcagcttcagcggaacagggatagta agagactcactcactgcacttctgctcagttggcttctgcatcgggatcacacagccatc agcaggactgcccagttgtggctgaagactgacactgcccgatcgccttggaagcctaca ggaccatcacagacgctttgggtgacttacagtggaggttcgaagccttcttcgcatcct ccccttgtatctccccaccttaatccacaagtatgggacacctctactccctccttggtg accgatcatgcacgccttaccatcccattaaaacctaatcacccttactccaatcaacgc caatatcccatcccatggcatgctttaaaaggattaaagcctgttatcactcgcctgcta cagcatggtcttctaaagcctataaactctccttacaattcccccattttacctgtccaa aaaccagacaagtcttacagttcccataactttcaaaatctattttcctcctcacacttg acacatatactttctgccccccagctccttcagctatgctcagtctttgttgagtctccc acaattacaattgttcctggcccagacttcaatccgaactcccacattattatggatatc agaactgacccccatgactgtatctctctgatccacctgacagtcaccacatttccccat atttccgtctttcctgtttctcaccatgatcacacttggtttattgatggcagttccacc aggcctaatcaccactcaccagcaaaggcaggctatgctatagtatcctccacatctatc attgaggctactgctctgcccgcctccactacctctcagcaagccaaactcattgcctta acttgggccctcactcttgcaaagggactacacgtcaatatttatactgactctaaatat gccttccgtatcctgcaccaccatgctgttacatgggcagaaagaaatttcctcactaca caagagtcctccatcattattgcctccttaataaaaacgcttcttaaagtcgctttactt ccaaaggaagctggagtcattcactgcaagggccatcaaaaggcatcagatgtcatcgct cagggcaacgcttatgctgataagcagtcactaaacatcatcatctccatttctacagat ggcacagaggcccagagaggtgaaggaaagcctaaggtcctacagccaggatgtggcaca atttggaactga >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_4|69_aa MTAIIMTEVYAAVGDYGDFRISGGQGSKAANHTGSRGFCIASNPQAVAGSPEEAGTGVQG EAARAGQRL >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_4|210_bp atgacggccattatcatgactgaggtgtatgcagctgtcggggattacggcgacttcaga atttctggtgggcagggctcaaaggcagcaaatcacactggaagtcgagggttctgcatc gcctcgaacccgcaggccgtggcgggttctcctgaggaagcagggactggggtgcagggt gaagctgctcgtgccggccagcgcctgtga >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_5|90_aa MLLMKRTSVGHSQKSGQLGQRPRPPGREWSINTGITSTGASVWKEAVVKVLIWAVLSPLH HMFAFASYIKGWIDPLTEAKSHSKAPAAKV >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_5|273_bp atgcttctcatgaaaaggacctcagtaggacactcacagaagtccgggcaactgggacag agaccacgacccccaggaagggagtggtcaatcaacacagggatcaccagtacaggagcc tctgtttggaaggaagctgtggtaaaagtcctaatttgggctgttctctcacctttgcat catatgttcgcctttgcatcatatatcaaagggtggattgaccctttgacagaagctaag tcacattccaaagccccagctgccaaagtgtga >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_6|422_aa MIKWDLFQKRKSGSTFENKVLPHIQQWLVIKSNDTSGQEVKRMSEDSLADSGVKLQTFAA SVTALKGAHLELFVPPGEFVVSLASGVKLQTFELDIKVLKSPPDSGAQLASPSGSGISAT GGAACQSRALRPHSSALWWSMGLGAVEQGVVLSGRLGPHRSPGGPQLSGATWLSVPGPCD NPRARRGGGGHLPLPPAAPVPPLRAGQDRPGRPSWAGDRTRDEGTGAPAAKTQHAFPERQ SRAPTTDCRTPALARPSLQTTRLRRKEPAGGRAGGAASTHRPRWLSAAATPSGHGSGRSL MLRRGTRAPPAPPAIAARGRARAARGSLGDVVPGPPPAPSSFSRRPGAPSLPALSVSSPY LLILWSHIIDVWASQARCGPHFRTLHWRFSKGGARTSSFSLIWELEKMQILRPTQPIRFF LH >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_6|1269_bp atgatcaagtgggacttattccagaaacgcaagtctggttcaacatttgaaaacaaggta ttgcctcacatacagcagtggctggtcataaagtcaaatgacactagtggccaggaggtc aagagaatgagtgaggacagtctcgctgactcaggagtgaagctgcagaccttcgcggcg agtgttacagctcttaagggggcgcatctagagttgttcgttcctcctggtgagttcgtg gtctcgctagcttcaggagtgaagctgcagaccttcgagctagacataaaggttctcaag tccccaccagactcaggagcccagctggcttcacccagtggatccggcatcagtgccaca ggtggagctgcctgccagtcccgcgccctgcgcccgcactcctcagccctctggtggtcg atgggactgggcgccgtggagcagggggtggtgctgtcagggaggctcgggccgcacagg agcccaggagggccgcagctgtcgggagccacctggctctcagtcccgggtccctgcgac aaccctcgggcccggaggggaggaggcggccacctgccgctgccacctgcggcaccggtc ccaccgctccgggccgggcaggacaggccaggacgtccctcctgggctggggacaggaca cgcgacgaggggaccggggcccccgcggcgaagacgcagcacgccttcccagaaaggcag tcccgtgcccccacgacggactgccggacccccgcgctcgcccgcccatcccttcagacc acgcggctgaggcgcaaagagccggccggcgggcgggctggcggcgcggctagtactcac cggccccgctggctcagcgccgccgcaacccccagcggccacggctccgggcgctcactg atgctcaggagagggacccgcgctccgccggcgcctccagccatcgccgccagggggcga gcgcgagccgcgcggggctcgctgggagatgtagtacccggaccgccgcctgcgccgtcc tccttcagccggcggccgggggccccctctctcccagctctcagtgtctcatctccctat ctgctcatcctctggtcgcacataatcgatgtttgggcgtcccaagccagatgtggaccc catttccgcactctacactggaggttttctaagggtggtgcccggaccagcagcttcagc ctcatctgggaacttgagaaaatgcagattctccgtcccacccagcctattcggtttttc ctgcactaa >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_7|63_aa MRKKADHLQSSPEHWSLKTKECCLLSQKGEKVTQHIFTKLECHALFKGCKSHERHERTQA FTT >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_7|192_bp atgcggaaaaaggcagaccatctgcagtcttctccagaacactggagtctgaagacaaaa gaatgctgcctactgagccagaagggagagaaagtgacccaacacatctttaccaagtta gaatgtcacgcattatttaaaggctgcaaaagccatgaaagacatgaaagaacacaagca tttacaacatga >gi568815581r:19809181_20068463|GENSCAN_predicted_peptide_8|51_aa XLPNPLKHNRKSPGSLFCTYHRGIHSASSKLTLHDTSGLALLRTPVLAAVS >gi568815581r:19809181_20068463|GENSCAN_predicted_CDS_8|156_bp nngcttcctaatcccttgaagcacaatcgaaaaagccctggatctcttttctgcacatat catcgcggaattcattcggcttccagcaagctgacactccatgatacaagcggcctcgcc cttctccggacgccagtccttgctgcggttagctag