GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:51:20 Sequence gi568815593r:93582840_93783717 : 200878 bp : 38.87% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2116 2647 532 0 1 137 63 370 0.896 30.97 1.02 Intr + 5078 5605 528 0 0 79 94 885 0.997 80.18 1.03 Term + 10723 11003 281 2 2 76 41 416 0.959 30.32 1.04 PlyA + 11321 11326 6 1.05 2.08 PlyA - 11361 11356 6 1.05 2.07 Term - 12200 12057 144 2 0 79 53 164 0.919 8.93 2.06 Intr - 13257 13090 168 0 0 67 10 149 0.455 4.22 2.05 Intr - 14275 14114 162 1 0 80 39 82 0.399 1.75 2.04 Intr - 14454 14357 98 1 2 92 86 31 0.837 2.11 2.03 Intr - 15087 14915 173 2 2 36 93 135 0.670 7.46 2.02 Intr - 16961 16860 102 1 0 48 77 87 0.543 1.97 2.01 Init - 21571 21138 434 1 2 67 9 200 0.602 5.99 2.00 Prom - 21846 21807 40 -6.55 3.05 PlyA - 22433 22428 6 1.05 3.04 Term - 29112 29029 84 0 0 116 43 86 0.072 3.57 3.03 Intr - 38290 38152 139 0 1 101 50 70 0.035 4.05 3.02 Intr - 47641 47525 117 0 0 84 91 36 0.211 2.16 3.01 Init - 49846 49839 8 1 2 77 111 0 0.250 1.56 3.00 Prom - 86925 86886 40 -3.25 4.03 PlyA - 88298 88293 6 1.05 4.02 Term - 100456 99998 459 1 0 -33 48 765 0.992 54.50 4.01 Init - 100878 100726 153 0 0 45 16 224 0.355 10.93 4.00 Prom - 104892 104853 40 -4.75 5.00 Prom + 105603 105642 40 -8.85 5.01 Init + 108223 108354 132 0 0 69 13 108 0.598 1.59 5.02 Term + 108400 108798 399 0 0 5 43 317 0.697 13.03 5.03 PlyA + 108905 108910 6 1.05 6.00 Prom + 109664 109703 40 -6.95 6.01 Sngl + 109992 110393 402 2 0 45 48 240 0.987 11.72 6.02 PlyA + 110451 110456 6 1.05 7.00 Prom + 112232 112271 40 -6.25 7.01 Init + 115611 115717 107 2 2 50 72 70 0.337 1.34 7.02 Term + 120378 120450 73 0 1 77 36 148 0.359 4.90 7.03 PlyA + 121565 121570 6 1.05 8.00 Prom + 127785 127824 40 -5.35 8.01 Init + 128700 128853 154 2 1 81 110 60 0.662 7.69 8.02 Term + 152630 152814 185 0 2 49 38 162 0.736 4.02 8.03 PlyA + 153365 153370 6 1.05 9.02 PlyA - 153894 153889 6 1.05 9.01 Sngl - 158652 157738 915 0 0 61 49 611 0.909 48.56 9.00 Prom - 162088 162049 40 -5.35 10.00 Prom + 163891 163930 40 -5.05 10.01 Sngl + 179615 180322 708 1 0 59 38 299 0.808 17.97 10.02 PlyA + 180494 180499 6 1.05 11.00 Prom + 180663 180702 40 -5.75 11.01 Sngl + 182585 182938 354 1 0 60 38 206 0.851 8.60 11.02 PlyA + 184348 184353 6 1.05 12.04 PlyA - 185923 185918 6 1.05 12.03 Term - 187080 186883 198 0 0 83 47 54 0.335 -2.78 12.02 Intr - 190006 189884 123 1 0 98 84 76 0.421 8.06 12.01 Init - 192794 192732 63 2 0 74 57 49 0.340 1.60 12.00 Prom - 200072 200033 40 -3.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_1|446_aa RARAPRGPRRAARLPPALPGPKDMAMVVSSWRDPQDDVAGGNPGGPNPAAQAARGGGGGA GEQQQQAGSGAPHTPQTPGQPGAPATPGTAGDKGQGPPGSGQSQQHIECVVCGDKSSGKH YGQFTCEGCKSFFKRSVRRNLTYTCRANRNCPIDQHHRNQCQYCRLKKCLKVGMRREAVQ RGRMPPTQPNPGQYALTNGDPLNGHCYLSGYISLLLRAEPYPTSRYGSQCMQPNNIMGIE NICELAARLLFSAVEWARNIPFFPDLQITDQVSLLRLTWSELFVLNAAQCSMPLHVAPLL AAAGLHASPMSADRVVAFMDHIRIFQEQVEKLKALHVDSAEYSCLKAIVLFTSDACGLSD AAHIESLQEKSQCALEEYVRSQYPNQPSRFGKLLLRLPSLRTVSSSVIEQLFFVRLVGKT PIETLIRDMLLSGSSFNWPYMSIQCS >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_1|1341_bp cgcgcccgcgcgccccgcggccctcggcgagcagctcggctccccccagcgctccccggg cccaaagatatggcaatggtagttagcagctggcgagatccgcaggacgacgtggccggg ggcaaccccggcggccccaaccccgcagcgcaggcggcccgcggcggcggcggcggcgcc ggcgagcagcagcagcaggcgggctcgggcgcgccgcacacgccgcagaccccgggccag cccggagcgcccgccacccccggcacggcgggggacaagggccagggcccgcccggttcg ggccagagccagcagcacatcgagtgcgtggtgtgcggggacaagtcgagcggcaagcac tacggccaattcacctgcgagggctgcaaaagtttcttcaagaggagcgtccgcaggaac ttaacttacacatgccgtgccaacaggaactgtcccatcgaccagcaccaccgcaaccag tgccaatactgccgcctcaagaagtgcctcaaagtgggcatgaggcgggaagcggttcag cgaggaagaatgcctccaacccagcccaatccaggccagtacgcactcaccaacggggac cccctcaacggccactgctacctgtccggctacatctcgctgctgctgcgcgccgagccc taccccacgtcgcgctacggcagccagtgcatgcagcccaacaacattatgggcatcgag aacatctgcgagctggccgcgcgcctgctcttcagcgccgtcgagtgggcccgcaacatc cccttcttcccggatctgcagatcaccgaccaggtgtccctgctacgcctcacctggagc gagctgttcgtgctcaacgcggcccagtgctctatgccgctgcacgtggcgccgttgctg gccgccgccggcctgcatgcctcgcccatgtctgccgaccgcgtcgtggccttcatggac cacatccgcatcttccaggagcaggtggagaagctcaaggcgctacacgtcgactcagcc gagtacagctgcctcaaagccatcgtgctgttcacgtcagacgcctgtggcctgtcggat gcggcccacatcgagagcctgcaggagaagtcgcagtgcgcactggaggagtacgtgagg agccagtaccccaaccagcccagccgttttggcaaactgctgctgcgactgccctcgctg cgcaccgtgtcctcctccgtcatcgagcagctcttcttcgtccgtttggtaggtaaaacc cccatcgaaactctcatccgcgatatgttactgtctgggagcagcttcaactggccttac atgtccatccagtgctcctag >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_2|426_aa MRGAASPARSQTSTSHPGGARPGAASPRPGRSMSRSPTAPRAGRPSPRLDSDVALGPVTW PRPPVPPRSPLLSPRVSTRGRRGWGGAAWWMRPEDDRGTLALPRLTTQPPNTLGGGPKHL ANQDSQSFEENGPGPPPVFQTKGTKHRISNNHMEVQAPASTEKSFSPPDLKTKGEEAWCR QAQVGSDRGDSRSPSPLNAKLGGRTSWEPTKNLCGCQSLRAWTNWKGGVPKAHGSPGEVW LRAQGDCPAYTCWEPQAPVPRLPLAPRMKVLGHSRNRGLGKHRTLNPGQPGPYWGLSLRA ERKPLTAPPSARRTGHCQPGLGRPTPPQSGSGSQWGSLILLGKRSGVNYGLSPTPPALGV QPAGKSTEALRRYSWAWCRLLQLSKFPHAVKPRLPPPDTEAGCAPGAWCLETYRARDPAY TTPWPN >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_2|1281_bp atgagaggtgcggcttctccagcgcggagccagacaagtacttctcatcccggcggagcg cggcccggcgcggccagcccgaggcccggacgttctatgtcaagaagccccacggccccc cgggctggccgcccctccccgcggctggacagcgacgtcgcgctcgggcctgtcacgtgg ccgcgcccccccgtgcccccgcggagccctttgttaagcccccgggtctccacgcgcggc cgccgtggctggggcggtgccgcatggtggatgcgcccggaagacgatcggggtaccctg gctttgcctcggctcaccacgcagcccccaaacaccttgggcggaggccccaagcacttg gcaaatcaagattctcagtcctttgaggaaaacggccctgggccacctccagtgtttcaa accaaaggcacaaagcaccgaatctccaacaaccacatggaagttcaggctccagcttcc accgagaaatctttttctcctccagatctgaagacaaaaggagaggaggcctggtgcaga caagctcaggtgggctcagatagaggggactccagatccccatcacccctgaatgcaaag ctgggtggtcggacctcttgggagccaaccaagaacctttgtggctgccagagtttaaga gcctggaccaactggaagggtggagtccccaaagctcatggttctcctggagaggtgtgg ctgcgagcccagggcgactgcccggcctacacctgctgggagcctcaagccccggttcca cggctccccctggcgccccgcatgaaggtcctgggccacagccgcaaccgggggctgggg aaacacaggacgctgaacccggggcagcctggcccctactggggcctgtccctccgcgcc gagaggaaaccgctgaccgctccgccctcagcccgcaggactggccactgtcagcccggc ctgggtcggcccaccccaccccaatccggatccggcagccagtggggctccctgatactt cttgggaagcggtcaggggttaactatggactctccccaacccctcctgctttaggtgtc caacctgcagggaagtcaacagaggccctaagacgttattcctgggcgtggtgcagactt ctccagctcagcaagttcccacatgcggtaaagccccggttgccgccgccagacactgaa gccggctgcgcccccggtgcttggtgcctggagacttaccgggcgcgcgaccctgcctac accactccctggcccaactga >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_3|115_aa MNCLLSLVKLDFLPLQSQYNLNFKGLATVLAMERELACILCRHRPSRANFLEELSVYFQI LYRSLRGQDQLPEAGCDAPLPPHQARRATLSPSCAFRHLSLFGSLLFAVALSKVQ >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_3|348_bp atgaattgcctattatctctggtcaaacttgacttcttgcctttacagtctcaatacaat ctcaactttaaagggcttgcaactgtgttagcaatggaaagagaacttgcttgtatctta tgcaggcaccgaccgtcacgagctaacttcctggaagagctttccgtctattttcaaatt ctttaccgaagcctcagaggccaagaccagctccctgaagccggctgtgacgcgccgctc ccaccgcatcaagcacgaagagctaccttaagtccaagctgtgcttttcggcacctctcc ctgtttggctctcttctttttgccgtcgcgttaagcaaagtacaataa >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_4|203_aa MEDSMDMDMSPLRPQNYLFGCELKADKDDHFKVDNDENERQLSLRTASLGLRSAPGGGSK VPQKKVKLAADEDDDDDDEEDDDEDDDDDDFDDEEAEEKAPVKKSIRDTPAKNAQKSNQN GKDSKPSTPRSKGQESFKKQEKTPKTPKGPSSVEDIKAKMQASIEKGGSLPKVEAKFINY VKNCFRMTNQEAIQDLWQWRKSL >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_4|612_bp atggaagattcgatggacatggacatgagccccctgaggccccagaactatcttttcggt tgtgaactaaaggccgacaaagatgatcactttaaggtggataatgatgaaaatgagcgc cagttatccttaagaacggccagtttagggctgcggtctgcccctggaggtggtagcaag gttccacagaaaaaagtaaaacttgctgctgatgaagatgatgacgatgatgatgaagag gatgatgatgaagatgatgatgatgatgattttgatgatgaggaagctgaagaaaaagcg ccagtgaagaaatctatacgagatactccagccaaaaatgcacaaaagtcaaatcagaat ggaaaagactcaaaaccatcaacaccaagatcaaaaggacaagaatccttcaaaaaacag gaaaaaactcctaaaacaccaaaaggacctagttctgtagaagacattaaagcaaaaatg caagcaagtatagaaaaaggtggttctcttcccaaagtggaagccaaattcatcaattat gtgaagaattgcttccggatgactaaccaagaggctattcaagatctctggcagtggagg aagtctctttaa >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_5|176_aa MRKNQCKKAENSKNQNASPPPNDHNSWPAREQNRMENDFDKLTEEHVLAQCKEAKNLEKR LEELLTRINSLEKNINGLMELKSTAQELREAYTSINSQIDQAEGRISGIEHQLNEIKHED KIREKRMKRNKQSLQEIWDYVKRPNLRLTGVPESDGENGTELENTLQDIIQEKFPN >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_5|531_bp atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaaatcagaatgcctctcctcct ccaaatgatcacaactcctggccagcaagggaacaaaacaggatggagaatgactttgac aaattgacagaagagcatgttctagctcaatgcaaggaagctaagaaccttgaaaaaagg ttagaggaattgctaactagaataaacagtttagagaagaacataaatggcctgatggag ctcaaaagcacagcacaagaacttcgtgaagcatatacaagtattaacagccaaattgat caagcagaaggaaggatatcagggattgaacatcaacttaacgaaataaagcatgaagac aagattagagaaaaaagaatgaaaaggaacaaacaaagcctccaagaaatatgggactat gtgaaaagaccaaacctgcgtttgactggtgtacctgaaagtgacggggagaatggaacc gagttggaaaacactcttcaggatattatccaggagaaattccccaactag >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_6|133_aa MEIITNTLSDHSAIKLELRIKKLTQNRTTTWKLNNLLLNDYWVNNKIKAEINTFFETNEN KDTMYQNLWDTVKAVFRGKFIALNAHKRKQERFKLDTLTSQLKELEKQEQTNSKATRRQE ITKIRAELKETET >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_6|402_bp atggaaatcataacgaacactctctcagaccacagtgcaatcaaattagaactcaggatt aagaaactcactcaaaaccgcacaactacatggaaactgaacaacctgctcctgaatgac tattgggtaaataacaaaattaaggcagaaataaatacattctttgaaaccaatgagaac aaagacacaatgtaccagaatctctgggacacagtgaaagcagtgtttagagggaaattt atagcactaaatgcccacaagagaaagcaggaaagatttaaactcgacaccctaacatca caattaaaagaactagagaagcaagagcaaacaaattcaaaagctaccagaagacaagaa ataactaagatcagagcagaactgaaggagacagagacatga >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_7|59_aa MKQNRRRDRNAVSNEAVRIGNTEKVIFEQRSKENEGGCYHGKPSGTASGAAVLWRVSAK >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_7|180_bp atgaaacagaacagaagaagggatagaaatgcagtttcaaatgaggcagtcaggatagga aacaccgagaaggtgatatttgaacaaagatccaaagaaaatgagggaggctgctaccat ggcaaaccttcaggaactgcctctggtgctgccgttctctggcgagtgtcggcgaagtag >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_8|112_aa MASEEVGKSSSTEQLKRWQNCQVQPFQHSRNQPKAKKQTEKCLFKEICCITEAAEELKEQ LRSVPLYSSYVHNTIRVVYSGIQHGLTLLKHLLSKAFQKGCLGKGGNGVGAV >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_8|339_bp atggcaagtgaagaagttggcaaatcctcttccactgaacaactaaaaaggtggcaaaat tgtcaagtacaaccatttcagcactccagaaatcaaccaaaagcaaaaaaacaaactgag aaatgtttattcaaggaaatctgctgtattacagaggctgcagaagaactgaaggagcag ctgaggtcagtgcctctttacagctcttatgttcacaacacaatcagagttgtttattct ggcattcaacatggtctgactctgctgaaacatctcctttctaaggctttccagaagggc tgtctaggaaaaggaggaaatggagttggtgctgtgtga >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_9|304_aa MPLRVDTLTWLSTQAAPGRVMVWPAVRPGICPGPDVWRIPLGPLPHEFRGWIAPCRPRLG ASEAGDWLRRPSEGALPGPYIALRSIPKLPPPEDISGILKELQQLAKELRQKRLSLGYSQ ADVGIAVGALFGKVLSQTTICRFEAQQLSVANMWKLRPLLKKWLKEVEAENLLGLCKMEM ILQQSGKWRRASRERRIGNSLEKFFQRCPKPTPQQISHIAGCLQLQKDVVRVWFYNRSKM GSRPTNDASPREIVGTAGPPCPGAPVCFHLGLGLPVDIPHYTRLYSAGVAHSSAPATTLG LLRF >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_9|915_bp atgcccctgcgggttgacactctgacctggttgagcacccaggcggcccctggcagggtg atggtctggccggcagtcaggccagggatctgcccaggccctgacgtgtggaggattccc ctgggtcccctgccacacgaattccggggctggatagcaccctgcaggccccgtcttgga gctagtgaggcaggggactggttgcgacgcccctccgaaggcgccctcccggggccctac attgccctgcggagcattccgaagttgccgccgccagaggacatctcgggcatactgaaa gagttgcagcaattggccaaggagttgaggcagaagaggttgagcctagggtactcgcag gccgatgtggggatcgctgtgggagctctgtttgggaaggtgcttagccagacgaccatc tgccgcttcgaggcccagcagctaagcgtcgccaacatgtggaagctgcgaccactgctg aaaaagtggctgaaggaagtggaagcagagaaccttctgggcttatgcaaaatggagatg atcctgcaacagtctgggaagtggagacgggcaagcagagagcgacgaatcggaaacagc ctggagaaattcttccagcggtgccctaagcccacaccccagcaaatcagccacattgct gggtgcctccagctgcagaaggatgtggttcgagtttggttctataaccgcagcaagatg ggcagtcgaccaaccaatgatgcttccccacgggagattgtggggacagccgggcctcct tgcccaggagcaccagtgtgctttcacctgggactggggctcccagtggatatcccccac tatacacgtctctactctgcaggggtagcccactcctctgccccagccaccactctgggc ctcctcagattttag >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_10|235_aa MEDQMNEMKQEEKFRDKRVKRNEQSLQEIWDYVKRPNLCLIGVPESDMENGTKLENTLRD IIQEKFPNLARQANIQIQEIQRTPQRYSWRRATPRHIIVRFTKVEMKEKMLRAAREKGWV THKGKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKSFTDK QMLSDFVTNCPKRAPEGSTKHGKEQPVPATAKTCQIVKTIETRKKLHQLTSKITS >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_10|708_bp atggaagatcaaatgaatgaaatgaagcaagaagagaagtttagagacaaaagagtaaaa agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctatgtctg attggtgtacctgaaagtgatatggagaatggaacgaagttggaaaacactctgcgggat attatccaagagaaattccccaatctagcaaggcaggccaacattcaaattcaggaaata cagagaacgccacaaagatactcctggagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagagcagccagagagaaagggtgggtt acccacaaagggaagcccatcagactaacagcggatctcttggcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatcctttacagacaag caaatgctgagtgattttgtcaccaactgccctaaaagagctcctgaaggaagcactaaa catggaaaggaacaaccagtaccagccactgcaaaaacatgccaaattgtaaagaccatc gagactaggaagaaactgcatcaactaacgagcaaaataaccagctaa >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_11|117_aa MSELPFTIASKRIKYLGIQLARDVKDLFKENYKPLLNEMKEDTNKWKNIPCSWVGRINIM KMAIQPKVIYRFNAIPIKLPMTRIGKNYFKVHMEPKKSPHCQVNPTPKEQSWRHHTT >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_11|354_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt gcaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaatgaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaacatcatg aaaatggccatacagcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacat tgccaagtcaatcctacgccaaaagaacaaagctggagacatcacactacctga >gi568815593r:93582840_93783717|GENSCAN_predicted_peptide_12|127_aa MDLKMFTGNKGKTGLPIEREELAHCDGNQLPGCKLPCGEAHMARNWYIRPKAREDLRSTN SHTELATECNFSFPVGFCSNKFGLNQKEIVHTFPSTYPFITKVFIDIYCVPGTVLEAGDI AVNKSSS >gi568815593r:93582840_93783717|GENSCAN_predicted_CDS_12|384_bp atggatctgaagatgttcacagggaacaaaggcaagactggactgcctattgaaagggaa gagcttgctcactgtgatggaaaccagctgccaggttgtaagctgccctgtggagaggcc cacatggcaaggaactggtatatcaggccaaaagccagggaggacctgaggtccaccaac agccatacagaacttgctactgagtgtaacttttcattccctgtgggattttgttctaac aaatttggcctaaatcagaaggaaattgtacatacattccccagcacatatccatttatc actaaagtatttattgacatttactgtgttccaggcactgttctggaagctggagacata gcagtgaacaagagctcatcatga