GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:23:51 Sequence gi568815596r:75564669_75810855 : 246187 bp : 39.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 2181 2176 6 1.05 1.04 Term - 3218 3047 172 1 1 76 38 118 0.400 1.92 1.03 Intr - 5331 5204 128 0 2 24 75 84 0.226 -0.74 1.02 Intr - 6482 6276 207 2 0 101 84 140 0.745 13.25 1.01 Init - 15346 15224 123 1 0 94 59 43 0.307 2.22 1.00 Prom - 26427 26388 40 -3.75 2.09 PlyA - 27821 27816 6 1.05 2.08 Term - 33154 32890 265 0 1 67 39 266 0.787 13.60 2.07 Intr - 33956 33695 262 0 1 64 -3 216 0.356 5.72 2.06 Intr - 34410 34268 143 2 2 4 72 64 0.244 -4.52 2.05 Intr - 36081 35741 341 0 2 116 62 205 0.538 14.05 2.04 Intr - 44180 44054 127 1 1 143 89 6 0.004 5.96 2.03 Intr - 44713 44532 182 1 2 22 2 217 0.001 4.24 2.02 Intr - 51011 50824 188 0 2 56 44 98 0.001 0.69 2.01 Init - 56896 56824 73 1 1 100 82 71 0.182 8.98 2.00 Prom - 64558 64519 40 -6.55 3.00 Prom + 68474 68513 40 -5.45 3.01 Init + 82140 82242 103 2 1 91 92 96 0.699 9.70 3.02 Intr + 82434 82551 118 1 1 121 94 358 0.977 38.30 3.03 Intr + 87474 87592 119 0 2 109 57 54 0.998 3.59 3.04 Intr + 87855 87989 135 1 0 70 115 153 0.949 15.82 3.05 Intr + 90068 90249 182 0 2 72 64 226 0.945 17.27 3.06 Term + 90396 90617 222 2 0 83 37 197 0.986 10.03 3.07 PlyA + 90759 90764 6 1.05 4.00 Prom + 115381 115420 40 -5.45 4.01 Init + 117815 118042 228 1 0 88 57 211 0.436 16.42 4.02 Term + 118748 118891 144 1 0 46 39 195 0.458 7.33 4.03 PlyA + 118988 118993 6 1.05 5.14 PlyA - 120099 120094 6 1.05 5.13 Term - 123309 123145 165 0 0 29 37 145 0.667 0.43 5.12 Intr - 124557 124358 200 1 2 80 63 105 0.907 5.35 5.11 Intr - 125413 125301 113 0 2 26 86 98 0.971 2.50 5.10 Intr - 126051 125970 82 1 1 49 77 106 0.948 3.38 5.09 Intr - 127432 127309 124 1 1 55 37 79 0.855 -1.26 5.08 Intr - 129759 129573 187 2 1 66 83 114 0.979 7.47 5.07 Intr - 131647 131532 116 1 2 90 71 69 0.975 3.73 5.06 Intr - 136619 136522 98 0 2 59 74 95 0.975 4.01 5.05 Intr - 137755 137612 144 2 0 87 44 166 0.416 11.43 5.04 Intr - 141983 141855 129 0 0 31 92 121 0.801 6.55 5.03 Intr - 146176 145592 585 2 0 30 -40 494 0.803 22.49 5.02 Intr - 147374 147099 276 0 0 69 68 307 0.974 23.37 5.01 Init - 148167 148065 103 0 1 60 68 120 0.649 5.66 5.00 Prom - 167541 167502 40 -4.75 6.03 PlyA - 167685 167680 6 1.05 6.02 Term - 169256 169063 194 0 2 21 40 166 0.107 1.70 6.01 Init - 174460 174388 73 1 1 77 87 49 0.417 5.00 6.00 Prom - 196877 196838 40 -1.45 7.03 PlyA - 198355 198350 6 1.05 7.02 Term - 200452 200247 206 2 2 92 49 130 0.908 6.05 7.01 Init - 202640 202550 91 2 1 64 63 68 0.366 2.60 7.00 Prom - 204768 204729 40 -5.55 8.00 Prom + 212280 212319 40 -6.25 8.01 Init + 214795 214905 111 0 0 36 91 87 0.655 4.06 8.02 Intr + 223505 223637 133 1 1 61 86 163 0.406 12.70 8.03 Intr + 235408 235519 112 2 1 33 90 121 0.234 5.32 8.04 Term + 243042 243132 91 0 1 86 52 75 0.168 -0.09 8.05 PlyA + 243865 243870 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 44527 44673 147 2 0 55 109 154 0.917 13.49 S.002 Init - 72953 72891 63 2 0 54 66 79 0.868 3.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_1|209_aa MSTARLLPTFTQGPRALQSACGKCCQPGSLPSEQEAPLWLRDLAATQLMTCNFYLPPPPN HAFSFLTGWAKIKEIKIIHEKKWVSAAAANTLRQQPKSAAQVSSSNKLGHVECANLRQRD EQEPIDLTGLGYGRRNRVQSSTVKMCHRCPGLRIKILDSSFTSTVDPKLHQMCSDTGDRL VAGRGHWFKQWKSLRFGHWGHEHIAIQLV >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_1|630_bp atgagtactgccagactactgccaacgttcactcaaggcccaagggctcttcagtcagct tgtggtaaatgctgccagcctgggtctctcccctcagagcaggaggctcctctgtggctc agggatttagctgccactcagctcatgacctgtaacttctatctcccgccccctcccaac cacgctttcagttttctcacaggctgggcaaaaattaaagagattaaaatcatccatgag aaaaagtgggttagtgcagctgctgcaaataccctgcgtcagcagcccaagtcagcagct caagtcagcagctcaaataaacttggccatgtggagtgtgcaaacctgagacagagagat gaacaagaacctattgacctcacaggactcggatatgggagacgtaacagggtgcagagc tctacagtgaaaatgtgtcacagatgcccaggactcaggatcaaaatcttagattcttca tttacatctacagtagatcccaaactccatcagatgtgttcagacacaggagaccgactg gttgctggtcgcggacactggtttaagcaatggaaatcactccgatttgggcactgggga catgaacacattgccatccagttggtttga >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_2|526_aa MKNLGSLGGALDFTWHHVEAEPLESNSPEQGQEEETCGVDSRLDVHICPSCEDHYVSTLF FAAQNICLILTATPGMGAFMPFLMMRPQVVIRKPASPRDSPVLEGPGVCENRDALSAAAR RRHTVAPLDGPTVFHKAAFVRIPAQPPRLAPLMEARLVFPLVYKPKKLALFLPGKGTTDL IFKSFASTPLQALVRPAATSQSQQGYYKMLGPAASLNSALQYSIDKANKASLFVLRMSSE MSWQFCSNTEELHFSASLNYSCMSIQVLSTEYTSSPSYPIPPTLRQSVKPAFKYQRKLQI LSRRLSGSRAHSLNPFIPLASCVDLFLKSPPLVTRFAYELLPMRKGKNDVEGEKPEAHRS EQGLWWIKNDSELAQVHRAALHGGRPVGVSVGLEDLGLIIGRQGWGWLLLRGTAALERWL VVGGQRVGILVGFAEGGRGPYGTSGFHFLLWESGLSEPKCHTARKLQQPQRHMWEGVRVP GQQPLPTVSILELTRKWILQLHSRLPCWKYMEQEQPSAQSSPQNAD >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_2|1581_bp atgaagaatcttgggagtcttggaggggcactggacttcacatggcatcatgtagaagca gaacctttggaatcaaatagcccagagcaggggcaggaagaagaaacctgtggtgtggac tccaggcttgatgttcacatctgcccttcctgtgaggaccactatgtctctacactgttc tttgctgcacaaaacatttgtttaatccttacagctactcctggaatgggcgcatttatg ccctttttaatgatgaggccacaagtagtgatcagaaaaccagcatctcctcgagactca cctgtcttagaaggtccaggtgtgtgtgagaaccgggatgcgctgtctgccgctgcccgg cgccgccacacggtggcgccgttggacggtcccacagtcttccacaaagcggcttttgtc cgtatccccgcacagccgccgaggttagcacccctcatggaagccaggctggttttccct ctggtctacaagccaaagaagctggcgctattcttaccaggaaaggggactacagattta atttttaaatcttttgcaagcacccctcttcaagcactggtgagacctgcagccacctct cagagtcagcagggctattacaagatgttggggccagccgcctccttaaactcagcacta cagtacagcattgataaagcaaacaaggcaagcttgtttgtacttcgtatgtcttcagaa atgagctggcagttttgtagtaacactgaggaattacactttagtgcatcactgaactat tcctgcatgtccatacaagtgttgtccacagagtacacatcttccccgtcctatcccata cccccaaccctaagacagtctgtcaagccagctttcaagtatcagagaaagctgcagatt ctatctagaagattgtctggatctagagcccactcattgaaccctttcattccacttgcg agctgtgttgatctctttttgaaatctcctccactggttacacgttttgcttatgagtta ttgcccatgagaaagggcaagaatgatgtggaaggtgagaagcctgaggcccacaggtca gagcaaggattatggtggattaaaaatgacagcgaattggcccaggtacacagggctgcc ctacatggtgggcgtcctgtaggcgtctcagttggcttggaggacctggggcttataatt gggcgccagggttgggggtggctgcttctgcggggcaccgcagccttggagaggtggctg gtagttggagggcagcgagtaggcattctggtgggctttgcggaaggagggcgggggcct tacgggaccagcggcttccacttcctgctctgggaatctggactctcggaacccaaatgc catactgcgaggaagcttcagcagccccagagacacatgtgggaaggagttagggtccct ggtcagcagcctctgccgacagtcagcattttggagctaactaggaagtggatcctccag ctccattcaagacttccttgttggaaatacatggaacaagaacagccgtctgctcagagt tctccccaaaatgcagattaa >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_3|292_aa MAACIAAGHWAAMGLGRSFQAARTLLPPPASIACRVHAGPVRQQSTGPSEPGAFQPPPKP VIVDKHRPVEPERRFLSPEFIPRRGRTDPLKFQIERKDMLERRKVLHIPEFYVGSILRVT TADPYASGKISQFLGICIQRSGRGLGATFILRNVIEGQGVEICFELYNPRVQEIQVVKLE KRLDDSLLYLRDALPEYSTFDVNMKPVVQEPNQKVPVNELKVKMKPKPWSKRWERPNFNI KGIRFDLCLTEQQMKEAQKWNQPWLEFDMMREYDTSKIEAAIWKEIEASKRS >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_3|879_bp atggcggcctgcattgcagcggggcactgggctgcaatgggcctaggccggagtttccaa gccgccaggactctgctccccccgccggcctctatcgcctgcagggtccacgcggggcct gtccggcagcagagcactgggccttccgagcccggtgcgttccaaccgccgccgaaaccg gtcatcgtggacaagcaccgccccgtggaaccggaacgcaggttcttgagtcctgaattc attcctcgaaggggaagaacagatcctctgaaatttcaaatagaaagaaaagatatgtta gaaaggagaaaagtactccacattccagagttctatgttggaagtattcttcgtgttact acagctgacccatatgccagtggaaaaatcagccagtttctggggatttgcattcagaga tcaggaagaggacttggagctactttcatccttaggaatgttatcgaaggacaaggtgtc gagatttgctttgaactttataatcctcgggtccaggagattcaggtggtcaaattagag aaacggctggatgatagcttgctatacttacgagatgcccttcctgaatatagcactttt gatgtgaatatgaagccagtagtacaagagcctaaccaaaaagttcctgttaatgagctg aaagtaaaaatgaagcctaagccctggtctaaacgctgggaacgtccaaattttaatatt aaaggaatcagatttgatctttgtttaactgaacagcaaatgaaagaagctcagaagtgg aatcagccatggcttgaatttgatatgatgagggaatatgatacttcaaaaattgaagct gcaatatggaaggaaattgaagcgtcgaaaaggtcttga >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_4|123_aa MGRNQCKKAENSKNQNARSSKNHNSSPAREQNRMENEFDEVTEVGFRRWVITNSSELKEH VLTQWKEAKNLEKRLEACLTRAPEGSAKHGKVQLVPDAAKTYRIVKTINAMKKLRQLMSK MTS >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_4|372_bp atggggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaatgcccgttcttca aagaatcacaactcatcaccagcaagggaacaaaaccggatggagaatgagtttgacgaa gtgacagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaggagcat gttctaacccaatggaaggaagctaagaaccttgaaaaaaggttagaggcctgccttaca agagctcctgaaggaagcgctaaacatggaaaggtacaactggtaccagacgctgcaaaa acataccgaattgtaaagaccatcaacgctatgaagaaactgcgtcaactaatgagcaaa atgaccagctag >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_5|773_aa MKLRTLAVSVTALKVARLGSAPSDVQMCLEFLPSDSGAQLASPSGSRTGAAGGAACQSGA VRSYSSALGWSMGLGAVEQGVVLVGDARAAQEPMEWMGGSGMAGCRSRALHRGKAAKARR EIEHSAGRKGLFGSARLIPATAMAPRSRLLSLGRRGNFRSRVLRRKSRPLEEAARRWRDC PTGFGALVAGAGSGRAPGVPPKRLPARTKAQVSRETRGSGEVTAQGGKGPRGLWILLWGF YNLLPTTSHAESYPERLIQKVVSEKIARALARYLHLPFGLLGGNKTIVYSWEALRFHHQL TPSVITNTVIKVYEEPKLSQQKSRTLDVSTDEEDKIHHSSESKDDQGLSSDSSSSLGEKE LSSTVKIPDAAFIQAARRKRELARAQDDYISLDVQHTSSISGMKRESEDDPEISRNEETS EESQEDEKQDTWEQQQMRKAVKIIEERDIDLSCGNGSSKVKKFDTSISFPPVNLEIIKKQ LNTRLTLLQETHRSHLREYEKYVQDVKSSKSTIQNLESSSNQALNCKFYKSMKIYVENLI DCLNEKIINIQEIESSMHALLLKQAMTFMKRRQDELKHESTYLQQLSRKDETSTSGNFSV DEKTQWILEEIESRRTKRRQARVLSGNCNHQEGTSSDDELPSAEMIDFQKSQGDILQKQK KVFEEVQDDFCNIQNILLKFQQWREKFPDSYYEAFISLCIPKLLNPLIRVQLIDWNPLKL ESTGLKEMPWFKSVEEFMDSSVEDSKKESSSDKKVLSAIINKTIIPRLTGTAS >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_5|2322_bp atgaagctgcggaccctcgcggtgagcgttacagctcttaaggtggcgcgtctggggtct gccccttctgatgttcagatgtgtttggagtttcttccttctgactcaggagcccaactg gcttcacctagtggatcccgcaccggggctgcaggtggagctgcctgccagtccggcgcc gtgcgctcgtactcctcagcccttgggtggtcgatgggactgggcgccgtggagcagggg gtggtgctcgtcggggatgctcgggcagcacaggagcccatggagtggatgggaggctca ggcatggcgggctgcaggtcccgagccctgcaccgcgggaaggcagctaaggcccggcga gaaatcgagcacagcgccggccgaaaaggacttttcggcagcgcgcggctgattccagcg acagcgatggcgccgaggagtcgcctgctgagcctggggcgccgagggaacttccggtcc cgggttctgcggaggaagagccgccctctggaggaggccgcgcgcaggtggcgggactgc cccaccgggttcggggccctcgtggccggggccgggtctgggcgagctcccggcgtgcca ccaaagcggctccccgcgcggacgaaggctcaggtgtccagggagacgcggggaagcggg gaggtgacggcacagggcgggaaggggccgcggggtctctggatccttctctgggggttc tataacttgctgccgaccacaagccatgctgagtcttatccagaaagacttatccagaag gtagtgagcgagaaaatagctcgagcacttgcccgctatctgcatctgccttttggactc cttgggggaaataaaacaatcgtgtacagttgggaggctcttcgttttcaccatcagttg actccatcagttataactaacacagttatcaaagtttatgaagaacctaaactgtcacaa caaaaatccagaacccttgatgtgtccacagatgaagaggataaaatacatcactcctca gaaagtaaggatgatcagggtttgtcttctgacagttctagctctcttggagaaaaagaa ctttcatcaacagttaagatcccagatgcagcttttattcaggcagcccgcagaaaacgt gaattggccagggcccaagatgactatatttctttggatgtacaacatacctcctccatc tctggtatgaagagagagagcgaagatgaccctgagataagcagaaatgaagaaacaagt gaagaaagtcaggaagatgaaaagcaagatacttgggaacaacagcaaatgaggaaagca gttaaaatcatagaggaaagagacatagatctttcctgtggcaatggatcttcaaaagtg aagaaatttgatacttccatttcatttccgccagtaaatttagaaattataaagaagcaa ttaaatactagattaacattactacaggaaactcaccgctcacacctgagggagtatgaa aaatacgtacaagatgtcaaaagctcaaagagtaccatccagaacctagagagttcatca aatcaagctctaaattgtaaattctataaaagcatgaaaatttatgtggaaaatttaatt gactgccttaatgaaaagattatcaacatccaagaaatagaatcatccatgcatgcactc cttttaaaacaagctatgacctttatgaaacgcaggcaagatgaattaaaacatgaatca acgtatttacaacagttatcacgcaaagatgagacatccacaagtggaaacttctcagta gatgaaaaaactcagtggattttagaagagattgaatctcgaaggacaaaaagaagacaa gcaagggtgctttctgggaattgtaaccatcaggaaggaacatctagtgatgatgaactg ccttcagcagagatgattgacttccaaaaaagccaaggtgacattttacagaaacagaag aaagtttttgaagaagtgcaagatgatttttgtaacatccagaatattttgttgaaattt cagcaatggcgagaaaagtttcctgactcctattatgaagctttcattagtttatgcata ccaaagcttttaaatcccctaatacgagttcagttgattgattggaatcctcttaagttg gaatccacaggtttaaaagagatgccatggttcaaatctgtagaagaatttatggatagc agtgtggaagattcaaagaaggaaagtagttcagataaaaaagtcttgtctgcaatcatc aacaaaacaattattccccgacttacaggtactgcttcgtaa >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_6|88_aa MQALAVPPTREHVCWRWLFYRSDAGSNVQLGPWLQRVEAPSLGIFHMPGCPGKSLLQGQG LMENLCYGNAEGKCGVGAPTQSPYWGTA >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_6|267_bp atgcaggctctggctgttccaccaacccgggagcatgtctgctggaggtggctgttttac aggtcagatgcagggtccaatgtacagcttgggccatggctccagagggtggaagcccca agccttggcatcttccacatgcctggatgcccaggcaaaagtttgctgcaggggcagggc ctcatggagaacctctgctacggcaatgcagaagggaaatgtggggtgggagcccccaca cagagtccctactggggcactgcctag >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_7|98_aa MGKSQSIRGGEEQQEQSAQGPADCFLPREQGAKDEVIQESDADITSTKCDKGCKTESSAI GTQRRESLTLPNTERESFRKLNGKWNSKKNIVYNKLEV >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_7|297_bp atgggtaagtcacagagtataagaggaggagaagaacagcaggagcagtcagcacaggga cctgccgactgctttcttcccagagaacaaggggctaaggatgaagtgatacaagagtca gatgcagacataacttcaacaaaatgtgataaaggctgtaagacagaatcaagtgctata gggacacaaaggagagaaagcctcactctgcctaacaccgagagggagtctttcaggaag ttaaacggtaaatggaattccaagaaaaacattgtatacaataaactggaagtgtga >gi568815596r:75564669_75810855|GENSCAN_predicted_peptide_8|148_aa MTEAQVKETWVSESLRGRLPTEQLIGLYVRENPKSVVGRKIQYKDIVMHKTEQQNVRTDF RNMAAGGQSHWMWQSDMDEVAAANIQCGEQHPPLQLLGTEIEEALQAQRLTSADPLGSRA RKAINAYASNYLIRWVAVYLEEVTLEQK >gi568815596r:75564669_75810855|GENSCAN_predicted_CDS_8|447_bp atgacagaggcacaggtgaaagaaacctgggtctctgaatcactacgtggaaggctgcct acggaacaacttattggactgtacgtgagggagaacccaaagtctgttgtggggagaaaa atccagtataaggacattgtgatgcacaaaactgagcagcagaatgtaagaactgatttc aggaacatggcagcaggtggccagtctcactggatgtggcaatcggatatggatgaagtg gcagcagctaacattcagtgtggagagcagcaccctcccctccagcttttgggaacagaa atagaggaagcgttgcaggcacaacgtctcacatcagcagatcctctgggttcaagggcg agaaaggctattaacgcatatgcaagtaattacctcatacgttgggtagcagtctacctt gaggaggtgacacttgagcagaaatga