GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:28:13 Sequence gi568815595r:48194640_48398524 : 203885 bp : 45.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2520 2568 49 2 1 123 79 46 0.345 5.45 1.02 Term + 16077 16402 326 1 2 43 55 193 0.034 6.43 1.03 PlyA + 18435 18440 6 1.05 2.04 PlyA - 18970 18965 6 -0.45 2.03 Term - 19620 19596 25 2 1 124 48 3 0.643 -2.40 2.02 Intr - 20822 20668 155 2 2 114 16 118 0.529 6.07 2.01 Init - 21224 21021 204 2 0 72 96 102 0.862 8.25 2.00 Prom - 22392 22353 40 -7.66 3.00 Prom + 28821 28860 40 -6.16 3.01 Init + 28873 29073 201 0 0 100 89 402 0.809 38.38 3.02 Intr + 29715 29822 108 2 0 71 60 125 0.997 8.48 3.03 Intr + 29964 30035 72 2 0 77 113 71 0.981 8.10 3.04 Term + 30654 30785 132 2 0 64 36 108 0.686 1.29 3.05 PlyA + 30825 30830 6 1.05 4.00 Prom + 37247 37286 40 -5.96 4.01 Init + 39835 39966 132 0 0 49 110 182 0.527 16.71 4.02 Intr + 61982 62104 123 1 0 99 8 84 0.016 2.28 4.03 Intr + 66174 66300 127 2 1 34 82 34 0.015 -2.35 4.04 Intr + 73276 74132 857 2 2 99 -6 260 0.077 8.78 4.05 Term + 74274 75143 870 2 0 53 45 406 0.118 25.44 4.06 PlyA + 78796 78801 6 1.05 5.11 PlyA - 80246 80241 6 1.05 5.10 Term - 94210 94152 59 2 2 123 39 23 0.120 -1.25 5.09 Intr - 100164 100042 123 1 0 112 22 123 0.266 8.66 5.08 Intr - 100596 100436 161 2 2 88 100 201 0.995 20.93 5.07 Intr - 101519 101480 40 0 1 99 80 6 0.883 -1.82 5.06 Intr - 102190 102088 103 1 1 39 97 98 0.664 5.55 5.05 Intr - 124687 124641 47 2 2 53 100 33 0.016 -0.87 5.04 Intr - 133752 133636 117 1 0 39 97 88 0.340 5.24 5.03 Intr - 134589 134514 76 0 1 55 103 43 0.004 1.59 5.02 Intr - 137622 137478 145 2 1 8 15 160 0.033 0.98 5.01 Init - 155042 154987 56 2 2 83 113 33 0.543 6.06 5.00 Prom - 165433 165394 40 -3.16 6.00 Prom + 171166 171205 40 -2.36 6.01 Init + 178129 178218 90 0 0 55 98 114 0.920 9.62 6.02 Intr + 183744 183887 144 2 0 60 115 58 0.753 6.18 6.03 Intr + 184713 184919 207 2 0 97 96 5 0.506 1.47 6.04 Intr + 186063 186273 211 2 1 118 102 31 0.904 5.99 6.05 Intr + 187061 187239 179 0 2 16 68 198 0.956 10.24 6.06 Intr + 201147 201173 27 2 0 121 91 27 0.007 4.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 62299 62057 243 1 0 89 41 175 0.834 6.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:48194640_48398524|GENSCAN_predicted_peptide_1|124_aa GDQDPTHVSSGLAVEDNMKRSRCRDRPQPPPPDRQEDGVQRAAELSQSLPPRRRAPPGRQ RLEERTGPAGPEGKEQPPALASQSAEIAASARLPPRLGSEECLCLAAHRLGCEEPLCLAA QSGK >gi568815595r:48194640_48398524|GENSCAN_predicted_CDS_1|375_bp ggggatcaagaccccactcatgtgagctcaggcttggccgtggaagacaatatgaagcgg agccgctgccgcgaccgaccgcagccgccgccgcccgaccgccaggaggatggagttcag cgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgcccgggaggcag cggctggaggagcggacgggccccgcggggcccgagggcaaggagcagccgcctgccttg gcctcccaaagtgccgagattgcagcctctgcccggctgccaccccgtctgggaagtgag gagtgtctctgcctggccgcccatcgtctgggatgtgaggagcccctctgcctggctgcc cagtctggaaagtga >gi568815595r:48194640_48398524|GENSCAN_predicted_peptide_2|127_aa MRATDKSLEQGPLSRKRQNQDLDSMPPPTNLAVLLMGTCGRIQALPPALCSKLEMQMWAA MRVEIWDRQVKNCRGYFSAAEDPKKLPICESQNFAGVSKSLEKEWRAEEWVQKPELQAGR SREPSEA >gi568815595r:48194640_48398524|GENSCAN_predicted_CDS_2|384_bp atgagggccacagataaatcacttgaacaaggacccctatccagaaaacggcagaaccag gacttggactcaatgcccccacccaccaaccttgctgtccttctcatggggacatgtgga agaattcaagccctgcctcctgccctctgcagcaaactggaaatgcagatgtgggctgcc atgagggtggaaatatgggacaggcaagtgaagaactgccgggggtacttcagtgctgct gaggaccccaagaagttgcccatctgtgaatcccaaaactttgctggggtgagcaaatcc ctggagaaggagtggagggcagaggagtgggtccagaagccagaacttcaggctgggaga tcacgcgagccctcagaggcttaa >gi568815595r:48194640_48398524|GENSCAN_predicted_peptide_3|170_aa MKTQRDGHSLGRWSLVLLLLGLVMPLAIIAQVLSYKEAVLRAIDGINQRSSDANLYRLLD LDPRPTMDGDPDTPKPVSFTVKETVCPRTTQQSPEDCDFKKDGLVKRCMGTVTLNQARGS FDISCDKDNKRFALLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES >gi568815595r:48194640_48398524|GENSCAN_predicted_CDS_3|513_bp atgaagacccaaagggatggccactccctggggcggtggtcactggtgctcctgctgctg ggcctggtgatgcctctggccatcattgcccaggtcctcagctacaaggaagctgtgctt cgtgctatagatggcatcaaccagcggtcctcggatgctaacctctaccgcctcctggac ctggaccccaggcccacgatggatggggacccagacacgccaaagcctgtgagcttcaca gtgaaggagacagtgtgccccaggacgacacagcagtcaccagaggattgtgacttcaag aaggacgggctggtgaagcggtgtatggggacagtgaccctcaaccaggccaggggctcc tttgacatcagttgtgataaggataacaagagatttgccctgctgggtgatttcttccgg aaatctaaagagaagattggcaaagagtttaaaagaattgtccagagaatcaaggatttt ttgcggaatcttgtacccaggacagagtcctag >gi568815595r:48194640_48398524|GENSCAN_predicted_peptide_4|702_aa MNTVAADYTGLTAPFMTLVAAHEKRPFKDEHRWNRQVKQTHAGQACDLSGQHASMGLDEL AKQQNIIIVLLVVQVPVADGALARGGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLEN LRNLVSLAESKPEVHTCPSCPLAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQP QSQHPSDKNHRGAEAEDQRVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEI QLSPAQNASSEEVDRISKRAETPGFGAVTFGECALAFNQKSNLFRQKAVTAEKSSDKRQS QVCRECGRGFSRKSQLIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCS HCGRGFSCKPYLIRHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYWTHSEVK PHVCEECGHGFSQKSSLKSHRRTHSGEKPYVCGECGRGFSRRIVLNGHWRTHTGEKPYTC FECGRNFSLKSALSVHQRIHSGEKPYACTECGQGFITKSQLIRHQRTHTGEKPYVCGECG RGFIAQSTLHYHRSTHSKEKPYVCSQCGRGFCDKSTLLAHEQTHSGEKPYVCGECGRGFG RKILLNRHWRTHTGEKPYACIECGRNFSHKSTLSLHQRIHSGEKPYACVECGQSFRRKSQ LIIHQKIHSGKSFRGARSEDVILATSQPSATPAEMLREKPCL >gi568815595r:48194640_48398524|GENSCAN_predicted_CDS_4|2109_bp atgaacactgtggcagcagactacactggactcacagccccgttcatgactcttgtggct gcccatgaaaaacgaccattcaaagatgaacatcgatggaatcgacaagttaaacagacc catgctggccaggcctgtgatctttcggggcagcatgcctccatggggctggatgaactg gctaagcagcaaaacatcatcatagttttacttgtggttcaggttccagtggcagatggg gcactggccagagggggaccagtgactttcgaggatgtggctgtgcttttcactgaggca gagtggaagagactgagccttgagcagaggaacctatacaaagaagtgatgctggaaaat ctcaggaatctggtctcattggcagaatcaaagccagaagtccatacctgcccttcttgc cctctggcctttggcagtcagcagttcctcagccaagatgagctacacaatcatcctatt ccaggtttccatgcaggaaatcaactccacccaggaaatccctgcccagaggatcagcca cagtcacaacatccttctgataaaaatcacaggggggctgaagcagaagatcaacgagtg gaaggaggcgtcagacccttgttttggagtacaaatgaaaggggggctttagtgggtttc tctagcctgttccagagaccaccaataagctcttggggaggcaacagaatattagagata cagctcagtccagcccagaatgcaagctctgaggaagtagacagaatttccaagagggca gaaaccccagggtttggagcagtcacgtttggggagtgtgcactagcttttaaccagaag tcaaacctgttcagacagaaggcagtcacagcagaaaaatcttcagacaaaaggcagtca caggtgtgcagggagtgtgggcgaggctttagcaggaagtcacagctcatcatacaccag aggacacacacaggagaaaagccttatgtctgcggagagtgtgggcgaggctttatagtt gagtcagtcctccgcaaccacctgagtacacactccggggagaaaccttatgtgtgcagc cattgtgggcgaggctttagctgcaagccatacctcatcagacatcagaggacacacaca agggagaaatcgtttatgtgcacagtgtgtgggcgaggctttcgtgaaaagtcagagctc attaagcaccagagaattcacacgggggataagccttattggacacattcagaggtgaaa cctcacgtgtgtgaggagtgtgggcatggatttagccagaagtcgtcgctcaaatcacat cggagaacacactcaggggagaagccttatgtgtgtggggaatgtgggcggggatttagc cggaggatagtcctcaatggacactggaggacacacacgggagagaagccttacacgtgc tttgagtgtgggcgaaactttagcctcaagtccgctcttagtgtacatcagaggatacac tctggggagaagccttatgcatgcacggagtgtgggcaaggctttatcacgaaatcacag ctcatcagacaccagaggacacacacaggagaaaagccttatgtctgcggagagtgtggg cgaggctttatagctcagtcaaccctccactaccaccggagtacacactccaaggaaaaa ccttatgtgtgcagccagtgtgggcgaggcttttgtgataaatcaactctcctcgcacac gagcagacacattcaggggagaagccttatgtgtgtggggaatgtgggcggggatttggc cggaagatactcctcaacagacactggaggacacacacaggagagaaaccttacgcatgc atcgagtgtgggcgaaactttagccacaagtccactctcagcttacatcagaggatacac tcgggggagaagccttatgcatgcgtggagtgtgggcaaagctttaggagaaagtcacag ctcatcatacaccagaagatacactcggggaaaagctttagaggtgcaaggagtgaggat gtgattttagcaacaagtcagccatcagccacaccagcggaaatgcttagggagaagcct tgtttgtaa >gi568815595r:48194640_48398524|GENSCAN_predicted_peptide_5|308_aa MHSLITPVIQVDHRHPPATDQKEQAELDKRDKKKATALVMAFRQVDFGGSGKVKGWANRM PNRDLIPFIMNKLSTKTILFSQFLRATMGVTADVCPTSKVAYFHSSSVTMKGICSDAILV LATSMWMAFAIGTCDPIQTMQKQKSTKAVHQQILSNKFLIVRMRELLWRKEDCQRFYREH EGRFFYQRLVEFMASGPIRAYILAHKDAIQLWRTLMGPTRVFRARHVAPDSIRGSFGLTD TRNTTHGSDSVVSASREIAAFFPDFSEQRWYEEEEPQLRCGPVCYSPEGGPNLNVLSIGR PSYILQSN >gi568815595r:48194640_48398524|GENSCAN_predicted_CDS_5|927_bp atgcacagcctcattactcctgtcatacaagtggaccatcgtcatcctccagccacagat cagaaggagcaggcagaactggacaaacgggataaaaagaaggccaccgctttagtcatg gccttcaggcaagtggactttggaggctctggaaaagtgaaaggctgggcaaatcgaatg cctaatagggatttgattccattcattatgaacaaactatcaacaaaaaccatcctgttc agccagtttctcagagcaaccatgggagtcacagcagatgtttgtcctacatcaaaagtt gcatatttccacagttcttctgtcaccatgaaggggatctgctcagacgccatccttgtt ctagctacctccatgtggatggcctttgcaattggtacatgtgatcctattcagacaatg cagaaacagaaatccactaaggctgttcatcagcagattctaagcaacaagttcctgatt gtacgaatgagagaactactgtggagaaaggaagattgccagaggttttaccgagagcat gaagggcgttttttctatcagaggctggtggagttcatggccagcgggccaatccgagcc tacatccttgcccacaaggatgccatccagctctggaggacgctcatgggacccaccaga gtgttccgagcacgccatgtggccccagattctatccgtgggagtttcggcctcactgac acccgcaacaccacccatggttcggactctgtggtttcagccagcagagagattgcagcc ttcttccctgacttcagtgaacagcgctggtatgaggaggaagagccccagttgcgctgt ggccctgtgtgctatagcccagagggaggtcccaacttaaatgtactttccataggaagg ccttcctacatcctgcagtccaactag >gi568815595r:48194640_48398524|GENSCAN_predicted_peptide_6|286_aa MEIRLPDLALKRIFSFLDLFGLLQVSQVNKMHLAITMDRKKTIKVWNCQDRDALAVLPMP QPCYCMEAYLTKDGPFLMDFTYLPIDMVGMLWFQVGDAAGDIYTFTLPGLRDVSKVTAFQ YGIVLLHCSPDKKWVFACGTYSRTLPQVFLTESLLRPSEGSVPLSTFLPHKLCASACWTP KVKNRITLMSQSSTGKKTEFITFDLTTKKTGGQTVIQAYEIASFQVAAHLKCPIWMGASD GYMIVFTSGPYLLLFSITGFLLQRFEDHQAAINNFWVTSSDGYCPL >gi568815595r:48194640_48398524|GENSCAN_predicted_CDS_6|858_bp atggagatccgattgcctgacttagctttgaagcgaatcttctctttcctggacctgttc ggcttgctgcaggtttcccaggtgaacaagatgcatctcgccatcactatggatcggaaa aaaactatcaaagtgtggaactgtcaggacagggacgctctggctgttctccccatgcca cagccctgttattgcatggaagcctatcttacaaaggatggcccattcctgatggatttc acatatcttcctattgatatggtggggatgctttggtttcaggttggcgatgctgcaggt gacatctacacatttacactgcctgggttaagagatgtttctaaagttactgcatttcaa tatggtattgtacttctacactgctctcctgacaagaaatgggtatttgcatgtgggaca tacagtcgtaccttgccacaggtattcctcacagagtccttactgagaccatcagaaggc agtgttcctctgtctacctttctcccacataaattatgtgccagcgcctgctggacccca aaggtgaaaaacaggataacactgatgtcccaaagtagcactggaaaaaagacagaattt atcacctttgatctaacaaccaagaagactggaggccaaacagtcatccaagcatatgag atcgcaagtttccaggtggcagctcatctgaagtgccctatctggatgggagccagtgat ggatatatgattgtctttaccagtggaccatacttgttactcttcagcatcactggcttc ctgctgcaacgatttgaggaccatcaggcagccatcaacaacttctgggtgacctcatct gacggctattgccctttg