GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:57:19 Sequence gi568815595f:48141172_48368783 : 227612 bp : 45.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 PlyA - 1776 1771 6 1.05 1.13 Term - 17914 17774 141 1 0 111 55 226 0.999 19.53 1.12 Intr - 18284 18173 112 1 1 83 110 53 0.999 7.38 1.11 Intr - 23266 23136 131 1 2 74 84 127 0.998 10.39 1.10 Intr - 24563 24465 99 2 0 90 86 187 0.999 19.01 1.09 Intr - 24722 24660 63 2 0 89 87 19 0.637 0.81 1.08 Intr - 26773 26675 99 1 0 85 86 78 0.951 7.61 1.07 Intr - 33286 33113 174 1 0 55 88 80 0.917 4.84 1.06 Intr - 36271 36200 72 1 0 90 70 22 0.509 0.20 1.05 Intr - 36817 36683 135 1 0 89 81 139 0.910 14.06 1.04 Intr - 39723 39550 174 0 0 65 80 78 0.636 4.84 1.03 Intr - 41859 41758 102 0 0 69 82 81 0.806 5.97 1.02 Intr - 42665 42629 37 1 1 78 95 -24 0.480 -4.44 1.01 Init - 46776 46607 170 0 2 105 110 207 0.999 22.91 1.00 Prom - 50540 50501 40 -3.66 2.05 PlyA - 50986 50981 6 1.05 2.04 Term - 64220 64112 109 1 1 94 39 88 0.254 2.48 2.03 Intr - 65188 65124 65 1 2 131 27 44 0.127 0.12 2.02 Intr - 74290 74207 84 1 0 114 72 81 0.298 9.12 2.01 Init - 74692 74489 204 1 0 72 96 102 0.862 8.25 2.00 Prom - 75860 75821 40 -7.66 3.00 Prom + 82289 82328 40 -6.16 3.01 Init + 82341 82541 201 2 0 100 89 402 0.809 38.38 3.02 Intr + 83183 83290 108 1 0 71 60 125 0.997 8.48 3.03 Intr + 83432 83503 72 1 0 77 113 71 0.981 8.10 3.04 Term + 84122 84253 132 1 0 64 36 108 0.686 1.29 3.05 PlyA + 84293 84298 6 1.05 4.00 Prom + 90715 90754 40 -5.96 4.01 Init + 93303 93434 132 2 0 49 110 182 0.527 16.71 4.02 Intr + 115450 115572 123 0 0 99 8 84 0.016 2.28 4.03 Intr + 119642 119768 127 1 1 34 82 34 0.015 -2.35 4.04 Intr + 126744 127600 857 1 2 99 -6 260 0.077 8.78 4.05 Term + 127742 128611 870 1 0 53 45 406 0.118 25.44 4.06 PlyA + 132264 132269 6 1.05 5.11 PlyA - 133714 133709 6 1.05 5.10 Term - 147678 147620 59 1 2 123 39 23 0.120 -1.25 5.09 Intr - 153632 153510 123 0 0 112 22 123 0.266 8.66 5.08 Intr - 154064 153904 161 1 2 88 100 201 0.995 20.93 5.07 Intr - 154987 154948 40 2 1 99 80 6 0.883 -1.82 5.06 Intr - 155658 155556 103 0 1 39 97 98 0.664 5.55 5.05 Intr - 178155 178109 47 1 2 53 100 33 0.016 -0.87 5.04 Intr - 187220 187104 117 0 0 39 97 88 0.340 5.24 5.03 Intr - 188057 187982 76 2 1 55 103 43 0.004 1.59 5.02 Intr - 191090 190946 145 1 1 8 15 160 0.033 0.98 5.01 Init - 208510 208455 56 1 2 83 113 33 0.541 6.06 5.00 Prom - 218901 218862 40 -3.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 69547 69870 324 0 0 72 55 185 0.802 9.61 S.002 Sngl - 115767 115525 243 0 0 89 41 175 0.834 6.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:48141172_48368783|GENSCAN_predicted_peptide_1|502_aa MELGPEPPHRRRLLFACSPPPASQPVVKALFGASAAGGLSPVTNLTVTMDQLQGLGSLEN PMRRIHSLPQKLLGCSPALKRSHSDSLDHDIFQLIDPDENKENLRPELVFMDSRCLNYVL QEAFEFKKPVRPVSRGCLHSHGLQEGKDLFTQRQNSAPARMLSSNERDSSEPGNFIPLFT PQSPVTATLSDEDDGFVDLLDGENLKNEEETPSCMASLWTAPLVMRTTNLDNRCKLFDSP SLCSSSTRSVLKRPERSQEESPPGSTKRRKSMSGASPKESTNPEKAHETLHQSLSLASSP KGTIENILDNDPRDLIGDFSKGYLFHTVAGKHQDLKYISPEIMASVLNGKFANLIKEFVI IDCRYPYEYEGGHIKGAVNLHMEEEVEDFLLKKPIVPTDGKRVIVVFHCEFSSERGPRMC RYVRERDRLGNEYPKLHYPELYVLKGGYKEFFMKCQSYCEPPSYRPMHHEDFKEDLKKFR TKSRTWAGEKSKREMYSRLKKL >gi568815595f:48141172_48368783|GENSCAN_predicted_CDS_1|1509_bp atggaactgggcccggagcccccgcaccgccgccgcctgctcttcgcctgcagcccccct cccgcgtcgcagcccgtcgtgaaggcgctatttggcgcttcagccgccgggggactgtcg cctgtcaccaacctgaccgtcactatggaccagctgcagggtctgggcagccttgaaaat cctatgagaagaatacattccctacctcagaagctgttgggatgtagtccagctctgaag aggagccattctgattctcttgaccatgacatctttcagctcatcgacccagatgagaac aaggaaaatctgaggcctgagttggttttcatggacagtagatgtctcaactatgtcttg caggaagcctttgagtttaagaagccagtaagacctgtatctcgtggctgcctgcactct catggactccaggagggtaaagatctcttcacacagaggcagaactctgccccagctcgg atgctttcctcaaatgaaagagatagcagtgaaccagggaatttcattcctctttttaca ccccagtcacctgtgacagccactttgtctgatgaggatgatggcttcgtggaccttctc gatggagagaatctgaagaatgaggaggagaccccctcgtgcatggcaagcctctggaca gctcctctcgtcatgagaactacaaaccttgacaaccgatgcaagctgtttgactcccct tccctgtgtagctccagcactcggtcagtgttgaagagaccagaacgatctcaagaggag tctccacctggaagtacaaagaggaggaagagcatgtctggggccagccccaaagagtca actaatccagagaaggcccatgagactcttcatcagtctttatccctggcatcttccccc aaaggaaccattgagaacattttggacaatgacccaagggaccttataggagacttctcc aagggttatctctttcatacagttgctgggaaacatcaggatttaaaatacatctctcca gaaattatggcatctgttttgaatggcaagtttgccaacctcattaaagagtttgttatc atcgactgtcgatacccatatgaatacgagggaggccacatcaagggtgcagtgaacttg cacatggaagaagaggttgaagacttcttattgaagaagcccattgtacctactgatggc aagcgtgtcattgttgtgtttcactgcgagttttcttctgagagaggtccccgcatgtgc cggtatgtgagagagagagatcgcctgggtaatgaataccccaaactccactaccctgag ctgtatgtcctgaaggggggatacaaggagttctttatgaaatgccagtcttactgtgag ccccctagctaccggcccatgcaccacgaggactttaaagaagacctgaagaagttccgc accaagagccggacctgggcaggggagaagagcaagagggagatgtacagtcgtctgaag aagctctga >gi568815595f:48141172_48368783|GENSCAN_predicted_peptide_2|153_aa MRATDKSLEQGPLSRKRQNQDLDSMPPPTNLAVLLMGTCGRIQALPPALCSKLEMQMWAA MRVEIWDRQVKNCRGYFSAAEDPKKLPICESQNFAGAVPLLLWLLKTRNKDSKETLGKIR STPSYPNLAGNWELGELACALLSSPVPLEPMQA >gi568815595f:48141172_48368783|GENSCAN_predicted_CDS_2|462_bp atgagggccacagataaatcacttgaacaaggacccctatccagaaaacggcagaaccag gacttggactcaatgcccccacccaccaaccttgctgtccttctcatggggacatgtgga agaattcaagccctgcctcctgccctctgcagcaaactggaaatgcagatgtgggctgcc atgagggtggaaatatgggacaggcaagtgaagaactgccgggggtacttcagtgctgct gaggaccccaagaagttgcccatctgtgaatcccaaaactttgctggggctgtacccctg ctcctctggctgcttaagaccaggaacaaagactccaaggagactctgggcaaaattagg agcacccccagctacccgaacctagcgggcaactgggaacttggagagctggcctgtgcc ctcctctccagcccagtgcccttggaacccatgcaggcatag >gi568815595f:48141172_48368783|GENSCAN_predicted_peptide_3|170_aa MKTQRDGHSLGRWSLVLLLLGLVMPLAIIAQVLSYKEAVLRAIDGINQRSSDANLYRLLD LDPRPTMDGDPDTPKPVSFTVKETVCPRTTQQSPEDCDFKKDGLVKRCMGTVTLNQARGS FDISCDKDNKRFALLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES >gi568815595f:48141172_48368783|GENSCAN_predicted_CDS_3|513_bp atgaagacccaaagggatggccactccctggggcggtggtcactggtgctcctgctgctg ggcctggtgatgcctctggccatcattgcccaggtcctcagctacaaggaagctgtgctt cgtgctatagatggcatcaaccagcggtcctcggatgctaacctctaccgcctcctggac ctggaccccaggcccacgatggatggggacccagacacgccaaagcctgtgagcttcaca gtgaaggagacagtgtgccccaggacgacacagcagtcaccagaggattgtgacttcaag aaggacgggctggtgaagcggtgtatggggacagtgaccctcaaccaggccaggggctcc tttgacatcagttgtgataaggataacaagagatttgccctgctgggtgatttcttccgg aaatctaaagagaagattggcaaagagtttaaaagaattgtccagagaatcaaggatttt ttgcggaatcttgtacccaggacagagtcctag >gi568815595f:48141172_48368783|GENSCAN_predicted_peptide_4|702_aa MNTVAADYTGLTAPFMTLVAAHEKRPFKDEHRWNRQVKQTHAGQACDLSGQHASMGLDEL AKQQNIIIVLLVVQVPVADGALARGGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLEN LRNLVSLAESKPEVHTCPSCPLAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQP QSQHPSDKNHRGAEAEDQRVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEI QLSPAQNASSEEVDRISKRAETPGFGAVTFGECALAFNQKSNLFRQKAVTAEKSSDKRQS QVCRECGRGFSRKSQLIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCS HCGRGFSCKPYLIRHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYWTHSEVK PHVCEECGHGFSQKSSLKSHRRTHSGEKPYVCGECGRGFSRRIVLNGHWRTHTGEKPYTC FECGRNFSLKSALSVHQRIHSGEKPYACTECGQGFITKSQLIRHQRTHTGEKPYVCGECG RGFIAQSTLHYHRSTHSKEKPYVCSQCGRGFCDKSTLLAHEQTHSGEKPYVCGECGRGFG RKILLNRHWRTHTGEKPYACIECGRNFSHKSTLSLHQRIHSGEKPYACVECGQSFRRKSQ LIIHQKIHSGKSFRGARSEDVILATSQPSATPAEMLREKPCL >gi568815595f:48141172_48368783|GENSCAN_predicted_CDS_4|2109_bp atgaacactgtggcagcagactacactggactcacagccccgttcatgactcttgtggct gcccatgaaaaacgaccattcaaagatgaacatcgatggaatcgacaagttaaacagacc catgctggccaggcctgtgatctttcggggcagcatgcctccatggggctggatgaactg gctaagcagcaaaacatcatcatagttttacttgtggttcaggttccagtggcagatggg gcactggccagagggggaccagtgactttcgaggatgtggctgtgcttttcactgaggca gagtggaagagactgagccttgagcagaggaacctatacaaagaagtgatgctggaaaat ctcaggaatctggtctcattggcagaatcaaagccagaagtccatacctgcccttcttgc cctctggcctttggcagtcagcagttcctcagccaagatgagctacacaatcatcctatt ccaggtttccatgcaggaaatcaactccacccaggaaatccctgcccagaggatcagcca cagtcacaacatccttctgataaaaatcacaggggggctgaagcagaagatcaacgagtg gaaggaggcgtcagacccttgttttggagtacaaatgaaaggggggctttagtgggtttc tctagcctgttccagagaccaccaataagctcttggggaggcaacagaatattagagata cagctcagtccagcccagaatgcaagctctgaggaagtagacagaatttccaagagggca gaaaccccagggtttggagcagtcacgtttggggagtgtgcactagcttttaaccagaag tcaaacctgttcagacagaaggcagtcacagcagaaaaatcttcagacaaaaggcagtca caggtgtgcagggagtgtgggcgaggctttagcaggaagtcacagctcatcatacaccag aggacacacacaggagaaaagccttatgtctgcggagagtgtgggcgaggctttatagtt gagtcagtcctccgcaaccacctgagtacacactccggggagaaaccttatgtgtgcagc cattgtgggcgaggctttagctgcaagccatacctcatcagacatcagaggacacacaca agggagaaatcgtttatgtgcacagtgtgtgggcgaggctttcgtgaaaagtcagagctc attaagcaccagagaattcacacgggggataagccttattggacacattcagaggtgaaa cctcacgtgtgtgaggagtgtgggcatggatttagccagaagtcgtcgctcaaatcacat cggagaacacactcaggggagaagccttatgtgtgtggggaatgtgggcggggatttagc cggaggatagtcctcaatggacactggaggacacacacgggagagaagccttacacgtgc tttgagtgtgggcgaaactttagcctcaagtccgctcttagtgtacatcagaggatacac tctggggagaagccttatgcatgcacggagtgtgggcaaggctttatcacgaaatcacag ctcatcagacaccagaggacacacacaggagaaaagccttatgtctgcggagagtgtggg cgaggctttatagctcagtcaaccctccactaccaccggagtacacactccaaggaaaaa ccttatgtgtgcagccagtgtgggcgaggcttttgtgataaatcaactctcctcgcacac gagcagacacattcaggggagaagccttatgtgtgtggggaatgtgggcggggatttggc cggaagatactcctcaacagacactggaggacacacacaggagagaaaccttacgcatgc atcgagtgtgggcgaaactttagccacaagtccactctcagcttacatcagaggatacac tcgggggagaagccttatgcatgcgtggagtgtgggcaaagctttaggagaaagtcacag ctcatcatacaccagaagatacactcggggaaaagctttagaggtgcaaggagtgaggat gtgattttagcaacaagtcagccatcagccacaccagcggaaatgcttagggagaagcct tgtttgtaa >gi568815595f:48141172_48368783|GENSCAN_predicted_peptide_5|308_aa MHSLITPVIQVDHRHPPATDQKEQAELDKRDKKKATALVMAFRQVDFGGSGKVKGWANRM PNRDLIPFIMNKLSTKTILFSQFLRATMGVTADVCPTSKVAYFHSSSVTMKGICSDAILV LATSMWMAFAIGTCDPIQTMQKQKSTKAVHQQILSNKFLIVRMRELLWRKEDCQRFYREH EGRFFYQRLVEFMASGPIRAYILAHKDAIQLWRTLMGPTRVFRARHVAPDSIRGSFGLTD TRNTTHGSDSVVSASREIAAFFPDFSEQRWYEEEEPQLRCGPVCYSPEGGPNLNVLSIGR PSYILQSN >gi568815595f:48141172_48368783|GENSCAN_predicted_CDS_5|927_bp atgcacagcctcattactcctgtcatacaagtggaccatcgtcatcctccagccacagat cagaaggagcaggcagaactggacaaacgggataaaaagaaggccaccgctttagtcatg gccttcaggcaagtggactttggaggctctggaaaagtgaaaggctgggcaaatcgaatg cctaatagggatttgattccattcattatgaacaaactatcaacaaaaaccatcctgttc agccagtttctcagagcaaccatgggagtcacagcagatgtttgtcctacatcaaaagtt gcatatttccacagttcttctgtcaccatgaaggggatctgctcagacgccatccttgtt ctagctacctccatgtggatggcctttgcaattggtacatgtgatcctattcagacaatg cagaaacagaaatccactaaggctgttcatcagcagattctaagcaacaagttcctgatt gtacgaatgagagaactactgtggagaaaggaagattgccagaggttttaccgagagcat gaagggcgttttttctatcagaggctggtggagttcatggccagcgggccaatccgagcc tacatccttgcccacaaggatgccatccagctctggaggacgctcatgggacccaccaga gtgttccgagcacgccatgtggccccagattctatccgtgggagtttcggcctcactgac acccgcaacaccacccatggttcggactctgtggtttcagccagcagagagattgcagcc ttcttccctgacttcagtgaacagcgctggtatgaggaggaagagccccagttgcgctgt ggccctgtgtgctatagcccagagggaggtcccaacttaaatgtactttccataggaagg ccttcctacatcctgcagtccaactag