GENSCAN 1.0	Date run: 6-Nov-116	Time: 17:49:47

Sequence gi568815595f:48123512_48325421 : 201910 bp : 46.82% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   3387   3488  102  2  0   73   40   138 0.895   5.78
 1.02 PlyA +   6667   6672    6                               1.05

 2.14 PlyA -   7458   7453    6                               1.05
 2.13 Term -  35574  35434  141  0  0  111   55   226 0.999  19.53
 2.12 Intr -  35944  35833  112  0  1   83  110    53 0.999   7.38
 2.11 Intr -  40926  40796  131  0  2   74   84   127 0.998  10.39
 2.10 Intr -  42223  42125   99  1  0   90   86   187 0.999  19.01
 2.09 Intr -  42382  42320   63  1  0   89   87    19 0.637   0.81
 2.08 Intr -  44433  44335   99  0  0   85   86    78 0.951   7.61
 2.07 Intr -  50946  50773  174  0  0   55   88    80 0.917   4.84
 2.06 Intr -  53931  53860   72  0  0   90   70    22 0.509   0.20
 2.05 Intr -  54477  54343  135  0  0   89   81   139 0.910  14.06
 2.04 Intr -  57383  57210  174  2  0   65   80    78 0.636   4.84
 2.03 Intr -  59519  59418  102  2  0   69   82    81 0.806   5.97
 2.02 Intr -  60325  60289   37  0  1   78   95   -24 0.480  -4.44
 2.01 Init -  64436  64267  170  2  2  105  110   207 0.999  22.91
 2.00 Prom -  68200  68161   40                              -3.66

 3.05 PlyA -  68646  68641    6                               1.05
 3.04 Term -  81880  81772  109  0  1   94   39    88 0.254   2.48
 3.03 Intr -  82848  82784   65  0  2  131   27    44 0.127   0.12
 3.02 Intr -  91950  91867   84  0  0  114   72    81 0.298   9.12
 3.01 Init -  92352  92149  204  0  0   72   96   102 0.862   8.25
 3.00 Prom -  93520  93481   40                              -7.66

 4.00 Prom +  99949  99988   40                              -6.16
 4.01 Init + 100001 100201  201  1  0  100   89   402 0.809  38.38
 4.02 Intr + 100843 100950  108  0  0   71   60   125 0.997   8.48
 4.03 Intr + 101092 101163   72  0  0   77  113    71 0.981   8.10
 4.04 Term + 101782 101913  132  0  0   64   36   108 0.686   1.29
 4.05 PlyA + 101953 101958    6                               1.05

 5.00 Prom + 108375 108414   40                              -5.96
 5.01 Init + 110963 111094  132  1  0   49  110   182 0.527  16.71
 5.02 Intr + 133110 133232  123  2  0   99    8    84 0.016   2.28
 5.03 Intr + 137302 137428  127  0  1   34   82    34 0.015  -2.35
 5.04 Intr + 144404 145260  857  0  2   99   -6   260 0.077   8.78
 5.05 Term + 145402 146271  870  0  0   53   45   406 0.118  25.44
 5.06 PlyA + 149924 149929    6                               1.05

 6.07 PlyA - 151374 151369    6                               1.05
 6.06 Term - 165338 165280   59  0  2  123   39    23 0.120  -1.25
 6.05 Intr - 171292 171170  123  2  0  112   22   123 0.266   8.66
 6.04 Intr - 171724 171564  161  0  2   88  100   201 0.995  20.93
 6.03 Intr - 172647 172608   40  1  1   99   80     6 0.883  -1.82
 6.02 Intr - 173318 173216  103  2  1   39   97    98 0.665   5.55
 6.01 Init - 175005 174916   90  0  0   71   94    58 0.693   3.29
 6.00 Prom - 186172 186133   40                              -3.46

 7.03 PlyA - 186235 186230    6                               1.05
 7.02 Term - 190815 190677  139  2  1   93   55    49 0.312  -0.46
 7.01 Intr - 196107 195986  122  0  2   74   84    65 0.454   3.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  87207  87530  324  2  0   72   55   185 0.802   9.61
S.002 Sngl - 133427 133185  243  2  0   89   41   175 0.834   6.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_1|33_aa
VLHISPEPENSNDVSGKSQAIGTNVYGVNTYRV

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_1|102_bp
gtactacacatctccccagaaccagagaacagcaatgatgtcagtgggaaatcccaggca
attggcactaatgtctacggtgtaaacacctacagagtgtag

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_2|502_aa
MELGPEPPHRRRLLFACSPPPASQPVVKALFGASAAGGLSPVTNLTVTMDQLQGLGSLEN
PMRRIHSLPQKLLGCSPALKRSHSDSLDHDIFQLIDPDENKENLRPELVFMDSRCLNYVL
QEAFEFKKPVRPVSRGCLHSHGLQEGKDLFTQRQNSAPARMLSSNERDSSEPGNFIPLFT
PQSPVTATLSDEDDGFVDLLDGENLKNEEETPSCMASLWTAPLVMRTTNLDNRCKLFDSP
SLCSSSTRSVLKRPERSQEESPPGSTKRRKSMSGASPKESTNPEKAHETLHQSLSLASSP
KGTIENILDNDPRDLIGDFSKGYLFHTVAGKHQDLKYISPEIMASVLNGKFANLIKEFVI
IDCRYPYEYEGGHIKGAVNLHMEEEVEDFLLKKPIVPTDGKRVIVVFHCEFSSERGPRMC
RYVRERDRLGNEYPKLHYPELYVLKGGYKEFFMKCQSYCEPPSYRPMHHEDFKEDLKKFR
TKSRTWAGEKSKREMYSRLKKL

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_2|1509_bp
atggaactgggcccggagcccccgcaccgccgccgcctgctcttcgcctgcagcccccct
cccgcgtcgcagcccgtcgtgaaggcgctatttggcgcttcagccgccgggggactgtcg
cctgtcaccaacctgaccgtcactatggaccagctgcagggtctgggcagccttgaaaat
cctatgagaagaatacattccctacctcagaagctgttgggatgtagtccagctctgaag
aggagccattctgattctcttgaccatgacatctttcagctcatcgacccagatgagaac
aaggaaaatctgaggcctgagttggttttcatggacagtagatgtctcaactatgtcttg
caggaagcctttgagtttaagaagccagtaagacctgtatctcgtggctgcctgcactct
catggactccaggagggtaaagatctcttcacacagaggcagaactctgccccagctcgg
atgctttcctcaaatgaaagagatagcagtgaaccagggaatttcattcctctttttaca
ccccagtcacctgtgacagccactttgtctgatgaggatgatggcttcgtggaccttctc
gatggagagaatctgaagaatgaggaggagaccccctcgtgcatggcaagcctctggaca
gctcctctcgtcatgagaactacaaaccttgacaaccgatgcaagctgtttgactcccct
tccctgtgtagctccagcactcggtcagtgttgaagagaccagaacgatctcaagaggag
tctccacctggaagtacaaagaggaggaagagcatgtctggggccagccccaaagagtca
actaatccagagaaggcccatgagactcttcatcagtctttatccctggcatcttccccc
aaaggaaccattgagaacattttggacaatgacccaagggaccttataggagacttctcc
aagggttatctctttcatacagttgctgggaaacatcaggatttaaaatacatctctcca
gaaattatggcatctgttttgaatggcaagtttgccaacctcattaaagagtttgttatc
atcgactgtcgatacccatatgaatacgagggaggccacatcaagggtgcagtgaacttg
cacatggaagaagaggttgaagacttcttattgaagaagcccattgtacctactgatggc
aagcgtgtcattgttgtgtttcactgcgagttttcttctgagagaggtccccgcatgtgc
cggtatgtgagagagagagatcgcctgggtaatgaataccccaaactccactaccctgag
ctgtatgtcctgaaggggggatacaaggagttctttatgaaatgccagtcttactgtgag
ccccctagctaccggcccatgcaccacgaggactttaaagaagacctgaagaagttccgc
accaagagccggacctgggcaggggagaagagcaagagggagatgtacagtcgtctgaag
aagctctga

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_3|153_aa
MRATDKSLEQGPLSRKRQNQDLDSMPPPTNLAVLLMGTCGRIQALPPALCSKLEMQMWAA
MRVEIWDRQVKNCRGYFSAAEDPKKLPICESQNFAGAVPLLLWLLKTRNKDSKETLGKIR
STPSYPNLAGNWELGELACALLSSPVPLEPMQA

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_3|462_bp
atgagggccacagataaatcacttgaacaaggacccctatccagaaaacggcagaaccag
gacttggactcaatgcccccacccaccaaccttgctgtccttctcatggggacatgtgga
agaattcaagccctgcctcctgccctctgcagcaaactggaaatgcagatgtgggctgcc
atgagggtggaaatatgggacaggcaagtgaagaactgccgggggtacttcagtgctgct
gaggaccccaagaagttgcccatctgtgaatcccaaaactttgctggggctgtacccctg
ctcctctggctgcttaagaccaggaacaaagactccaaggagactctgggcaaaattagg
agcacccccagctacccgaacctagcgggcaactgggaacttggagagctggcctgtgcc
ctcctctccagcccagtgcccttggaacccatgcaggcatag

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_4|170_aa
MKTQRDGHSLGRWSLVLLLLGLVMPLAIIAQVLSYKEAVLRAIDGINQRSSDANLYRLLD
LDPRPTMDGDPDTPKPVSFTVKETVCPRTTQQSPEDCDFKKDGLVKRCMGTVTLNQARGS
FDISCDKDNKRFALLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_4|513_bp
atgaagacccaaagggatggccactccctggggcggtggtcactggtgctcctgctgctg
ggcctggtgatgcctctggccatcattgcccaggtcctcagctacaaggaagctgtgctt
cgtgctatagatggcatcaaccagcggtcctcggatgctaacctctaccgcctcctggac
ctggaccccaggcccacgatggatggggacccagacacgccaaagcctgtgagcttcaca
gtgaaggagacagtgtgccccaggacgacacagcagtcaccagaggattgtgacttcaag
aaggacgggctggtgaagcggtgtatggggacagtgaccctcaaccaggccaggggctcc
tttgacatcagttgtgataaggataacaagagatttgccctgctgggtgatttcttccgg
aaatctaaagagaagattggcaaagagtttaaaagaattgtccagagaatcaaggatttt
ttgcggaatcttgtacccaggacagagtcctag

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_5|702_aa
MNTVAADYTGLTAPFMTLVAAHEKRPFKDEHRWNRQVKQTHAGQACDLSGQHASMGLDEL
AKQQNIIIVLLVVQVPVADGALARGGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLEN
LRNLVSLAESKPEVHTCPSCPLAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQP
QSQHPSDKNHRGAEAEDQRVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEI
QLSPAQNASSEEVDRISKRAETPGFGAVTFGECALAFNQKSNLFRQKAVTAEKSSDKRQS
QVCRECGRGFSRKSQLIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCS
HCGRGFSCKPYLIRHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYWTHSEVK
PHVCEECGHGFSQKSSLKSHRRTHSGEKPYVCGECGRGFSRRIVLNGHWRTHTGEKPYTC
FECGRNFSLKSALSVHQRIHSGEKPYACTECGQGFITKSQLIRHQRTHTGEKPYVCGECG
RGFIAQSTLHYHRSTHSKEKPYVCSQCGRGFCDKSTLLAHEQTHSGEKPYVCGECGRGFG
RKILLNRHWRTHTGEKPYACIECGRNFSHKSTLSLHQRIHSGEKPYACVECGQSFRRKSQ
LIIHQKIHSGKSFRGARSEDVILATSQPSATPAEMLREKPCL

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_5|2109_bp
atgaacactgtggcagcagactacactggactcacagccccgttcatgactcttgtggct
gcccatgaaaaacgaccattcaaagatgaacatcgatggaatcgacaagttaaacagacc
catgctggccaggcctgtgatctttcggggcagcatgcctccatggggctggatgaactg
gctaagcagcaaaacatcatcatagttttacttgtggttcaggttccagtggcagatggg
gcactggccagagggggaccagtgactttcgaggatgtggctgtgcttttcactgaggca
gagtggaagagactgagccttgagcagaggaacctatacaaagaagtgatgctggaaaat
ctcaggaatctggtctcattggcagaatcaaagccagaagtccatacctgcccttcttgc
cctctggcctttggcagtcagcagttcctcagccaagatgagctacacaatcatcctatt
ccaggtttccatgcaggaaatcaactccacccaggaaatccctgcccagaggatcagcca
cagtcacaacatccttctgataaaaatcacaggggggctgaagcagaagatcaacgagtg
gaaggaggcgtcagacccttgttttggagtacaaatgaaaggggggctttagtgggtttc
tctagcctgttccagagaccaccaataagctcttggggaggcaacagaatattagagata
cagctcagtccagcccagaatgcaagctctgaggaagtagacagaatttccaagagggca
gaaaccccagggtttggagcagtcacgtttggggagtgtgcactagcttttaaccagaag
tcaaacctgttcagacagaaggcagtcacagcagaaaaatcttcagacaaaaggcagtca
caggtgtgcagggagtgtgggcgaggctttagcaggaagtcacagctcatcatacaccag
aggacacacacaggagaaaagccttatgtctgcggagagtgtgggcgaggctttatagtt
gagtcagtcctccgcaaccacctgagtacacactccggggagaaaccttatgtgtgcagc
cattgtgggcgaggctttagctgcaagccatacctcatcagacatcagaggacacacaca
agggagaaatcgtttatgtgcacagtgtgtgggcgaggctttcgtgaaaagtcagagctc
attaagcaccagagaattcacacgggggataagccttattggacacattcagaggtgaaa
cctcacgtgtgtgaggagtgtgggcatggatttagccagaagtcgtcgctcaaatcacat
cggagaacacactcaggggagaagccttatgtgtgtggggaatgtgggcggggatttagc
cggaggatagtcctcaatggacactggaggacacacacgggagagaagccttacacgtgc
tttgagtgtgggcgaaactttagcctcaagtccgctcttagtgtacatcagaggatacac
tctggggagaagccttatgcatgcacggagtgtgggcaaggctttatcacgaaatcacag
ctcatcagacaccagaggacacacacaggagaaaagccttatgtctgcggagagtgtggg
cgaggctttatagctcagtcaaccctccactaccaccggagtacacactccaaggaaaaa
ccttatgtgtgcagccagtgtgggcgaggcttttgtgataaatcaactctcctcgcacac
gagcagacacattcaggggagaagccttatgtgtgtggggaatgtgggcggggatttggc
cggaagatactcctcaacagacactggaggacacacacaggagagaaaccttacgcatgc
atcgagtgtgggcgaaactttagccacaagtccactctcagcttacatcagaggatacac
tcgggggagaagccttatgcatgcgtggagtgtgggcaaagctttaggagaaagtcacag
ctcatcatacaccagaagatacactcggggaaaagctttagaggtgcaaggagtgaggat
gtgattttagcaacaagtcagccatcagccacaccagcggaaatgcttagggagaagcct
tgtttgtaa

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_6|191_aa
MASILRSPQALQLTLALIKPDAVAHPLILEAVHQQILSNKFLIVRMRELLWRKEDCQRFY
REHEGRFFYQRLVEFMASGPIRAYILAHKDAIQLWRTLMGPTRVFRARHVAPDSIRGSFG
LTDTRNTTHGSDSVVSASREIAAFFPDFSEQRWYEEEEPQLRCGPVCYSPEGGPNLNVLS
IGRPSYILQSN

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_6|576_bp
atggcctcaatcttgcgaagccctcaggctctccagctcactctagccctgatcaagcct
gacgcagtcgcccatccactgattctggaggctgttcatcagcagattctaagcaacaag
ttcctgattgtacgaatgagagaactactgtggagaaaggaagattgccagaggttttac
cgagagcatgaagggcgttttttctatcagaggctggtggagttcatggccagcgggcca
atccgagcctacatccttgcccacaaggatgccatccagctctggaggacgctcatggga
cccaccagagtgttccgagcacgccatgtggccccagattctatccgtgggagtttcggc
ctcactgacacccgcaacaccacccatggttcggactctgtggtttcagccagcagagag
attgcagccttcttccctgacttcagtgaacagcgctggtatgaggaggaagagccccag
ttgcgctgtggccctgtgtgctatagcccagagggaggtcccaacttaaatgtactttcc
ataggaaggccttcctacatcctgcagtccaactag

>gi568815595f:48123512_48325421|GENSCAN_predicted_peptide_7|86_aa
VECLKNVNKCWFLSYIKPSEPICGSDQVTYSSDCHLCSKILILENLILTIFASVLVAFVE
EILKILTSPFSLSKQKGSLLNEDTET

>gi568815595f:48123512_48325421|GENSCAN_predicted_CDS_7|261_bp
gttgaatgcctcaagaatgtaaataagtgctggtttttatcctacatcaagcccagtgaa
cctatttgtggcagtgaccaggttacctacagtagtgactgccatctgtgctccaaaatt
ctaattctggaaaacttgattctgaccatttttgccagtgttcttgttgctttcgtggag
gaaattttaaagattcttacttctccattttcactgtccaagcaaaagggctcacttttg
aatgaagacactgagacctga