GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:55:59 Sequence gi568815595r:48058948_48287947 : 229000 bp : 47.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15163 15295 133 0 1 78 47 67 0.611 1.90 1.02 Intr + 29691 29892 202 1 1 106 75 58 0.958 4.64 1.03 Intr + 30120 30323 204 0 0 79 39 190 0.947 11.52 1.04 Term + 33091 33199 109 1 1 70 38 109 0.883 2.08 1.05 PlyA + 34026 34031 6 1.05 2.02 PlyA - 35805 35800 6 1.05 2.01 Sngl - 62268 61924 345 0 0 70 55 205 0.789 11.44 2.00 Prom - 86302 86263 40 -1.56 3.14 PlyA - 86770 86765 6 1.05 3.13 Term - 100138 99998 141 1 0 111 55 226 0.999 19.53 3.12 Intr - 100508 100397 112 1 1 83 110 53 0.999 7.38 3.11 Intr - 105490 105360 131 1 2 74 84 127 0.998 10.39 3.10 Intr - 106787 106689 99 2 0 90 86 187 0.999 19.01 3.09 Intr - 106946 106884 63 2 0 89 87 19 0.637 0.81 3.08 Intr - 108997 108899 99 1 0 85 86 78 0.951 7.61 3.07 Intr - 115510 115337 174 1 0 55 88 80 0.917 4.84 3.06 Intr - 118495 118424 72 1 0 90 70 22 0.509 0.20 3.05 Intr - 119041 118907 135 1 0 89 81 139 0.910 14.06 3.04 Intr - 121947 121774 174 0 0 65 80 78 0.636 4.84 3.03 Intr - 124083 123982 102 0 0 69 82 81 0.806 5.97 3.02 Intr - 124889 124853 37 1 1 78 95 -24 0.480 -4.44 3.01 Init - 129000 128831 170 0 2 105 110 207 0.999 22.91 3.00 Prom - 132764 132725 40 -3.66 4.05 PlyA - 133210 133205 6 1.05 4.04 Term - 146444 146336 109 1 1 94 39 88 0.254 2.48 4.03 Intr - 147412 147348 65 1 2 131 27 44 0.127 0.12 4.02 Intr - 156514 156431 84 1 0 114 72 81 0.298 9.12 4.01 Init - 156916 156713 204 1 0 72 96 102 0.862 8.25 4.00 Prom - 158084 158045 40 -7.66 5.00 Prom + 164513 164552 40 -6.16 5.01 Init + 164565 164765 201 2 0 100 89 402 0.809 38.38 5.02 Intr + 165407 165514 108 1 0 71 60 125 0.997 8.48 5.03 Intr + 165656 165727 72 1 0 77 113 71 0.981 8.10 5.04 Term + 166346 166477 132 1 0 64 36 108 0.686 1.29 5.05 PlyA + 166517 166522 6 1.05 6.00 Prom + 172939 172978 40 -5.96 6.01 Init + 175527 175658 132 2 0 49 110 182 0.527 16.71 6.02 Intr + 197674 197796 123 0 0 99 8 84 0.016 2.28 6.03 Intr + 201866 201992 127 1 1 34 82 34 0.015 -2.35 6.04 Intr + 208968 209824 857 1 2 99 -6 260 0.077 8.78 6.05 Term + 209966 210835 870 1 0 53 45 406 0.118 25.44 6.06 PlyA + 214488 214493 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 151771 152094 324 0 0 72 55 185 0.802 9.61 S.002 Sngl - 197991 197749 243 0 0 89 41 175 0.834 6.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:48058948_48287947|GENSCAN_predicted_peptide_1|215_aa MEHYAAIKKDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIGRRPPPFHASGGPRGV PSCRRARLGQPRDSEEAAPPIGPRLRLHLPAAAETELGQKGARRAPGRAVRRAARAAGGW GEVRLGALASRLRPANAEADRRPIDCLRCSMMAVALSQSLGARSEESARVGAHADRATSG GKEHSLQFQGAQDEKHRTTYSGHTAYGIALLHKQQ >gi568815595r:48058948_48287947|GENSCAN_predicted_CDS_1|648_bp atggaacactatgcagccataaaaaaggatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaccaaacaccgcatg ttctcactcataggccggcggccaccgcccttccacgcctcgggcgggccccgtggggta cccagctgccgcagggcaaggctcgggcagccccgggacagtgaggaggccgcccctcca atcggaccccgcctgcggctgcacctacccgccgccgccgagacggagctggggcagaag ggagccaggagagcgccgggaagagccgtgaggagagctgcccgagccgcgggcggttgg ggcgaggtgcgcctgggggctctagcctcccgcctgcggcctgcgaatgccgaagcggac cgcaggccgatcgactgccttcgctgttctatgatggccgtcgctctctctcagagtctg ggcgcccggagcgaggagagtgcgagagtcggcgcacacgcggaccgggccaccagcggt ggaaaagaacattccctccagttccagggtgcccaggatgaaaagcatagaaccacctat tctgggcatactgcctatgggattgccctgctccacaagcagcagtaa >gi568815595r:48058948_48287947|GENSCAN_predicted_peptide_2|114_aa MVHYSLDPENPAKSCKSRGSNLRVHFKNTRETAQAIMDMHVRKATKYLKDVTLQKQCVPF RHYNGRVGRCTQAKQWGWTQNWCPKKSAEFLLHMLKNAESNAEFKGFDVDSLVI >gi568815595r:48058948_48287947|GENSCAN_predicted_CDS_2|345_bp atggttcactattcacttgacccagagaaccccgcaaaatcatgcaaatcaagaggttcc aatcttcgtgttcactttaagaacactcgtgaaactgcccaggccatcatggatatgcat gtacgaaaagccacgaagtatctgaaagatgtcactttacagaaacagtgcgtaccattc cgacattacaacggtagagttggcaggtgtacccaggccaagcagtggggctggacacaa aattggtgtcccaaaaagagtgctgaatttttgctgcacatgcttaaaaatgcagagagt aatgctgaatttaaaggtttcgatgtagattctctggtcatttga >gi568815595r:48058948_48287947|GENSCAN_predicted_peptide_3|502_aa MELGPEPPHRRRLLFACSPPPASQPVVKALFGASAAGGLSPVTNLTVTMDQLQGLGSLEN PMRRIHSLPQKLLGCSPALKRSHSDSLDHDIFQLIDPDENKENLRPELVFMDSRCLNYVL QEAFEFKKPVRPVSRGCLHSHGLQEGKDLFTQRQNSAPARMLSSNERDSSEPGNFIPLFT PQSPVTATLSDEDDGFVDLLDGENLKNEEETPSCMASLWTAPLVMRTTNLDNRCKLFDSP SLCSSSTRSVLKRPERSQEESPPGSTKRRKSMSGASPKESTNPEKAHETLHQSLSLASSP KGTIENILDNDPRDLIGDFSKGYLFHTVAGKHQDLKYISPEIMASVLNGKFANLIKEFVI IDCRYPYEYEGGHIKGAVNLHMEEEVEDFLLKKPIVPTDGKRVIVVFHCEFSSERGPRMC RYVRERDRLGNEYPKLHYPELYVLKGGYKEFFMKCQSYCEPPSYRPMHHEDFKEDLKKFR TKSRTWAGEKSKREMYSRLKKL >gi568815595r:48058948_48287947|GENSCAN_predicted_CDS_3|1509_bp atggaactgggcccggagcccccgcaccgccgccgcctgctcttcgcctgcagcccccct cccgcgtcgcagcccgtcgtgaaggcgctatttggcgcttcagccgccgggggactgtcg cctgtcaccaacctgaccgtcactatggaccagctgcagggtctgggcagccttgaaaat cctatgagaagaatacattccctacctcagaagctgttgggatgtagtccagctctgaag aggagccattctgattctcttgaccatgacatctttcagctcatcgacccagatgagaac aaggaaaatctgaggcctgagttggttttcatggacagtagatgtctcaactatgtcttg caggaagcctttgagtttaagaagccagtaagacctgtatctcgtggctgcctgcactct catggactccaggagggtaaagatctcttcacacagaggcagaactctgccccagctcgg atgctttcctcaaatgaaagagatagcagtgaaccagggaatttcattcctctttttaca ccccagtcacctgtgacagccactttgtctgatgaggatgatggcttcgtggaccttctc gatggagagaatctgaagaatgaggaggagaccccctcgtgcatggcaagcctctggaca gctcctctcgtcatgagaactacaaaccttgacaaccgatgcaagctgtttgactcccct tccctgtgtagctccagcactcggtcagtgttgaagagaccagaacgatctcaagaggag tctccacctggaagtacaaagaggaggaagagcatgtctggggccagccccaaagagtca actaatccagagaaggcccatgagactcttcatcagtctttatccctggcatcttccccc aaaggaaccattgagaacattttggacaatgacccaagggaccttataggagacttctcc aagggttatctctttcatacagttgctgggaaacatcaggatttaaaatacatctctcca gaaattatggcatctgttttgaatggcaagtttgccaacctcattaaagagtttgttatc atcgactgtcgatacccatatgaatacgagggaggccacatcaagggtgcagtgaacttg cacatggaagaagaggttgaagacttcttattgaagaagcccattgtacctactgatggc aagcgtgtcattgttgtgtttcactgcgagttttcttctgagagaggtccccgcatgtgc cggtatgtgagagagagagatcgcctgggtaatgaataccccaaactccactaccctgag ctgtatgtcctgaaggggggatacaaggagttctttatgaaatgccagtcttactgtgag ccccctagctaccggcccatgcaccacgaggactttaaagaagacctgaagaagttccgc accaagagccggacctgggcaggggagaagagcaagagggagatgtacagtcgtctgaag aagctctga >gi568815595r:48058948_48287947|GENSCAN_predicted_peptide_4|153_aa MRATDKSLEQGPLSRKRQNQDLDSMPPPTNLAVLLMGTCGRIQALPPALCSKLEMQMWAA MRVEIWDRQVKNCRGYFSAAEDPKKLPICESQNFAGAVPLLLWLLKTRNKDSKETLGKIR STPSYPNLAGNWELGELACALLSSPVPLEPMQA >gi568815595r:48058948_48287947|GENSCAN_predicted_CDS_4|462_bp atgagggccacagataaatcacttgaacaaggacccctatccagaaaacggcagaaccag gacttggactcaatgcccccacccaccaaccttgctgtccttctcatggggacatgtgga agaattcaagccctgcctcctgccctctgcagcaaactggaaatgcagatgtgggctgcc atgagggtggaaatatgggacaggcaagtgaagaactgccgggggtacttcagtgctgct gaggaccccaagaagttgcccatctgtgaatcccaaaactttgctggggctgtacccctg ctcctctggctgcttaagaccaggaacaaagactccaaggagactctgggcaaaattagg agcacccccagctacccgaacctagcgggcaactgggaacttggagagctggcctgtgcc ctcctctccagcccagtgcccttggaacccatgcaggcatag >gi568815595r:48058948_48287947|GENSCAN_predicted_peptide_5|170_aa MKTQRDGHSLGRWSLVLLLLGLVMPLAIIAQVLSYKEAVLRAIDGINQRSSDANLYRLLD LDPRPTMDGDPDTPKPVSFTVKETVCPRTTQQSPEDCDFKKDGLVKRCMGTVTLNQARGS FDISCDKDNKRFALLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTES >gi568815595r:48058948_48287947|GENSCAN_predicted_CDS_5|513_bp atgaagacccaaagggatggccactccctggggcggtggtcactggtgctcctgctgctg ggcctggtgatgcctctggccatcattgcccaggtcctcagctacaaggaagctgtgctt cgtgctatagatggcatcaaccagcggtcctcggatgctaacctctaccgcctcctggac ctggaccccaggcccacgatggatggggacccagacacgccaaagcctgtgagcttcaca gtgaaggagacagtgtgccccaggacgacacagcagtcaccagaggattgtgacttcaag aaggacgggctggtgaagcggtgtatggggacagtgaccctcaaccaggccaggggctcc tttgacatcagttgtgataaggataacaagagatttgccctgctgggtgatttcttccgg aaatctaaagagaagattggcaaagagtttaaaagaattgtccagagaatcaaggatttt ttgcggaatcttgtacccaggacagagtcctag >gi568815595r:48058948_48287947|GENSCAN_predicted_peptide_6|702_aa MNTVAADYTGLTAPFMTLVAAHEKRPFKDEHRWNRQVKQTHAGQACDLSGQHASMGLDEL AKQQNIIIVLLVVQVPVADGALARGGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLEN LRNLVSLAESKPEVHTCPSCPLAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQP QSQHPSDKNHRGAEAEDQRVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEI QLSPAQNASSEEVDRISKRAETPGFGAVTFGECALAFNQKSNLFRQKAVTAEKSSDKRQS QVCRECGRGFSRKSQLIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCS HCGRGFSCKPYLIRHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYWTHSEVK PHVCEECGHGFSQKSSLKSHRRTHSGEKPYVCGECGRGFSRRIVLNGHWRTHTGEKPYTC FECGRNFSLKSALSVHQRIHSGEKPYACTECGQGFITKSQLIRHQRTHTGEKPYVCGECG RGFIAQSTLHYHRSTHSKEKPYVCSQCGRGFCDKSTLLAHEQTHSGEKPYVCGECGRGFG RKILLNRHWRTHTGEKPYACIECGRNFSHKSTLSLHQRIHSGEKPYACVECGQSFRRKSQ LIIHQKIHSGKSFRGARSEDVILATSQPSATPAEMLREKPCL >gi568815595r:48058948_48287947|GENSCAN_predicted_CDS_6|2109_bp atgaacactgtggcagcagactacactggactcacagccccgttcatgactcttgtggct gcccatgaaaaacgaccattcaaagatgaacatcgatggaatcgacaagttaaacagacc catgctggccaggcctgtgatctttcggggcagcatgcctccatggggctggatgaactg gctaagcagcaaaacatcatcatagttttacttgtggttcaggttccagtggcagatggg gcactggccagagggggaccagtgactttcgaggatgtggctgtgcttttcactgaggca gagtggaagagactgagccttgagcagaggaacctatacaaagaagtgatgctggaaaat ctcaggaatctggtctcattggcagaatcaaagccagaagtccatacctgcccttcttgc cctctggcctttggcagtcagcagttcctcagccaagatgagctacacaatcatcctatt ccaggtttccatgcaggaaatcaactccacccaggaaatccctgcccagaggatcagcca cagtcacaacatccttctgataaaaatcacaggggggctgaagcagaagatcaacgagtg gaaggaggcgtcagacccttgttttggagtacaaatgaaaggggggctttagtgggtttc tctagcctgttccagagaccaccaataagctcttggggaggcaacagaatattagagata cagctcagtccagcccagaatgcaagctctgaggaagtagacagaatttccaagagggca gaaaccccagggtttggagcagtcacgtttggggagtgtgcactagcttttaaccagaag tcaaacctgttcagacagaaggcagtcacagcagaaaaatcttcagacaaaaggcagtca caggtgtgcagggagtgtgggcgaggctttagcaggaagtcacagctcatcatacaccag aggacacacacaggagaaaagccttatgtctgcggagagtgtgggcgaggctttatagtt gagtcagtcctccgcaaccacctgagtacacactccggggagaaaccttatgtgtgcagc cattgtgggcgaggctttagctgcaagccatacctcatcagacatcagaggacacacaca agggagaaatcgtttatgtgcacagtgtgtgggcgaggctttcgtgaaaagtcagagctc attaagcaccagagaattcacacgggggataagccttattggacacattcagaggtgaaa cctcacgtgtgtgaggagtgtgggcatggatttagccagaagtcgtcgctcaaatcacat cggagaacacactcaggggagaagccttatgtgtgtggggaatgtgggcggggatttagc cggaggatagtcctcaatggacactggaggacacacacgggagagaagccttacacgtgc tttgagtgtgggcgaaactttagcctcaagtccgctcttagtgtacatcagaggatacac tctggggagaagccttatgcatgcacggagtgtgggcaaggctttatcacgaaatcacag ctcatcagacaccagaggacacacacaggagaaaagccttatgtctgcggagagtgtggg cgaggctttatagctcagtcaaccctccactaccaccggagtacacactccaaggaaaaa ccttatgtgtgcagccagtgtgggcgaggcttttgtgataaatcaactctcctcgcacac gagcagacacattcaggggagaagccttatgtgtgtggggaatgtgggcggggatttggc cggaagatactcctcaacagacactggaggacacacacaggagagaaaccttacgcatgc atcgagtgtgggcgaaactttagccacaagtccactctcagcttacatcagaggatacac tcgggggagaagccttatgcatgcgtggagtgtgggcaaagctttaggagaaagtcacag ctcatcatacaccagaagatacactcggggaaaagctttagaggtgcaaggagtgaggat gtgattttagcaacaagtcagccatcagccacaccagcggaaatgcttagggagaagcct tgtttgtaa