GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:55:42 Sequence gi568815591f:30963265_31206681 : 243417 bp : 47.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 805 861 57 0 0 99 84 100 0.971 10.24 1.02 Intr + 5567 5672 106 1 1 71 47 133 0.412 7.29 1.03 Intr + 5799 5906 108 1 0 85 94 13 0.600 1.86 1.04 Intr + 6603 6700 98 1 2 15 94 67 0.791 -0.27 1.05 Intr + 7855 7952 98 0 2 115 72 178 0.891 17.71 1.06 Intr + 8699 8861 163 2 1 73 75 161 0.991 13.28 1.07 Intr + 10721 10874 154 1 1 58 72 122 0.930 7.25 1.08 Intr + 11165 11225 61 0 1 101 77 93 0.997 7.29 1.09 Intr + 11707 11776 70 1 1 129 44 118 0.974 10.68 1.10 Intr + 12513 12640 128 2 2 115 55 64 0.992 5.28 1.11 Intr + 13165 13294 130 1 1 77 94 128 0.999 13.00 1.12 Intr + 14017 14058 42 0 0 72 85 66 0.860 3.14 1.13 Term + 20305 20397 93 0 0 98 53 110 0.433 6.33 1.14 PlyA + 21930 21935 6 1.05 2.06 PlyA - 22021 22016 6 1.05 2.05 Term - 24456 24295 162 0 0 119 46 50 0.100 1.84 2.04 Intr - 32952 32738 215 1 2 28 89 119 0.134 4.43 2.03 Intr - 34613 34534 80 1 2 108 8 17 0.032 -5.11 2.02 Intr - 40603 40470 134 1 2 58 70 71 0.323 1.74 2.01 Init - 40781 40632 150 2 0 38 101 120 0.743 8.34 2.00 Prom - 46533 46494 40 -7.06 3.03 PlyA - 46993 46988 6 1.05 3.02 Term - 53340 53209 132 0 0 65 49 107 0.791 2.59 3.01 Init - 56625 56467 159 0 0 71 68 110 0.786 7.13 3.00 Prom - 58223 58184 40 -5.96 4.04 PlyA - 59007 59002 6 1.05 4.03 Term - 59225 59115 111 2 0 137 43 45 0.439 3.46 4.02 Intr - 70886 70789 98 0 2 79 107 112 0.804 11.93 4.01 Init - 77996 77939 58 2 1 79 105 44 0.725 6.67 4.00 Prom - 80106 80067 40 -6.06 5.00 Prom + 82499 82538 40 -3.86 5.01 Init + 82675 82754 80 0 2 83 47 6 0.169 -3.47 5.02 Intr + 85041 85293 253 0 1 83 47 118 0.503 4.64 5.03 Term + 89169 89597 429 2 0 43 48 228 0.573 9.60 5.04 PlyA + 91258 91263 6 1.05 6.00 Prom + 93124 93163 40 -5.56 6.01 Init + 94968 95025 58 2 1 75 81 40 0.267 3.48 6.02 Intr + 95081 95230 150 0 0 93 70 54 0.310 4.23 6.03 Intr + 99930 100051 122 1 2 100 91 32 0.330 4.91 6.04 Intr + 101567 101750 184 1 1 113 81 182 0.603 19.36 6.05 Intr + 114727 114834 108 2 0 112 98 109 0.911 14.56 6.06 Intr + 115477 115599 123 2 0 43 53 91 0.717 1.66 6.07 Intr + 117349 117369 21 2 0 126 91 16 0.494 3.22 6.08 Intr + 118449 118490 42 1 0 98 110 21 0.918 3.51 6.09 Intr + 120877 120986 110 2 2 87 109 85 0.988 10.50 6.10 Intr + 121473 121570 98 2 2 85 97 166 0.999 16.01 6.11 Intr + 122046 122178 133 0 1 43 74 197 0.997 14.45 6.12 Intr + 123120 123319 200 2 2 141 89 290 0.804 32.55 6.13 Intr + 124363 124432 70 1 1 96 101 19 0.841 3.18 6.14 Intr + 129380 129471 92 1 2 115 89 93 0.619 10.89 6.15 Intr + 133705 133799 95 1 2 69 76 80 0.804 4.51 6.16 Intr + 136832 136946 115 0 1 96 84 50 0.996 4.91 6.17 Intr + 139320 139400 81 0 0 90 94 44 0.938 3.95 6.18 Intr + 139973 140102 130 2 1 63 96 132 0.993 12.20 6.19 Intr + 141604 141645 42 0 0 87 94 14 0.557 0.34 6.20 Term + 143232 143420 189 2 0 78 52 205 0.999 13.35 6.21 PlyA + 148190 148195 6 1.05 7.06 PlyA - 149763 149758 6 1.05 7.05 Term - 158254 158088 167 2 2 51 50 99 0.268 0.48 7.04 Intr - 159382 159351 32 0 2 66 87 23 0.558 -2.23 7.03 Intr - 160347 160176 172 1 1 -1 98 139 0.474 5.00 7.02 Intr - 161883 161799 85 0 1 44 101 56 0.456 1.89 7.01 Init - 210512 210510 3 2 0 106 101 0 0.450 3.10 7.00 Prom - 212871 212832 40 0.44 8.04 PlyA - 213573 213568 6 1.05 8.03 Term - 216810 216694 117 0 0 94 37 37 0.598 -2.26 8.02 Intr - 217159 217083 77 2 2 63 116 11 0.544 0.63 8.01 Init - 219753 219645 109 0 1 43 98 103 0.342 7.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 101922 102028 107 1 2 117 49 50 0.968 2.57 S.002 Init - 107657 107526 132 2 0 76 64 94 0.812 5.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_1|435_aa MDRRMWGAHVFCVLSPLPTQVLGHMHPECDFITQLREDESACLQAAEEMPNTTLGCPATW DGLLCWPTAGSGEWVTLPCPDFFSHFSSESGAVKRDCTITGWSEPFPPYPVACPVPLELL AEEESYFSTVKIIYTVGHSISIVALFVAITILVALRRLHCPRNYVHTQLFTTFILKAGAV FLKDAALFHSDDTDHCSFSTVMAMGEGAGQVLCKVSVAASHFATMTNFSWLLAEAVYLNC LLASTSPSSRRAFWWLVLAGWGLPVLFTGTWVSCKLAFEDIACWDLDDTSPYWWIIKGPI VLSVGVNFGLFLNIIRILVRKLEPAQGSLHTQSQYWYCVFVSGDTGKGRLSKSTLFLIPL FGIHYIIFNFLPDNAGLGIRLPLELGLGSFQGFIVAILYCFLNQEGFADWAYDADSWADP LTGKLFLSQDSSFHN >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_1|1308_bp atggaccgccggatgtggggggcccacgtcttctgcgtgttgagcccgttaccgacccag gtattgggccacatgcacccagaatgtgacttcatcacccagctgagagaggatgagagt gcctgtctacaagcagcagaggagatgcccaacaccaccctgggctgccctgcgacctgg gatgggctgctgtgctggccaacggcaggctctggcgagtgggtcaccctcccctgcccg gatttcttctctcacttcagctcagagtcaggggctgtgaaacgggattgtactatcact ggctggtctgagccctttccaccttaccctgtggcctgccctgtgcctctggagctgctg gctgaggaggaatcttacttctccacagtgaagattatctacaccgtgggccatagcatc tctattgtagccctcttcgtggccatcaccatcctggttgctctcaggaggctccactgc ccccggaactacgtccacacccagctgttcaccacttttatcctcaaggcgggagctgtg ttcctgaaggatgctgcccttttccacagcgacgacactgaccactgcagcttctccact gtaatggccatgggtgaaggggctgggcaggttctatgcaaggtctctgtggccgcctcc catttcgccaccatgaccaacttcagctggctgttggcagaagccgtctacctgaactgc ctcctggcctccacctcccccagctcaaggagagccttctggtggctggttctcgctggc tgggggctgcccgtgctcttcactggcacgtgggtgagctgcaaactggccttcgaggac atcgcgtgctgggacctggacgacacctccccctactggtggatcatcaaagggcccatt gtcctctcggtcggggtgaactttgggctttttctcaatattatccgcatcctggtgagg aaactggagccagctcagggcagcctccatacccagtctcagtattggtactgtgtgttt gtgtctggggatactgggaaagggcgtctctccaagtcgacacttttcctgatcccactc tttggaattcactacatcatcttcaacttcctgccagacaatgctggcctgggcatccgc ctccccctggagctgggactgggttccttccagggcttcattgttgccatcctctactgc ttcctcaaccaagagggattcgcagactgggcctacgatgcagattcctgggcagatcct ttgactggcaagctcttcctctctcaggactccagcttccacaactga >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_2|246_aa MDASQRYKIPGSETKDFITHSKSTSQSCIGLLRFSDITQVPQRQCKGPMMESNTELGTFA TSLASGKKPGLFLVGGVTASLMAAHGKYSPAKQPGEEQEMKASICVSPAERPAAGPLICP PTVLQAGVKRVTREAPRPRFKLSYDVSMSVMTEQEGEEPAPVSSSAERSEHATYFMRLNR RWKWSIQHRAWHMNSSSAPPPSPSTEYSTPFQQTLQQGPCGIAQMLSDKHLQVPGASAFG AKPNPE >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_2|741_bp atggatgccagccaaagatacaagattcctgggtcagagacaaaggacttcatcactcac agcaaaagcactagccagagctgcataggattgcttcggttctccgacatcacccaagtc ccacaaagacaatgcaaagggcccatgatggagagtaacactgagcttggaacatttgcc acttctctagcgagtggaaagaaacctggtcttttcctggtgggaggtgttactgcatct ctcatggctgctcatggcaaatacagccctgcgaaacagcccggtgaggagcaagaaatg aaagccagcatctgcgtgtcacctgcagagcgccctgcagctggtcctctcatttgccct ccaacagtgctgcaggctggtgtgaagagagtgacccgggaagctccaagacccagattt aaactcagttatgacgtttccatgagtgtcatgactgagcaagaaggtgaagagcccgcg cctgtttcctcatctgcggaacggagcgaacacgccacctacttcatgagacttaacaga cgatggaagtggagcattcagcacagagcctggcacatgaactcttcctctgcaccccca cccagccctagcaccgaatactccactcccttccagcagaccctccagcagggaccctgt ggcattgcacaaatgctctctgacaaacacctacaagtccctggtgcctcagcctttggg gcaaaacccaaccctgaataa >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_3|96_aa MGDKEEEKVQSITWCQVLLPHIFPGPPHSPHRPTPEWSAAPPAGIGGAMEEVWPPEQGEN WMNSGGEHGMMMSHQTAPQWDFAFTSDPRLPIQVVE >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_3|291_bp atgggggacaaggaggaggagaaagtccagagcatcacctggtgtcaagtactcctgcct cacatcttcccaggccctccccacagcccccaccgccccacccctgaatggtcagctgct cctcctgctggcattggtggtgctatggaggaagtgtggcctcctgaacaaggagagaat tggatgaacagtggtggagagcatggcatgatgatgtcccaccagactgcacctcagtgg gacttcgccttcacatcggaccccaggcttccaattcaggttgtggaatga >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_4|88_aa MDSLGDKVLEEAQCEKQLEEGFFIDKALGSAGSVVEGKLLMGKLVTGNAQRLHLPGTDGC SRALLTPFDSEAILFLPAPCAAHHPPFS >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_4|267_bp atggacagtttgggggataaggtgctagaagaagcgcagtgtgaaaagcagttggaagag ggctttttcatcgacaaagcacttggttctgcgggcagtgtggttgaaggcaagctgctg atggggaaattagtgacagggaacgcacagaggctgcatctacctgggactgatggctgc tctcgggccttgctgacaccctttgattcggaagctattctcttcctccctgctccatgt gctgctcatcacccgcctttctcctga >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_5|253_aa MDEAGNHHSKQTITRTENQTPYVLTHRLFVRQLQGFLELLEYSLKIMDLSNLTFIPLVSQ RIPSVLGTRVHGVTEKVLDGIGSSDEDATSWLLYETSGERSALDLLGPASGGVTAAAAAP GERGPRVRTRTRRRAGTHGPRPPAGHTGAETRLGARSSAHTLPIPAPRGPRTCRLRVLCG VCPGPTDRPRSEQVRVAPPRLPLHASAVFSPSRTFAQPGPRIPSSGLAVSASRDLHRRRA GEGRRQASLPDLK >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_5|762_bp atggatgaagctggaaatcatcattctaagcaaactatcacaaggacagaaaaccaaaca ccatatgttctcactcataggctttttgtgaggcagcttcaaggatttctagagctcttg gaatacagtttgaaaatcatggatctaagcaacctaaccttcattcccttggtttctcag aggattccaagcgtcctggggaccagggtccatggggtgactgaaaaggttctagatggc attggttcatctgatgaggatgctacctcctggctcctctatgagacatcaggggaaagg agtgcacttgaccttctagggcctgcttctgggggagtgacggcggcggcggcggctccg ggcgagcgtggtccccgcgtgcgcacacgcacacgccgccgcgcagggacacacggaccc cggccgccagccggccacacaggcgcggagacccggctcggcgcgcgctcctcggcgcac acgctccccatccccgcgccgcgcgggccgcggacttgcaggctgcgcgtcctttgcgga gtctgccccggccccacggacaggccccgcagtgagcaggtaagggtcgccccgccgcgg ctgcccctccacgcctcggccgtcttctccccctccagaaccttcgcccagccgggacct cggatcccctcctctggcttggcggtctccgccagccgcgacctccaccgacggagagcg ggcgagggccggcgccaggcaagcctcccagatctgaaatga >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_6|720_aa MAGGSLSPLCTEMWKVFEPRATTSWCSDSEEDMAAVVSFTPGSVAGLMRPEEERGMVCYW DPPEFPYSPAQRHIGADLPLLSVGGQWCWPRSVMAGVVHVSLAALLLLPMAPAMHSDCIF KKEQAMCLEKIQRANELMGFNDSSPGERGGREHATSPVPAFRTVFLCSGSASCPGMWDNI TCWKPAHVGEMVLVSCPELFRIFNPDQGSYTGSRWEELDLGLPGERVPHTGSSATVLSVT CGRGLGGAVWETETIGESDFGDSNSLDLSDMGVVSRNCTEDGWSEPFPHYFDACGFDEYE SETGDQDYYYLSVKALYTVGYSTSLVTLTTAMVILCRFRKLHCTRNFIHMNLFVSFMLRA ISVFIKDWILYAEQDSNHCFISTVECKAVMVFFHYCVVSNYFWLFIEGLYLFTLLVETFF PERRYFYWYTIIGWGRFLAVAWTGSGPVVSCWDMNDSTALWWVIKGPVVGSIMVNFVLFI GIIVILVQKLQSPDMGGNESSIYLAVAQPTLHTTAAEAFLEFGPHRNCAGSPADRSDRSA NLVTPSSCVQKCYCKPQRAQQHSCKMSELSTITLAFSSTFDSVSQQEVPPAQLEIADAQG PRLARSTLLLIPLFGIHYTVFAFSPENVSKRERLVFELGLGSFQGFVVAVLYCFLNGEVQ AEIKRKWRSWKVNRYFAVDFKHRHPSLASSGVNGGTQLSILSKSSSQIRMSGLPADNLAT >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_6|2163_bp atggctggagggtctctgagtcccctctgcacggagatgtggaaagtcttcgagcccagg gccaccacttcctggtgcagcgactcagaggaggacatggctgctgtagtcagtttcact ccaggttcggtggctgggcttatgaggcctgaggaggaaaggggaatggtgtgctattgg gatccccctgaattcccttacagtccagcccagagacacattggggctgacctgccgctg ctgtcagtgggaggccagtggtgctggccaagaagtgtcatggctggtgtcgtgcacgtt tccctggctgctctcctcctgctgcctatggcccctgccatgcattctgactgcatcttc aagaaggagcaagccatgtgcctggagaagatccagagggccaatgagctgatgggcttc aatgattcctctccaggtgagcggggcggcagggagcatgccacgtccccagtgccagct tttagaactgttttcctgtgctcaggctcggccagctgtcctgggatgtgggacaacatc acgtgttggaagcccgcccatgtgggtgagatggtcctggtcagctgccctgagctcttc cgaatcttcaacccagaccaaggcagttacacgggctcaagatgggaggagctggacctt gggctacctggggagagggtgccacacacggggagcagtgccacagtgctgtcggtgact tgtggacgtgggcttggaggcgcggtctgggagaccgaaaccattggagagtctgatttt ggtgacagtaactccttagatctctcagacatgggagtggtgagccggaactgcacggag gatggctggtcggaacccttccctcattactttgatgcctgtgggtttgatgaatatgaa tctgagactggggaccaggattattactacctgtcagtgaaggccctctacacggttggc tacagcacatccctcgtcaccctcaccactgccatggtcatcctttgtcgcttccggaag ctgcactgcacacgcaacttcatccacatgaacctgtttgtgtcgttcatgctgagggcg atctccgtcttcatcaaagactggattctgtatgcggagcaggacagcaaccactgcttc atctccactgtggaatgtaaggccgtcatggttttcttccactactgtgttgtgtccaac tacttctggctgttcatcgagggcctgtacctcttcactctgctggtggagaccttcttc cctgaaaggagatacttctactggtacaccatcattggctggggtaggttcctggctgtg gcttggacaggttcaggtcccgtggtcagctgctgggatatgaatgacagcacagctctg tggtgggtgatcaaaggccctgtggttggctctatcatggttaactttgtgctttttatt ggcattatcgtcatccttgtgcagaaacttcagtctccagacatgggaggcaatgagtcc agcatctacttggctgtagcccaaccaaccctgcacaccacagctgcggaagcatttctg gagtttggccctcatagaaactgtgctggatccccggctgacagatctgacagaagtgct aatcttgtcacacccagcagctgcgtgcagaaatgctactgcaagccacagcgggctcag cagcactcttgcaagatgtcagaactgtccaccattactctagcattctcttccaccttt gactcagtttcccagcaggaggttcctcctgcccagttggagattgccgatgcccagggt ccgcgactggcccggtccaccctgctgctcatcccactattcggaatccactacacagta tttgccttctccccagagaatgtcagcaaaagggaaagactcgtgtttgagctggggctg ggctccttccagggctttgtggtggctgttctctactgttttctgaatggtgaggtacaa gcggagatcaagcgaaaatggcgaagctggaaggtgaaccgttacttcgctgtggacttc aagcaccgacacccgtctctggccagcagtggggtgaatgggggcacccagctctccatc ctgagcaagagcagctcccaaatccgcatgtctggcctccctgctgacaatctggccacc tga >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_7|152_aa MSLEKLSSMKPVPDAKKIGNRCFRRLPLCAGATDVNHRGGPNQRQGGWAVTPTSLSHWLP ATLDEKGEGGSLTPQKSASNCLQKRAGLSDCGWVTARWRGLSIATTSFPGPKCEAWNLPT PFPGSEDEDEDWAFMGELSMASTNPGASPSGF >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_7|459_bp atgtccctggaaaaattgtcttccatgaaaccagtccctgatgccaaaaagattgggaac cgctgctttagaaggctgcctctctgcgcaggagccactgatgtcaaccacagaggtggt cccaaccagaggcaaggcggctgggcggttacccctacatccctcagtcactggctaccg gccaccctggatgagaaaggtgaagggggtagtcttacccctcagaaatctgccagcaat tgtctgcagaagagggcagggctaagtgactgtggttgggtcacggcccgctggaggggc ctttccattgccactacatcatttccagggcctaagtgtgaggcctggaacctgcctact cctttcccaggaagtgaggatgaggatgaggattgggctttcatgggagaactctccatg gcttctaccaaccctggggccagtcccagtggattctag >gi568815591f:30963265_31206681|GENSCAN_predicted_peptide_8|100_aa MNIDAKISNKILANQIQQHIKKQIHYDQVGFIPEMQGEIDALPRIGLGVKTKAACPLCGK EKHFQLSFSLTPTSAPNPEGPLGFGPESRVMLSGGMGTSG >gi568815591f:30963265_31206681|GENSCAN_predicted_CDS_8|303_bp atgaacatagatgcaaaaatctccaacaaaatactagcaaatcaaatccagcagcacatc aaaaagcaaatccactatgatcaagtaggctttatccctgagatgcaaggtgaaattgat gcactgcccagaataggcttgggggtaaaaaccaaggctgcttgccctctgtgcggaaag gaaaagcatttccagctttccttctcactcacaccaacatcagctcctaatcctgaaggc cctctgggtttcggtccagagagcagagtgatgctctctggtggtatgggcacctctgga tag