GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:31:50 Sequence gi568815595f:49948254_50177130 : 228877 bp : 47.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 20025 20495 471 2 0 72 91 334 0.601 27.63 1.02 Intr + 23806 23895 90 0 0 37 94 92 0.952 4.89 1.03 Intr + 27070 27139 70 0 1 41 109 65 0.795 2.65 1.04 Intr + 51187 51260 74 2 2 75 9 145 0.084 4.43 1.05 Intr + 98463 98580 118 2 1 57 95 77 0.803 5.34 1.06 Intr + 99284 99411 128 0 2 78 59 25 0.826 -1.00 1.07 Intr + 99992 100066 75 1 0 117 91 72 0.964 10.11 1.08 Intr + 106082 106142 61 1 1 102 113 9 0.779 3.11 1.09 Intr + 110149 110309 161 2 2 46 101 204 0.741 17.21 1.10 Intr + 111396 111493 98 2 2 57 116 103 0.998 8.81 1.11 Intr + 112887 112968 82 0 1 75 84 45 0.619 2.44 1.12 Intr + 113175 113294 120 2 0 11 109 72 0.555 2.29 1.13 Intr + 113709 113855 147 2 0 51 90 109 0.944 7.83 1.14 Intr + 116784 116873 90 2 0 31 63 124 0.843 4.39 1.15 Intr + 117989 118249 261 1 0 89 62 292 0.999 24.38 1.16 Intr + 120437 120511 75 1 0 65 82 73 0.932 4.11 1.17 Intr + 122202 122299 98 2 2 68 115 58 0.952 5.31 1.18 Intr + 126948 127077 130 0 1 62 83 107 0.999 8.30 1.19 Term + 128755 128880 126 0 0 97 48 145 0.999 9.68 1.20 PlyA + 128975 128980 6 1.05 2.00 Prom + 130433 130472 40 -3.76 2.01 Init + 142182 142198 17 2 2 83 115 10 0.986 2.88 2.02 Intr + 143790 143955 166 0 1 48 105 121 0.978 9.76 2.03 Intr + 145467 145622 156 2 0 79 67 215 0.749 18.71 2.04 Intr + 154830 154913 84 2 0 102 121 6 0.777 5.32 2.05 Intr + 155995 156055 61 0 1 52 91 10 0.794 -3.99 2.06 Intr + 156824 156889 66 0 0 95 92 58 0.978 5.78 2.07 Intr + 157296 157456 161 1 2 75 77 175 0.969 14.81 2.08 Intr + 158514 158611 98 2 2 95 87 13 0.978 0.71 2.09 Intr + 159229 159316 88 1 1 66 110 77 0.995 7.67 2.10 Intr + 159817 159894 78 0 0 43 96 41 0.524 0.15 2.11 Intr + 162126 162210 85 2 1 88 103 17 0.967 2.59 2.12 Intr + 162426 162517 92 1 2 76 38 155 0.917 9.01 2.13 Intr + 165130 165291 162 0 0 83 66 131 0.993 10.57 2.14 Intr + 165927 165998 72 2 0 98 78 60 0.964 5.60 2.15 Intr + 167175 167354 180 2 0 120 32 223 0.947 19.96 2.16 Intr + 168821 168945 125 1 2 65 62 244 0.959 18.88 2.17 Intr + 168997 169126 130 1 1 85 99 179 0.992 19.40 2.18 Term + 170078 170203 126 1 0 98 54 135 0.994 9.38 2.19 PlyA + 170980 170985 6 -0.45 3.07 PlyA - 171627 171622 6 1.05 3.06 Term - 171800 171770 31 1 1 44 48 42 0.181 -6.87 3.05 Intr - 172758 172659 100 1 1 60 93 110 0.408 7.87 3.04 Intr - 174335 174272 64 2 1 140 94 4 0.570 4.49 3.03 Intr - 175638 175612 27 0 0 118 90 26 0.687 4.11 3.02 Intr - 180450 180345 106 2 1 109 94 -8 0.432 2.12 3.01 Init - 182677 182607 71 1 2 98 80 57 0.752 5.94 3.00 Prom - 184709 184670 40 -6.96 4.00 Prom + 198588 198627 40 -4.86 4.01 Init + 202296 202362 67 2 1 91 92 46 0.084 6.53 4.02 Intr + 203346 203455 110 1 2 27 36 114 0.049 0.10 4.03 Intr + 207138 207311 174 2 0 26 105 167 0.205 12.34 4.04 Intr + 211439 211481 43 1 1 96 116 4 0.110 1.81 4.05 Intr + 222567 222767 201 1 0 91 47 49 0.287 0.36 4.06 Intr + 225540 225730 191 1 2 80 35 488 0.530 42.00 4.07 Intr + 225799 225861 63 0 0 105 92 75 0.389 8.51 4.08 Intr + 225978 226097 120 2 0 75 115 144 0.977 16.49 4.09 Intr + 226843 226935 93 0 0 135 80 15 0.933 5.56 4.10 Intr + 228515 228608 94 1 1 111 70 145 0.891 14.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 140819 140460 360 2 0 65 49 217 0.893 11.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:49948254_50177130|GENSCAN_predicted_peptide_1|824_aa MPPVDPNILDYIQPSTQDREHSGMNVNRREESTHDHTIERPAFGIQKGEFEHSETREGET QGVAFEHESPADFQNSQSPVQDQDKSQLSGREEQSSDAGLFKEEGGLDFLGRQDTDYRSM EYRDVDHRLPGSQMFGYGQSKSFPEGKTARDAQRDLQDQDYRTGPSEEKPSRLIRLSGVP EDATKEEILNAFRTPDGMPVKNLQLKEYNTGYDYGYVCVEFSLLEDAIGCMEANQARLPV PNTEQKLVNETGDPSVKTKGKRKMKQEIPSLQPRYFPKGLLANGCSSDLRASSLYRLQAR VIPPPPLLLPPANQFPAGTLMIQDKEVTLEYVSSLDFWYCKRCKANIGGHRSSCSFCKNP REAIMLKRIYRSTPPEVIVEVLEPYVRLTTANVRIIKNRTGPMGHTYGFIDLDSHAEALR VVKILQNLDPPFSIDGKMVAVNLATGKRRVKNISEIGGEVAEIQTGLQIQIDKDNSRLTL ALILLPLASSDCYIYDSATGYYYDPLAGTYYDPNTQQEVYVPQDPGLPEEEEIKEKKPTS QGKSSSKKEMSKRDGKEKKDRGVTRENASEGKAPAEDVFKKPLPPTVKKEESPPPPKVVN PLIGLLGEYGGDSDYEEEEEEEQTPPPQPRTAQPQKREEQTKKENEEDKLTDWNKLACLL CRRQFPNKEVLIKHQQLSDLHKQNLEIHRKIKQSEQELAYLERREREGKFKGRGNDRREK LQSFDSPERKRIKYSRETDSDRKLVDKEDIDTSSKGGCVQQATGWRKGTGLGYGHPGLAS SEEAEGRMRGPSVGASGRTSKRQSNETYRDAVRRVMFARYKELD >gi568815595f:49948254_50177130|GENSCAN_predicted_CDS_1|2475_bp atgccccctgtggatccaaatattttggattacattcagccctctacacaagatagagaa cattctggtatgaatgtgaacaggagagaagaatccacacatgaccatacgatagaaagg cctgcttttggcattcagaagggagaatttgagcattcagaaacaagagaaggagaaaca caaggtgtagcctttgaacatgagtctccagcagactttcagaacagccaaagtccagtt caagaccaagataagtcacagctttctggacgtgaagagcagagttcagatgctggtctg tttaaagaagaaggcggtctggactttcttgggcggcaagacaccgattacagaagcatg gagtaccgtgatgtggatcataggctgccaggaagccagatgtttggctatggccagagc aagtcttttccagagggcaaaactgcccgagatgcccaacgggaccttcaggatcaagat tataggaccggcccaagtgaggagaaacccagcaggcttattcgattaagtggggtacct gaagatgccacaaaagaagagattcttaatgcttttcggactcctgatggcatgcctgta aagaacttgcagttgaaggagtataacacaggttacgactatggctatgtctgcgtggag ttttcactcttggaagatgccatcggatgcatggaggccaaccaggccagattacctgtc cctaatactgagcagaagctggtgaatgaaacaggagatccctcagtcaaaacaaaagga aaaagaaaaatgaaacaggagatcccttctctacagcccagatactttcccaaaggctta cttgccaatggttgctcctcagatctcagggctagctcactctataggctccaagccaga gtgataccgccgccgccgctgttgctcccaccagccaatcagtttcctgctggaactcta atgatccaggacaaagaagttaccctggagtatgtatcaagcctggatttttggtactgc aaacgatgtaaggcaaacattggtgggcaccgatcttcctgttcattctgcaagaaccca agagaagctatcatgctaaagcgtatctatcgttccacaccacctgaggtgatagtggaa gtgctggagccctatgtccgccttactactgccaacgtccgtatcatcaagaacagaaca ggccctatggggcatacctatggctttattgacctcgactcccatgcggaagctcttcgt gtggtgaagatcttacagaaccttgatccgccatttagcattgatgggaagatggtagct gtaaacctggccactggaaaacgaagggtaaaaaatatttccgagataggaggggaggtg gcagaaattcagactggtcttcagatacaaatcgacaaggacaacagtagactaacattg gctctgatcttgttacctttagcatcatctgactgctacatatatgattctgctactggc tactattatgaccccttggcaggaacttattatgaccccaatacccagcaagaagtctat gtgccccaggatcctggattacctgaggaagaagagatcaaggaaaaaaaacccaccagt caaggaaagtcaagtagcaagaaggaaatgtctaaaagagatggcaaggagaaaaaagac agaggagtgacgagggaaaatgccagtgaagggaaggcccctgcagaagacgtctttaag aagcccctgcctcctactgtgaagaaggaagagagtccccctccacctaaagtggtaaac ccactgatcggcctcttgggtgaatatggaggagacagtgactatgaggaggaagaagag gaggaacagacccctcccccacagccccgcacagcacagccccagaagcgagaggagcaa accaagaaggagaatgaagaagacaaactcactgactggaataaactggcttgtctgctt tgcagaaggcagtttcccaataaagaagttctgatcaaacaccagcagctgtcagacctg cacaagcaaaacctggaaatccaccggaagataaaacagtctgagcaggagctagcctat ctggaaaggagagaacgagagggaaagtttaaaggaagaggaaatgatcgcagggaaaag ctccagtcttttgactctccagaaaggaaacggattaagtactccagggaaactgacagt gatcgtaaacttgttgataaagaagatatcgacactagcagcaaaggaggctgtgtccaa caggctactggctggaggaaagggacaggcctgggatatggccatcctggattggcttca tcagaggaggctgaaggccggatgaggggccccagtgttggagcctcaggaagaaccagc aaaagacagtccaacgagacttaccgagatgctgttcgaagagtcatgtttgctcgatat aaagaactcgattaa >gi568815595f:49948254_50177130|GENSCAN_predicted_peptide_2|648_aa MGSDKRVSRTERSGRYGSIIDRDDRDERESRSRRRDSDYKRSSDDRRGDRYDDYRDYDSP ERERERRNSDRSEDGYHSDGDYGEHDYRHDISDERESKTIMLRGLPITITESDKKLVIQG KHIAMHYSNPRPKFEDWLCNKCCLNNFRKRLKCFRCGADKFDSEQEVPPGTTESVQSVDY YCDTIILRNIAPHTVVDSIMTALSPYASLAVNNIRLIKDKQTQQNRGFAFVQLSSAMDAS QLLQILQSLHPPLKIDGKTIGVDFAKSARKDLVLSDGNRVSAFSVASTAIAAAQWSSTQS QSGEGGSVDYSYLQPGQDGYAQYAQTEEAQPSTSTSTQAPAASPTGVVPGTKYAVPDTST YQYDESSGYYYDPTTGLYYDPNSQYYYNSLTQQYLYWDGEKETYVPAAESSSHQQSGLPP AKEGKEKKEKPKSKTAQQGALAERQQLIPELVRNGDEENPLKRGLVAAYSGDSDNEEELV ERLESEEEKLADWKKMACLLCRRQFPNKDALVRHQQLSDLHKMKYRDRAAERREKYGIPE PPEPKRKKQFDAGTVYVMCTFSSSNYEQPTKDGIDHSNIGNKMLQAMGWREGSGLGRKCQ GITAPIEAQVRLKGAGLGAKGSAYGLSGADSYKDAVRKAMFARFTEME >gi568815595f:49948254_50177130|GENSCAN_predicted_CDS_2|1947_bp atgggttcagacaaaagagtgagtagaacagagcgtagtggaagatacggttccatcata gacagggatgaccgtgatgagcgtgaatcccgaagcaggcggagggactcagattacaaa agatctagtgatgatcggaggggtgatagatatgatgactaccgagactatgacagtcca gagagagagcgtgaaagaaggaacagtgaccgatccgaagatggctaccattcagatggt gactatggtgagcacgactataggcatgacatcagtgacgagagggagagcaagaccatc atgctgcgcggccttcccatcaccatcacagagagcgataaaaagttggtgattcaagga aagcacattgcaatgcattatagcaatcccagacctaagtttgaagattggctttgtaac aagtgctgccttaacaatttcaggaaaagactaaaatgcttccgatgtggagcagacaag tttgactctgaacaggaagtgcctcctggaaccacagagtcggttcagtctgtggattac tactgtgatacgatcattcttcggaacatagctccgcacactgtggtggattccatcatg acagcactgtctccttacgcgtctttagctgtcaataacatccgcctcataaaagacaaa cagacccagcagaacagaggcttcgcatttgtgcagctgtcctctgcaatggatgcttct cagctgcttcagatattacagagtctccatcctcctttgaaaattgatggcaaaactatt ggggttgattttgcaaaaagtgccagaaaagacttggtcctctcagatggtaaccgcgtc agcgctttctctgtagctagtacggctattgctgctgctcagtggtcatccacccagtct caaagtggtgaaggaggcagtgttgactacagttatctgcaaccaggtcaagatggctat gcccaatatgctcagactgaggaagcacagcctagcactagcacaagtacacaggcccca gccgcttcccctactggtgtagttcctggtaccaaatatgcagtacctgacacgtccact taccagtatgatgaatcttcaggatattactatgatccgacaacagggctctattatgac cccaactcgcaatactactataattccttgacccagcagtacctttactgggatggggaa aaagagacctacgtgccagctgcagagtctagctcccaccagcagtcgggcctgcctcct gcaaaagaggggaaagagaagaaggagaaacccaagagcaaaacagcccagcagggagcc ttagctgaaaggcagcagctcatcccagaattggtgcgaaatggagatgaggagaatccc ctcaaaaggggtctggttgctgcttacagtggtgacagtgacaatgaggaggagctggtg gagagacttgagagtgaggaagagaagctagctgactggaagaagatggcctgtctgctc tgccggcgccagttcccgaacaaagatgccctagtcaggcaccagcaactctcagacctt cacaagatgaaataccgagaccgagctgcagaaagacgggagaagtacggcattccagaa cctccagagcccaagcgcaagaagcagtttgatgccggcactgtgtatgtgatgtgcaca ttttccagttcgaattacgagcaacccaccaaagatggcattgaccacagtaacattggc aacaagatgctgcaggccatgggctggcgggaaggctctggcttgggacgaaagtgtcaa ggcattacggctcccattgaggctcaagttcggctaaagggagctggcctaggagccaaa ggcagcgcatatggtttgtcgggcgccgattcctacaaagatgctgtccggaaagccatg tttgcccggttcactgagatggagtga >gi568815595f:49948254_50177130|GENSCAN_predicted_peptide_3|132_aa MASAQEFRGDTGASGGLLPPGLPRTMPCGVHPLQPQRLQMVPHSSPGLYPNTQEHLEPQV WLQVLEGKGTGRHPKTQLTPGLDLNLVPAGTLSFMFIGKATLVLTTDAVVHSTHGRALSS FDRTSLAAEKMQ >gi568815595f:49948254_50177130|GENSCAN_predicted_CDS_3|399_bp atggcctcggcacaggaattccgaggggatactggagcctcgggcgggctgctgcctcct gggctgcccaggaccatgccatgtggggtgcatcccctgcagccccagagactacaaatg gtgccccatagtagcccgggcctatatccaaacacacaggagcacctggaacctcaggta tggctgcaggtgttagaagggaagggcactgggaggcatcccaagacccagctcactcca gggctggacctcaaccttgtacctgcaggaaccctcagcttcatgttcataggaaaggcc acgctggtgttgacaacagatgccgtggtccacagcacacatgggagagccttgagcagc tttgacaggacctccttagcagctgagaagatgcagtga >gi568815595f:49948254_50177130|GENSCAN_predicted_peptide_4|386_aa MRTGFLGDDACLDLEGDKKDIPGGGWANDFPVVTLTVGSVTPHKLQDLKTMKSTLKLSQP AEPVKPRPRRRSRLRRSGERSRAGPWPRGAASADRPNGSRSVPPPPPGRPGPAAAEEDHL PATPRVRLSFKVTKASRPVTPEPLTPYPVATGRGPAGPESCALSTGSESPSGPGPPGRQS LPPGLASPGPPPPSAMDLELKATGTAHFFNFLLNTTDYRILLKDEDHDRMYVGSKDYVLS LDLHDINREPLIVRAGPDVGRGIHWAASPQRIEECVLSGKDVNGECGNFVRLIQPWNRTH LYVCGTGAYNPMCTYVNRGRRAQATPWTQTQAVRGRGSRATDGALRPMPTAPRQDYIFYL EPERLESGKGKCPYDPKLDTASALIX >gi568815595f:49948254_50177130|GENSCAN_predicted_CDS_4|1158_bp atgaggactggcttcctaggagatgatgcctgtctggatctagaaggagacaagaaagac attccaggtgggggctgggccaatgacttccctgttgtaaccctcactgttggctcggtc acccctcacaagctccaggacctcaagactatgaagtccaccctgaagctcagccagccc gcggaaccggttaagccgcggccgcggcgccgatcccggctgaggcgcagcggcgagagg tcgcgggcagggccatggccccggggggccgctagcgcggaccggcccaacgggagccgc tccgtgccgccgccgccgcccgggcgcccaggccccgccgctgcggaagaggaccacctc ccggccacgccccgggtccggctctcattcaaagtgaccaaagcgtcacgtcctgtgaca ccagagccattgacgccatatcccgtggccactgggaggggtcccgcaggtcctgagtca tgtgccctgtccacggggagcgagtcaccttccggccctggccctcctggcaggcagtct ctaccaccaggactggcttcccctgggcctccccctccttcagccatggacctagagctg aaggccacaggcaccgcccacttcttcaacttcctgctcaacacaaccgactaccgaatc ttgctcaaggacgaggaccacgaccgcatgtacgtgggcagcaaggactacgtgctgtcc ctggacctgcacgacatcaaccgcgagcccctcattgtaagggctggccctgatgtggga cgtgggatacactgggcagcctccccacagcgcatcgaggaatgcgtgctctcaggcaag gatgtcaacggcgagtgtgggaacttcgtcaggctcatccagccctggaaccgaacacac ctgtatgtgtgcgggacaggtgcctacaaccccatgtgcacctatgtgaaccgcggacgc cgcgcccaggccacaccatggacccagactcaggcggtcagaggccgcggcagcagagcc acggatggtgccctccgcccgatgcccacagccccacgccaggattacatcttctacctg gagcctgagcgactcgagtcagggaagggcaagtgtccgtacgatcccaagctggacaca gcatcggccctcatcann