GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:55:27 Sequence gi568815597f:35788425_36019604 : 231180 bp : 41.58% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9292 9425 134 0 2 84 87 86 0.821 7.76 1.02 Intr + 19887 20011 125 0 2 66 93 140 0.071 11.61 1.03 Intr + 28458 28623 166 1 1 56 110 162 0.555 13.30 1.04 Intr + 34438 34558 121 1 1 20 110 80 0.033 2.98 1.05 Intr + 36889 37070 182 0 2 86 88 130 0.990 10.54 1.06 Intr + 37255 37391 137 1 2 106 58 63 0.992 4.39 1.07 Intr + 37502 37636 135 0 0 57 107 82 0.985 6.62 1.08 Intr + 43113 43150 38 1 2 48 107 -15 0.110 -6.54 1.09 Intr + 43388 43507 120 1 0 108 65 105 0.976 9.97 1.10 Intr + 43633 43761 129 0 0 96 102 72 0.988 9.37 1.11 Intr + 44013 44146 134 2 2 84 87 93 0.998 7.32 1.12 Intr + 45566 45750 185 2 2 70 56 119 0.809 5.41 1.13 Intr + 47410 47569 160 2 1 40 116 61 0.624 2.22 1.14 Intr + 52741 53056 316 1 1 18 78 400 0.876 27.14 1.15 Intr + 53192 53326 135 1 0 123 90 61 0.998 9.64 1.16 Intr + 61733 61834 102 1 0 146 91 59 0.983 11.45 1.17 Intr + 62430 62599 170 2 2 68 64 99 0.497 3.32 1.18 Intr + 84935 85020 86 2 2 67 89 71 0.003 3.64 1.19 Intr + 95779 95892 114 2 0 83 47 105 0.035 5.40 1.20 Intr + 100003 100186 184 2 1 109 131 177 0.957 22.02 1.21 Intr + 104133 104253 121 0 1 96 90 109 0.989 11.48 1.22 Intr + 104673 104854 182 2 2 47 57 214 0.633 11.94 1.23 Intr + 105250 105386 137 1 2 92 101 46 0.814 5.59 1.24 Intr + 105613 105813 201 2 0 92 51 268 0.889 21.84 1.25 Intr + 105891 105978 88 1 1 27 105 84 0.849 2.11 1.26 Intr + 106698 106845 148 0 1 100 78 29 0.784 2.42 1.27 Intr + 113050 113169 120 0 0 114 65 144 0.999 14.47 1.28 Intr + 113524 113646 123 0 0 116 88 200 0.996 22.66 1.29 Intr + 113780 113913 134 1 2 80 80 140 0.970 10.92 1.30 Intr + 118511 118695 185 2 2 89 47 206 0.957 15.11 1.31 Intr + 125418 125611 194 1 2 91 76 118 0.948 9.19 1.32 Intr + 126924 127118 195 2 0 56 91 238 0.976 19.59 1.33 Intr + 129169 129303 135 0 0 82 72 88 0.988 6.44 1.34 Intr + 129898 129999 102 0 0 60 70 80 0.874 2.85 1.35 Intr + 130631 130830 200 1 2 118 94 186 0.996 19.43 1.36 Intr + 131320 131526 207 1 0 38 87 137 0.847 5.87 1.37 Intr + 135271 135391 121 1 1 90 59 45 0.052 1.38 1.38 Intr + 139003 139195 193 0 1 51 71 93 0.023 2.04 1.39 Intr + 140703 140865 163 1 1 84 90 5 0.007 -1.59 1.40 Intr + 142683 143021 339 0 0 20 109 145 0.006 3.46 1.41 Term + 143181 143310 130 0 1 113 39 105 0.904 4.77 1.42 PlyA + 145725 145730 6 1.05 2.00 Prom + 153172 153211 40 -5.85 2.01 Init + 157324 157439 116 0 2 112 110 71 0.730 11.43 2.02 Intr + 178531 178651 121 1 1 94 78 129 0.962 12.08 2.03 Intr + 183600 183808 209 2 2 93 31 156 0.615 7.35 2.04 Intr + 194197 194288 92 1 2 72 110 21 0.295 1.42 2.05 Intr + 203805 203921 117 1 0 62 56 116 0.244 5.32 2.06 Intr + 215917 216051 135 2 0 77 63 121 0.969 8.12 2.07 Intr + 217295 217434 140 0 2 50 98 90 0.978 5.46 2.08 Intr + 220486 220620 135 0 0 86 19 99 0.774 2.64 2.09 Intr + 221051 221170 120 1 0 69 45 78 0.668 1.37 2.10 Intr + 225206 225328 123 1 0 90 70 179 0.865 16.16 2.11 Term + 225491 225628 138 1 0 83 47 70 0.824 -0.62 2.12 PlyA + 227439 227444 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 95311 94948 364 0 1 94 47 246 0.842 15.36 S.002 Intr - 96480 96410 71 0 2 58 90 109 0.932 5.16 S.003 Init + 99586 99661 76 0 1 77 -22 26 0.883 -8.19 S.004 Init + 142521 142716 196 2 1 40 44 214 0.854 9.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:35788425_36019604|GENSCAN_predicted_peptide_1|2096_aa MAYKIEIGELEYKSEFECRLISFHNSFPLLHLKEEKEKSIGVVKNANIPEIKRYAAAAAA AAGPGAGGAGDRGEAAPAAAMEALGPGPPASLFQPPRRPGLGTVGKPIRLLANHFQVQIP KIDVYHYDVDIKPEKRPRRVNREVVDTMVRHFKMQIFGDRQPGYDGKRNMYTAHPLPIGR DRVDMEVTLPGEGKDQTFKVSVQWVSVVSLQLLLEALAGHLNEVPDDSVQALDVITRHLP SMRYTPVGRSFFSPPEGYYHPLGGGREVWFGFHQSVRPAMWNMMLNIDVSATAFYRAQPI IEFMCEVLDIQNINEQTKPLTDSQRVKFTKEIRVGQEQKHTYLPLEVCNIVAGQRCIKKL TDNQTSTMIKATARSAPDRQEEISRLVKSNSMVGGPDPYLKEFGIVVHNEMTELTGRVLP APMLQYGGRNKTVATPNQGVWDMRGKQFYAGIEIKVWAVACFAPQKQCREDLLKSFTDQL RKISKDAGMPIQGQPCFCKYAQGADSVEPMFKHLKMTYVGLQLIVVILPGKTPVYAEVKR VGDTLLGMATQCVQVKNVVKTSPQTLSNLCLKINAKLGGINNVLVPHQRPSVFQQPVIFL GADVTHPPAGDGKKPSIAAVVGSMDGHPSRYCATVRVQTSRQEISQELLYSQEVIQDLTN MVRELLIQFYKSTRFKPTRIIYYRGGVSEGQMKQVAWPELIAIRKACISLEEDYRPGITY IVVQKRHHTRLFCADKTERVGKSGNVPAGTTVDSTITHPSEFDFYLCSHAGIQGTSRPSH YQVLWDDNCFTADELQLLTYQLCHTYVRCTRSVSIPAPAYYARLVAFRARPLVALDVLEA LLNGLYWSRVDLTHLDDGGALTLGPQSQFAKNGLREAAKVHFSATGKTGTEAMGSGAAGA YLPPLQQVFQAPRRPGIGTVGKPIKLLANYFEVDIPKIDVYHYEVDIKPDKCPRRVNREV VEYMVQHFKPQIFGDRKPVYDGKKNIYTVTALPIGNERVDFEVTIPGEGKDRIFKVSIKW LAIVSWRMLHEALVSGQIPVPLESVQALDVAMRHLASMRYTPVGRSFFSPPEGYYHPLGG GREVWFGFHQSVRPAMWKMMLNIDVSATAFYKAQPVIEFMCEVLDIRNIDEQPKPLTDSQ RVRFTKEIKGEDPTGGEGKQRHFSPKRKSPWGLKVEVTHCGQMKRKYRVCNVTRRPASHQ TFPLQLESGQTVECTVAQYFKQKYNLQLKYPHLPCLQVGQEQKHTYLPLEVCNIVAGQRC IKKLTDNQTSTMIKATARSAPDRQEEISRLMKNASYNLDPYIQEFGIKVKDDMTEVTGRV LPAPILQYGGRNRAIATPNQGVWDMRGKQFYNGIEIKVWAIACFAPQKQCREEVLKNFTD QLRKISKDAGMPIQGQPCFCKYAQGADSVEPMFRHLKNTYSGLQLIIVILPGKTPVYAEV KRVGDTLLGMATQCVQVKNVVKTSPQTLSNLCLKINVKLGGINNILVPHQRYELCCPLAL VKVVGSMDAHPSRYCATVRVQRPRQEIIEDLSYMVRELLIQFYKSTRFKPTRIIFYRDGV PEGQLPQILHYELLAIRDACIKLEKDYQPGITYIVVQKRHHTRLFCADKNERIGKSGNIP AGTTVDTNITHPFEFDFYLCSHAGIQGTSRPSHYYVLWDDNRFTADELQILTYQLCHTYV RCTRSVSIPAPAYYARLVAFRARYHLVDKEHDRGGVRVGNRASKTDHQPEISDINLMSPT PHPILSHLALTPLDQKGQHWCPPYTQVSHVTHSAKDSCLTAWERGWVGALPYSTHTDLEP LAAVWASKKGRGRNELKTKQNEFASWSQYGYCNSGKEMLTEPCLAKERQIFLCPSKARKS SPGSSNRYGLMALLEMGHLTPVIGKAKQTQLGMSYPNIGCTCSQERASYSFAFGRILQTS LSSYFLILQIMSLSHALTLKRSAAAARSRGGAVGASAAAGDATSGRDSPLSAPHISPSSR LVLCRFPSATLPAQSFRSAVAQGKPSPPPPRGRVRVPVASRRVAPRAASLPPVAGSVLPR STPPSSMNGNRLRSPAPVPRGPLSRRNCEAGKRLEDLSLGLSRRHYSLLEYPSSRL >gi568815597f:35788425_36019604|GENSCAN_predicted_CDS_1|6291_bp atggcctataaaattgaaattggagaattagaatataaatcggaatttgaatgcaggctt atctcatttcataattcattccctttacttcatcttaaggaagagaaagaaaaatcaatt ggtgtggttaaaaacgccaatattccggagatcaagcgttacgcggcggcggcggcggcg gcggcggggcccggagcgggaggcgccggggaccggggcgaggcggcccccgccgccgcc atggaggcgctgggacccggacctccggctagcctgtttcagccacctcgtcgtcctggc cttggaactgttggaaaaccaattcgactgttagccaatcattttcaggttcagattcct aaaatagatgtgtatcactatgatgtggatattaagcctgaaaaacggcctcgtagagtc aacagggaggtagtagatacaatggtgcggcacttcaagatgcaaatatttggtgatcgg cagcctgggtatgatggcaaaagaaacatgtacacagcacatccactaccaattggacgg gatagggttgatatggaggtgactcttccaggcgagggtaaagaccaaacatttaaagtg tctgttcagtgggtgtcagttgtgagccttcagttgcttttagaagctttggctgggcac ttgaatgaagtcccagatgactcagtacaagcacttgatgttatcacaagacaccttccc tccatgaggtacaccccagtgggccgttcctttttctcacccccggaaggttactaccac cctctgggagggggcagggaggtctggtttggttttcatcagtctgtgagacctgccatg tggaatatgatgctcaacattgatgtatctgcaactgctttctaccgggctcagcctatc attgagttcatgtgtgaggttttagacattcagaacatcaatgaacagaccaaacctcta acagactcccagcgtgtcaaatttaccaaagaaatcagagtgggacaagaacaaaagcat acatacttgccactcgaggtctgtaatatagtggcaggacagcgatgtatcaagaagctc acagacaatcagacttccacaatgatcaaagctacagcaagatctgctcctgacagacag gaagagatcagtagactggtgaagagcaacagtatggtgggtggacctgatccatacctt aaagaatttggtattgttgtccacaatgaaatgacagagctcacaggcagggtacttcca gcaccaatgctgcaatatggaggccggaataaaacagtagccacacccaaccagggtgtc tgggacatgcgaggaaagcagttttatgctggcattgaaattaaagtttgggcagttgct tgttttgcacctcagaaacaatgtagggaagatttactaaagagtttcactgaccagctg cgtaaaatctctaaggatgcaggaatgcccatccagggtcagccatgtttctgcaagtat gcacaaggtgcagacagtgtggagcctatgtttaaacatctgaaaatgacttatgtgggc ctacagctaatagtggttatcctgcctggaaagacaccagtatatgcggaggtgaaacgt gttggagatacccttctaggtatggccacacagtgtgtccaggtaaaaaatgtagtgaag acctcacctcaaaccctttccaatctttgcctgaagataaatgcaaaacttggaggaatt aacaatgtgcttgtgcctcatcaaaggccctcggtgttccagcagcctgtcatcttcctg ggagcggatgtcacacaccccccagcaggggatgggaagaaaccttccattgctgctgtg gttggcagtatggatggccaccccagccggtactgtgccaccgttcgggtgcagacttcc cggcaggagatctcccaagagctcctctacagtcaagaggtcatccaggacctgactaac atggttcgagagctgctgattcagttctacaaatccacacgcttcaaacccactcggatc atctattaccgtggaggggtatctgagggacaaatgaaacaggtagcttggccagaacta atagcaattcgaaaggcatgtattagcttggaagaagattaccggccaggaataacttat attgtggtgcaaaaaagacatcacacacgactcttctgtgcagataaaacagaaagggta gggaaaagtggcaatgtaccagcaggcactacagtggatagtaccatcacacatccatct gagtttgacttttacctctgtagtcatgcaggaattcagggaaccagccgtccctcacat taccaggtcttgtgggatgacaactgcttcactgcagatgaactccagctactgacttac cagctgtgtcacacctatgtgaggtgcactcgctcagtctctattccagcccctgcatat tatgcccggcttgtagcatttagggcaagaccactggtagcactggatgtcctggaagct cttcttaatgggctgtattggtcaagagtagatctcactcatctcgatgatggtggggct cttacacttggccctcagtcacagtttgctaagaatgggttgagggaagctgccaaagtg catttttctgccacaggaaagactgggaccgaagcgatgggttctggggctgcgggcgct tacctgccccccctgcagcaggtgttccaggcacctcgccggcctggcattggcactgtg gggaaaccaatcaagctcctggccaattactttgaggtggacatccctaagatcgacgtg taccactacgaggtggacatcaagccggataagtgtccccgtagagtcaaccgggaagtg gtggaatacatggtccagcatttcaagcctcagatctttggtgatcgcaagcctgtgtat gatggaaagaagaacatttacactgtcacagcactgcccattggcaacgaacgggtcgac tttgaggtgacaatccctggggaagggaaggatcgaatctttaaggtctccatcaagtgg ctagccattgtgagctggcgaatgctgcatgaggccctggtcagcggccagatccctgtt cccttggagtctgtgcaagccctggatgtggccatgaggcacctggcatccatgaggtac acccctgtgggccgctccttcttctcaccgcctgagggctactaccacccgctggggggt gggcgcgaggtctggttcggctttcaccagtctgtgcgccctgccatgtggaagatgatg ctcaacattgatgtctcagccactgccttttataaggcacagccagtgattgagttcatg tgtgaggtgctggacatcaggaacatagatgagcagcccaagcccctcacggactctcag cgcgttcgcttcaccaaggagatcaagggtgaggacccaacaggaggggaagggaaacag cgccactttagccctaagaggaaatccccttggggcctgaaggtggaagtcacccactgt ggacagatgaagaggaagtaccgcgtgtgtaatgttacccgtcgccctgctagccatcag acattccccttacagctggagagtggacagactgtggagtgcacagtggcacagtatttc aagcagaaatataaccttcagctcaagtatccccatctgccctgcctacaagttggccag gaacaaaagcatacctaccttcccctagaggtctgtaacattgtggctgggcagcgctgt attaaaaagctgaccgacaaccagacctcgaccatgataaaggccacagctagatccgct ccagacagacaggaggagatcagtcgcctgatgaagaatgccagctacaacttagatccc tacatccaggaatttgggatcaaagtgaaggatgacatgacggaggtgacagggcgagtg ctgccggcgcccatcttgcagtacggcggccggaaccgggccattgccacacccaatcag ggtgtctgggacatgcgggggaaacagttctacaatgggattgagatcaaagtctgggcc atcgcctgcttcgcaccccaaaaacagtgtcgagaagaggtgctcaagaacttcacagac cagctgcggaagatttccaaggatgcggggatgcctatccagggtcaaccttgtttctgc aaatatgcacagggggcagacagcgtggagcctatgttccggcatctcaagaacacctac tcagggctgcagctcattattgtcatcctgccagggaagacgccggtgtatgctgaggtg aaacgtgtcggagatacactcttgggaatggctacgcagtgtgtgcaggtgaagaacgtg gtcaagacctcacctcagactctgtccaacctctgcctcaagatcaatgtcaaacttggt ggcattaacaacatcctagtcccacaccagcggtatgaactctgttgtccacttgccctt gtcaaggtggtaggcagtatggatgcccaccccagccgatactgtgctactgtgcgggta cagcgaccacggcaagagatcattgaagacttgtcctacatggtgcgtgagctcctcatc caattctacaagtccacccgtttcaagcctacccgcatcatcttctaccgagatggggtg cctgaaggccagctaccccagatactccactatgagctactggccattcgtgatgcctgc atcaaactggaaaaggactaccagcctgggatcacttatattgtggtgcagaaacgccat cacacccgccttttctgtgctgacaagaatgagcgaattgggaagagtggtaacatccca gctgggaccacagtggacaccaacatcacccacccatttgagtttgacttctatctgtgc agccacgcaggcatccagggcaccagccgaccatcccattactatgttctttgggatgac aaccgtttcacagcagatgagctccagatcctgacgtaccagctgtgccacacttacgta cgatgcacacgctctgtctctatcccagcacctgcctactatgcccgcctggtggctttc cgggcacgataccacctggtggacaaggagcatgacagaggtggtgtaagagtggggaac agggccagcaagacagaccaccagccagaaatctctgatatcaacctcatgtcccccacc cctcaccccatcttgtcacatctggccctgaccccactggaccaaaaggggcagcactgg tgcccaccatacacacaggtgtctcatgtgactcacagtgctaaagactcatgcttgaca gcttgggagagaggctgggttggtgctctcccttactctactcatactgacttagagcct ctggctgctgtttgggcatccaagaaagggaggggaaggaatgagctaaaaacaaaacag aatgagttcgcctcatggtcccaatatggctactgtaattctgggaaggaaatgctgaca gaaccgtgtctggcaaaagagagacagatttttctgtgtccttctaaagcaaggaagtct tcccctggaagctccaacagatacggcctcatggctttgctagaaatgggtcatctcact cctgtcattggcaaagcaaagcagactcagttgggaatgagttatccaaatatagggtgt acatgtagtcaggagagagcctcatacagctttgcctttggcagaatccttcaaacctct ttgtcttcctacttcttgatattacaaatcatgagcctttcacatgcattgactctaaag aggtcggcggcggcggcccgcagtcgtggaggagcggtgggagcgtcggcggccgcgggc gatgcaacttccggacgggactcccctctgtccgcgcctcacatctccccttcctctcgc ctagtcctgtgccgttttccgtccgcgactcttccggcccagagctttcggagtgcggtt gctcaggggaagccgtcgccgcccccgcctcggggccgagtgagagtgcccgtcgcgtcg cgccgcgtcgccccccgggccgcctccttgccgccagtggcgggctccgttctccctcga agcactccccccagctccatgaatggaaatcggctccgcagcccagccccagttccccgg gggcccctgagtcggcgaaactgcgaggcggggaaacgcttggaggatttaagtttgggg ttatctaggcggcattactctttgctggagtacccttcttctagactttag >gi568815597f:35788425_36019604|GENSCAN_predicted_peptide_2|481_aa MGKPIKLLANCFQVEIPKIDVYLYEVDIKPDKCPRRVNREVVDSMVQHFKVTIFGDRRPV YDGKRSLYTANPLPVATTGVDLDVTLPGEGGKDRPFKVSIKFVSRVSWHLLHEVLTGRTL PEPLELDKPISTNPVHAVDVVLRHLPSMKKRPLAAVWRIDRGEKTNLRSQLGGFSITSSV LCSQLCLASFFQGASVVSLPVSMPADLCKDEQEAAVHQFSATAFYKAQPVIQFMCEVLDI HNIDEQPRPLTDSHRVKFTKEIKENTRCTEQGFNCCPKSMDSILTSTAQGPKEMTEKMTR KHNKVDVIVKLENGQTVERTVAQYFREKYTLQLKYPHLPCLQVGQEQKHTYLPLEVCNIV AGQRCIKKLTDNQTSTMIKATARSAPDRQEEISRLVRSANYETDPFVQEFQFKVRDEMAH VTGRVLPAPMLQYGGRNRTVATPSHGVWDMRGKQFHTGVEIKMWAIACFATQRQCREEIL K >gi568815597f:35788425_36019604|GENSCAN_predicted_CDS_2|1446_bp atgggcaaacccattaaactgctggctaactgttttcaagttgaaatcccaaagattgat gtctacctctatgaggtagatattaaaccagacaagtgtcctaggagagtgaacagggag gtggttgactcaatggttcagcattttaaagtaactatatttggagaccgtagaccagtt tatgatggaaaaagaagtctttacaccgccaatccacttcctgtggcaactacaggggta gatttagacgttactttacctggggaaggtggaaaagatcgacctttcaaggtgtcaatc aaatttgtctctcgggtgagttggcacctactgcatgaagtactgacaggacggaccttg cctgagccactggaattagacaagccaatcagcactaaccctgtccatgccgttgatgtg gtgctacgacatctgccctccatgaaaaagagacctctggcagcagtgtggagaatagat agaggagaaaaaactaatctgagaagccagttaggaggcttttcaatcactagttcagtc ttgtgctctcaactgtgtctggcatccttcttccagggagcctctgttgtatctcttcca gttagtatgcctgcagatctgtgcaaggatgaacaagaggcagctgttcatcaattttct gccactgccttctacaaagcacaacctgtaattcagttcatgtgtgaagttcttgatatt cataatattgatgagcaaccaagacctctgactgattctcatcgggtaaaattcaccaaa gagataaaagaaaatacaagatgtacagaacaaggattcaactgctgcccgaagagcatg gactcgatcttaacttcaactgctcaggggcccaaagaaatgactgaaaaaatgactaga aagcataataaagttgatgttatagtgaagttagaaaacggccaaactgtggagagaaca gtagcgcagtatttcagagaaaagtatactcttcagctgaagtacccgcaccttccctgt ctgcaagtcgggcaggaacagaaacacacctacctgccactagaagtctgtaatattgtg gcagggcaacgatgtatcaagaagctaacagacaatcagacttccactatgatcaaggca acagcaagatctgcaccagatagacaagaggaaattagcagattggtaagaagtgcaaat tatgaaacagatccatttgttcaggagtttcaatttaaagttcgggatgaaatggctcat gtaactggacgcgtacttccagcacctatgctccagtatggaggacggaatcggacagta gcaacaccgagccatggagtatgggacatgcgagggaaacaattccacacaggagttgaa atcaaaatgtgggctatcgcttgttttgccacacagaggcagtgcagagaagaaatattg aagtaa