GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:45:08 Sequence gi568815592f:47682067_47911731 : 229665 bp : 38.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 135 560 426 2 0 75 111 482 0.278 45.55 1.02 Intr + 4926 4967 42 2 0 117 79 40 0.164 3.52 1.03 Intr + 9768 9844 77 2 2 23 95 39 0.029 -4.51 1.04 Intr + 11606 11682 77 2 2 91 105 46 0.071 4.84 1.05 Intr + 25238 25272 35 0 2 109 106 20 0.651 2.92 1.06 Intr + 26158 26212 55 0 1 49 95 45 0.764 -1.07 1.07 Intr + 28669 28820 152 2 2 137 59 74 0.991 8.36 1.08 Intr + 30291 30542 252 2 0 70 61 237 0.981 15.91 1.09 Intr + 31732 33111 1380 0 0 101 111 888 0.975 80.59 1.10 Intr + 35226 35285 60 2 0 80 99 40 0.703 2.31 1.11 Intr + 54831 54965 135 2 0 51 53 127 0.016 5.34 1.12 Intr + 75358 75455 98 0 2 63 83 121 0.281 6.99 1.13 Intr + 86677 86735 59 1 2 63 92 42 0.050 -0.29 1.14 Intr + 99978 100130 153 1 0 86 99 72 0.074 7.22 1.15 Intr + 104449 104568 120 2 0 74 92 46 0.866 3.15 1.16 Intr + 105941 106117 177 0 0 15 31 143 0.002 0.27 1.17 Intr + 109742 109906 165 0 0 28 115 126 0.002 8.31 1.18 Intr + 113163 113497 335 1 2 99 115 144 0.000 12.57 1.19 Intr + 126088 126329 242 0 2 99 98 170 0.566 14.53 1.20 Intr + 129608 129665 58 2 1 72 64 49 0.098 -1.13 1.21 Intr + 135797 135927 131 1 2 86 92 56 0.074 4.37 1.22 Intr + 141709 141827 119 1 2 13 68 72 0.023 -3.11 1.23 Intr + 142211 142360 150 0 0 23 105 105 0.682 4.91 1.24 Intr + 149923 149991 69 2 0 41 83 70 0.123 0.04 1.25 Intr + 151745 151802 58 0 1 72 78 61 0.121 0.52 1.26 Term + 158527 158635 109 1 1 42 35 137 0.442 0.80 1.27 PlyA + 158758 158763 6 1.05 2.00 Prom + 162019 162058 40 -2.45 2.01 Init + 170143 170242 100 0 1 100 80 52 0.732 6.07 2.02 Term + 170890 170981 92 2 2 72 41 122 0.912 2.80 2.03 PlyA + 171317 171322 6 1.05 3.05 PlyA - 171899 171894 6 1.05 3.04 Term - 197870 196237 1634 0 2 54 55 667 0.333 48.94 3.03 Intr - 210811 210602 210 2 0 83 69 50 0.014 0.56 3.02 Intr - 221907 221832 76 0 1 65 81 34 0.211 -1.43 3.01 Intr - 225764 225672 93 2 0 59 80 94 0.372 4.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100130 130 1 1 74 99 83 0.861 8.36 S.002 Term + 105941 106293 353 0 2 15 42 243 0.910 6.06 S.003 Init + 139273 139537 265 0 1 64 47 161 0.817 6.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:47682067_47911731|GENSCAN_predicted_peptide_1|1577_aa MIVFHTLPKSVLVASLFSVGYGCPLAIAAITVAATEPGKGYLRPEICWLNWDMTKALLAF VIPALAIVVVNLITVTLVIVKTQRAAIGNSMFQEVRAIVRISKNIAILTPLLGLTWGFGV ATVIDDRSLAFHIIFSLLNAFQGFFILVFGTILDPKVEGHSSKDSLKCSGIENICLQQAS LIMLLLNYLVIPSKAQVNKPGYIPNLGECSHYRSKIHLKAGDKLQSPEGKPKTGRIQEKC EGPCISSSNCSQPCAKDFHGEIGFTCNQKKWQKSAETCTSLSVEKLFKDSTGASRLSVAA PSIPLHILDFRAPETIESVAQGIRKNCPFDYACITDMVKSSETTSGNIAFIVELLKNIST DLSDNVTREKMKSYSEVANHILDTAAISNWAFIPNKNASSDLLQSVNLFARQLHIHNNSE NIVNELFIQTKGFHINHNTSEKSLNFSMSMNNTTEDILGMVQIPRQELRKLWPNASQAIS IAFPTLGAILREAHLQNVSLPRQVNGLVLSVVLPERLQEIILTFEKINKTRNARAQCVGW HSKKRRWDEKACQMMLDIRNEVKCRCNYTSVVMSFSILMSSKSMTDKVLDYITCIGLSVS ILSLVLCLIIEATVWSRVVVTEISYMRHVCIVNIAVSLLTANVWFIIGSHFNIKAQDYNM CVAVTFFSHFFYLSLFFWMLFKALLIIYGILVIFRRMMKSRMMVIGFAIGYGCPLIIAVT TVAITEPEKGYMRPEACWLNWDNTKALLAFAIPAFVIVAVNLIVVLVVAVNTQRPSIGSS KSQDVVIIMRISKNVAILTPLLGLTWGFGIATLIEGTSLTFHIIFALLNAFQIRDALRMR MSSLKGKSRAAEFNSRYNCLSDVIKLIHILPYVCVFEVVFKEILLYPQPVKMFPQIETER EQEEELEMVNSIGADECGCRHWKKTPKDLRDTWADDCKSTLLPRAFAYKHPSWFENRMAL NHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKK KLRPAEIMTINLAVCDLGISAHSRCVLTQSLDGLGYLGRLYSVWTCINIMNAYGECTRTL RRARHAWFCENKQDEPQGICKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCGSLITMTA VSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPFGTSCTLD WWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSRIHSSHVL EMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAAMYNPIIY QVIDYKFACCQTGGLKATKKKSLEGFRLHTVTTVRKSSAVLEIHEEDKGRLWHQLSCGEN SEMTASSETLGTKAHSPLQETREGENEAVGTSRKGTGQDAPAKRTLWVRGEAISVTMSIA ILVLLVCVCVRTSLSSELGLVKAGLKSLIQGDQAVSLTGAATLPICNLICSIADDVLGRE GRNKSASPNNLDADSHSNDDDCVSKTATKGGSDNGDLNGEILMQQQGAILKAESPHQTTE PAGALILDFSTSRTVEK >gi568815592f:47682067_47911731|GENSCAN_predicted_CDS_1|4734_bp atgattgttttccataccttgcccaagtcagtcctggtggcatctctgttttcagtgggc tatggatgccctttggccattgctgccatcactgttgctgccactgaacctggcaaaggc tatctacgacctgagatctgctggctcaactgggacatgaccaaagccctcctggccttc gtgatcccagctttggccatcgtggtagtaaacctgatcacagtcacactggtgattgtc aagacccagcgagctgccattggcaattccatgttccaggaagtgagagccattgtgaga atcagcaagaacatcgccatcctcacaccacttctgggactgacctggggatttggagta gccactgtcatcgatgacagatccctggccttccacattatcttctccctgctcaatgca ttccagggtttcttcatcctagtgtttggaaccatcctggatccaaaggttgagggtcat tcatctaaagactccctgaagtgttctggcattgaaaatatttgtctccaacaagcatcc ctgataatgcttcttctgaattatcttgtcatcccatcaaaggcccaagttaacaaacca ggctacatccctaacctaggagaatgttcccactatagatccaagattcacctaaaagct ggagataaacttcaaagccctgaagggaaacccaagactggaaggatccaagagaaatgc gaaggaccttgtatttcttcttccaactgcagccagccctgtgctaaggactttcatgga gaaataggatttacatgtaatcaaaaaaagtggcaaaaatcagctgaaacatgtacaagc ctttctgtggaaaaactctttaaggactcaactggtgcatctcgcctttctgtagcagca ccatctatacctctgcatattctagactttcgagctccagagaccattgagagtgtagct caaggaatccgtaagaactgcccctttgattatgcctgcatcactgacatggtgaaatca tcagaaacaacatctggaaatattgcatttatagtggagttattaaaaaatatttctaca gacttgtctgataatgttactcgagagaaaatgaagagctatagtgaagtggccaaccac atcctcgacacagcagccatttcaaactgggctttcattcccaacaaaaatgccagctcg gatttgttgcagtcagtgaatttgtttgccagacaactccacatccacaataattctgag aacattgtgaatgaactcttcattcagacaaaagggtttcacatcaaccataatacctca gagaaaagcctcaatttctccatgagcatgaacaataccacagaagatatcttaggaatg gtacagattcccaggcaagagctaaggaagctgtggccaaatgcatcccaagccattagc atagctttcccaaccttgggggctatcctgagagaagcccacttgcaaaatgtgagtctt cccagacaggtaaatggtctggtgctatcagtggttttaccagaaaggttgcaagaaatc atactcaccttcgaaaagatcaataaaacccgcaatgccagagcccagtgtgttggctgg cactccaagaaaaggagatgggatgagaaagcgtgccaaatgatgttggatatcaggaac gaagtgaaatgccgctgtaactacaccagtgtggtgatgtctttttccattctcatgtcc tccaaatcgatgaccgacaaagttctggactacatcacctgcattgggctcagcgtctca atcctaagcttggttctttgcctgatcattgaagccacagtgtggtcccgggtggttgtg acggagatatcatacatgcgtcacgtgtgcatcgtgaatatagcagtgtcccttctgact gccaatgtgtggtttatcataggctctcactttaacattaaggcccaggactacaacatg tgtgttgcagtgacatttttcagccactttttctacctctctctgtttttctggatgctc ttcaaagcattgctcatcatttatggaatattggtcattttccgtaggatgatgaagtcc cgaatgatggtcattggctttgccattggctatgggtgcccattgatcattgctgtcact acagttgctatcacagagccagagaaaggctacatgagacctgaggcctgttggcttaac tgggacaataccaaagcccttttagcatttgccatcccggcgttcgtcattgtggctgta aatctgattgtggttttggttgttgctgtcaacactcagaggccctctattggcagttcc aagtctcaggatgtggtcataattatgaggatcagcaaaaatgttgccatcctcactcca ctgctgggactgacctggggttttggaatagccactctcatagaaggcacttccttgacg ttccatataatttttgccttgctcaatgctttccagataagagatgctttgaggatgagg atgtcttcactgaaggggaaatcgagggcagctgagtttaattctcgatacaactgccta agtgatgtcatcaaattgatccacattttgccttatgtttgtgtttttgaagttgtgttt aaagaaattcttctgtacccgcagcctgtaaagatgtttccccagattgagactgaaaga gagcaggaagaggagttggaaatggtcaacagtattggagcagatgagtgtgggtgcaga cactggaaaaagacaccaaaggacctgagggacacctgggcagatgattgtaagtcaacg ctacttccccgggccttcgcatataaacatccctcgtggttcgagaacagaatggcgtta aatcacactgccctgcctcaggacgagcgcctgccccattaccttcgagatggggatcct tttgcttccaaactttcttgggaagcggatttagtggctggcttttacctaacaataatt gggattctgtccacatttggaaatggatatgtcctttacatgtcttctagacgaaagaag aagctgagacccgctgaaataatgactatcaatttagcagtctgtgatctggggatttca gcacacagcagatgtgtgcttactcagagtttggatggtcttggttatctgggaaggctt tattcagtatggacctgcataaacattatgaacgcttatggtgaatgcacacgcacatta agaagagcaagacatgcctggttttgtgaaaataaacaggacgaaccccaggggatttgc aagccgttcaccatcatctcttgcttttgtcaccgctgggtgtttggctggatcggctgc cgctggtatggatgggctggatttttctttggctgtggaagccttatcaccatgactgct gtcagcctggatcgatatttgaaaatctgctatttatcttatggggtttggctgaaaaga aagcacgcctacatctgcctggcagccatctgggcctatgcttccttctggaccaccatg cccttggtaggtctgggggactacgtacctgagcccttcggaacctcgtgcaccctggac tggtggctggcccaggcctcggtagggggccaggttttcatcctgaacatcctcttcttc tgcctcttgctcccaacggctgtgatcgtgttctcctacgtaaagatcattgccaaggtt aagtcctcttccaaagaagtagctcatttcgacagtcggatccatagcagccatgtgctg gaaatgaaactgacaaaggtagcgatgttgatttgtgctggattcctgattgcctggatt ccttatgcagtggtgtctgtgtggtcagcttttggaaggccagactccattcccatacag ctctctgtggtgccaaccctacttgcaaaatctgcagcgatgtacaatcccatcatttac caagttattgattacaaatttgcctgttgccaaactggtggtttgaaagcaaccaagaag aagtctctggaaggcttcaggctgcacaccgtaaccacagtcaggaagtcttctgctgtg ctggaaattcatgaagaggacaaagggagattatggcatcagctatcttgtggggagaat tctgagatgacagcctccagtgaaacactagggacaaaggcccattcacctcttcaagaa accagggaaggggaaaatgaagctgttggaacatccaggaagggaacagggcaggatgca cctgcaaagaggactctgtgggttcggggagaagccatctcagtgacaatgtccatcgca atcctggttcttttggtgtgtgtgtgtgtaaggacttcactttcttctgaactggggctt gtgaaagctggcctgaaaagcctgattcaaggagatcaggctgtgtctctgactggagca gctacactccccatctgtaatttgatttgttctattgcagatgatgtactgggcagagag ggcaggaacaaatctgctagtcccaataatctggatgcagacagtcatagtaatgatgat gactgtgtctccaagactgctacaaaagggggaagtgacaatggagacttaaacggtgag attttgatgcaacaacaaggtgccatcttgaaagcagagagccctcaccagacaactgaa cctgctggtgccttgatcttggacttctcaacctctagaacagtggaaaaataa >gi568815592f:47682067_47911731|GENSCAN_predicted_peptide_2|63_aa MGKEKLFTGEMENDFWYLGQNTLEHTFYVLDAAAISLQLKFCDQFRASCFQSVTDEQEHV QTG >gi568815592f:47682067_47911731|GENSCAN_predicted_CDS_2|192_bp atgggcaaggagaagctcttcacaggtgagatggaaaatgacttctggtatctaggacaa aacactttagaacataccttttatgttttagatgcagcagcaatcagcctgcagctgaag ttctgcgaccagttcagggcaagctgcttccagagtgtcactgatgagcaagagcatgtt caaacaggataa >gi568815592f:47682067_47911731|GENSCAN_predicted_peptide_3|670_aa GILKAVCYLATGGIGSPGSLLNLCQQHRGTEEYASPEDITASFLPSFRPSSKSDLHGHSV QKKIQLWNVFISSSLGRKATKHSGMPLKTCPLKHYAFIKHLCYSFEDFSLESYLVEIKAV HNLSLQSHGTKGVFELLSGWRRTKENLPFKDRIADAYSDVMVTYTMTSSLYFITFGMGAS PFTNIEAVKVFCQNMCVSILLNYFYIFSFFGSCLVFAGQLEQNRYHSIFCCKIPSAEYLD RKPVWFQTVMSDGHQQTSHHETNPYQHHFIQHFLREHYNEWITNIYVKPFVVILYLIYAS FSFMGCLQISDGANIINLLASDSPSVSYAMVQQKYFSNYSPVIGFYVYEPLEYWNSSVQD DLRRLCSGFTAVSWVEQYYQFLKVSNVSANNKSDFISVLQSSFLKKPEFQHFRNDIIFSK AGDESNIIASRLYLVARTSRDKQKEITEVLEKLRPLSLSKSIRFIVFNPSFVFMDHYSLS VTVPVLIAGFGVLLVLILTFFLVIHPLGNFWLILSVTSIELGVLGLMTLWNVDMDCISIL CLIYTLNFAIDHCAPLLFTFVLATEHTRTQCIKSSLQDHGTAILQNVTSFLIGLVPLLFV PSNLTFTLFKCLLLTGGCTLLHCFVILPVFLTFFPPSKKHHKKKKRAKRKEREEIECIEI QENPDHVTTV >gi568815592f:47682067_47911731|GENSCAN_predicted_CDS_3|2013_bp gggattctgaaagctgtgtgctaccttgccacaggtggaattggaagtccaggctccctg ctcaacctttgccagcagcacaggggaacagaggaatatgcttctcctgaagacattaca gccagcttcctcccttctttcaggccatcttctaaaagtgatcttcatgggcattctgtt cagaaaaaaatccagctgtggaatgtttttatttcatcttcattgggaagaaaggccaca aaacattcagggatgccccttaaaacttgtcctttgaagcattatgcttttatcaagcat ctgtgctacagctttgaagatttttctcttgaatcatatttagtagaaatcaaagctgtg cataatctaagtttgcaaagtcatggaactaaaggagtgtttgagcttctgtccggatgg cggagaaccaaagagaacttgcccttcaaagacaggatagcagatgcctattctgatgtg atggtcacctataccatgaccagctccctgtacttcatcacttttggcatgggtgccagc ccattcacaaacatagaggctgtgaaggtcttctgtcaaaacatgtgtgtctctattctg ttgaactacttctacattttctccttctttggctcctgtctggtctttgctggccaacta gagcaaaaccgctaccacagcatcttttgctgtaagatcccttctgcagaatacctggat cgcaaacctgtgtggttccagacagtgatgagtgatgggcatcaacagacgtcccatcat gagacgaacccctaccagcaccacttcattcagcacttcctccgtgaacattataatgaa tggattaccaatatatatgtgaagccatttgttgtcatcctctatctcatttatgcctcc ttctccttcatggggtgcttacagatcagtgacggagccaacatcatcaatctactagcc agtgattcgccaagtgtttcctatgccatggttcagcagaaatatttcagcaactatagc cctgtgataggattctacgtctatgagcccctagagtactggaacagcagcgtccaggat gacctaagaagactctgtagtggattcactgcagtgtcctgggtggagcagtactaccag ttcctgaaagtcagcaacgtcagtgccaataacaaaagtgacttcatcagtgtcctgcaa agctcatttttaaaaaagccagaattccagcattttcgaaatgatatcatcttctccaag gcaggggatgaaagcaatatcattgcttctcgcttgtatctggtggccaggactagcaga gacaagcagaaagaaatcacagaagtgttggaaaagctgaggcccctatccctctcaaag agcatccgattcatcgtgttcaacccctcctttgtcttcatggaccattacagcttgtct gtcacagtgcctgttctgattgcaggctttggtgttctcctggtgttaatcctgactttt ttcctagtgatccaccctctgggaaacttctggctaattcttagcgtcacctcaattgag ctgggcgttctgggcttaatgacattatggaacgtcgacatggattgcatttctatcttg tgccttatctacaccttgaatttcgccattgaccactgtgcaccactgcttttcacattt gtattagcaactgagcacacccgaacacaatgtataaaaagctccttgcaagaccatggg acagccattttgcaaaatgttacttcttttcttattgggttagtcccccttctatttgtg ccttcgaacctgaccttcacactgttcaaatgcttgctgctcactgggggttgcacactt ctgcactgttttgttattttacctgtgttcctaacgtttttccccccttccaaaaagcac cacaagaaaaagaaacgtgccaagcgaaaggagagagaggaaattgaatgcatagaaatt caagagaacccggatcacgtcaccacagtatga