GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:56:30 Sequence gi568815577r:5872956_6073333 : 200378 bp : 46.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 262 336 75 0 0 69 51 58 0.266 1.57 1.02 Intr + 5451 5554 104 2 2 48 84 67 0.318 1.27 1.03 Intr + 11246 11449 204 2 0 109 26 123 0.199 6.52 1.04 Intr + 24986 25130 145 2 1 27 47 122 0.002 2.28 1.05 Term + 33615 34004 390 2 0 77 42 215 0.579 10.79 1.06 PlyA + 34670 34675 6 1.05 2.04 PlyA - 36663 36658 6 1.05 2.03 Term - 41483 41338 146 0 2 32 38 143 0.907 1.77 2.02 Intr - 42952 42819 134 0 2 66 36 130 0.633 5.89 2.01 Init - 100378 100002 377 1 2 88 4 877 0.038 75.71 2.00 Prom - 108785 108746 40 -5.16 3.03 PlyA - 108970 108965 6 1.05 3.02 Term - 109987 109784 204 1 0 103 43 64 0.392 0.77 3.01 Init - 112300 112109 192 1 0 104 66 63 0.467 4.67 3.00 Prom - 112770 112731 40 -5.46 4.03 PlyA - 112912 112907 6 1.05 4.02 Term - 114300 114040 261 0 0 27 42 199 0.687 4.73 4.01 Init - 114388 114335 54 1 0 86 36 89 0.765 4.78 4.00 Prom - 124552 124513 40 -2.56 5.00 Prom + 125284 125323 40 -5.56 5.01 Init + 128772 128832 61 2 1 74 77 6 0.368 -0.49 5.02 Term + 135650 135858 209 0 2 63 43 227 0.879 13.20 5.03 PlyA + 136396 136401 6 1.05 6.00 Prom + 155562 155601 40 -1.96 6.01 Init + 159813 159855 43 2 1 74 91 35 0.630 2.99 6.02 Intr + 159943 160193 251 2 2 50 101 144 0.462 9.06 6.03 Intr + 162698 162833 136 1 1 36 36 73 0.133 -2.96 6.04 Term + 167195 167346 152 0 2 82 41 143 0.683 7.07 6.05 PlyA + 168642 168647 6 1.05 7.09 PlyA - 169260 169255 6 1.05 7.08 Term - 170133 170074 60 0 0 62 54 86 0.935 0.40 7.07 Intr - 170431 170337 95 2 2 62 92 138 0.837 11.28 7.06 Intr - 172480 172454 27 2 0 114 91 24 0.438 3.39 7.05 Intr - 175964 175517 448 2 1 60 -6 345 0.249 15.22 7.04 Intr - 177819 177670 150 0 0 85 89 35 0.836 3.66 7.03 Intr - 180412 180222 191 2 2 11 71 87 0.638 -1.40 7.02 Intr - 180612 180474 139 0 1 75 53 136 0.945 8.84 7.01 Init - 183290 183198 93 2 0 79 26 114 0.965 4.68 7.00 Prom - 184015 183976 40 -6.56 8.05 PlyA - 184542 184537 6 -0.45 8.04 Term - 185736 185626 111 0 0 86 54 148 0.782 9.76 8.03 Intr - 195137 194909 229 1 1 69 38 162 0.161 7.17 8.02 Intr - 198254 198145 110 2 2 86 81 59 0.589 4.08 8.01 Intr - 199113 199055 59 1 2 76 66 98 0.589 5.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 100378 99998 381 1 0 88 37 875 0.958 78.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_1|305_aa MSLNALVTSSALSVLVGKALSYPEKCSRADDSHIRDTWELSMNGVLYTVPGAQNADEGGG LFNQQTDQQAKISTIVANLNADEWIKTHSSLAMNGIMYGGIIQSGYRGELKIIVYNITRE SFAVKLQIINQEEIESLKRQTTGSEIQAVINSLQTTKRSETQRLKAEGYDIYEEELPPLL IPRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKVHPQQNPICTSQSSKTKGEKTTKMGKK QSRKTENSKNQSISPPPKEHSSSPAMEQSWTENDFDELREEGFRRSNYSELKEEVQTHGK EVKNL >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_1|918_bp atgtctcttaacgccttggtcacgtctagtgccctctctgtcctggtgggaaaggccttg agctatcctgaaaagtgcagccgtgctgatgactcacatatcagggatacatgggaactg tcaatgaacggagtgctctacacagtgcctggagctcagaatgcagatgagggaggagga ctatttaatcagcagacagatcaacaggccaaaatttccaccatagttgcaaatttgaat gcggatgaatggattaaaacacattcaagtcttgcaatgaacggcataatgtatggtggt ataattcagagtggttacaggggagagttaaagatcattgtatacaatatcactcgagaa tcttttgctgtaaaactgcagataataaaccaggaagaaattgaatccctgaaaagacaa acaacaggctctgaaattcaggcagtaataaatagcctacaaaccacaaaaagatcagaa acacaaagattaaaagctgaaggttatgacatttatgaagaagaactgcctccactgctg atacccaggcaaacagggtctggagtggacctccagcaaactccaacagacctgcagctg agggttctgactgttagaaggaaaactaacaaacagaaagtacatccacaacaaaatccc atctgtacatcacaatcatcaaagaccaaaggagagaaaactacaaagatggggaaaaaa cagagcagaaaaacggaaaattctaaaaatcagagtatctctccacctccaaaggaacac agctcctcaccagcaatggaacaaagctggacagagaatgactttgacgaattgagagaa gaaggcttcagacgatcaaactactccgagctaaaggaggaagttcaaacccatggcaaa gaagtgaaaaacctttaa >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_2|218_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGRKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLPHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSAKASSQKVPKTIQPELLKKTVALSAHRIIRNFHLSIEKNAPLEFGAESLIHPYDIP LSIAQTKKLTSQPKCSSVLIFTDVSGLTMLSIIWKQLD >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_2|657_bp atgccggagccagcgaagtccgctcccgcgcccaagaagggctcgaagaaagccgtgact aaggcgcagaagaaggacggcaggaagcgcaagcgcagccgcaaggagagctactccgta tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcctctaaggccatg ggaatcatgaactccttcgtcaacgacatcttcgaacgcatcgcaggtgaggcttcccgc ctgccgcattacaacaagcgctcgaccatcacctccagggagatccagacggccgtgcgc ctgctgctgcccggggagttggccaagcacgccgtgtccgagggcaccaaggccgtcacc aagtacaccagcgctaaagcatcctcccagaaagtccccaaaaccattcaaccagagctc ctgaaaaaaacagtcgcactctctgcccatcggatcatccggaatttccatttatccatt gagaagaatgctccattggagtttggcgcagaatccctcatccatccctatgatattcca cttagcattgctcagaccaagaaactcacttcacagccaaagtgtagcagtgtgctcatc ttcacggatgtcagtggacttaccatgctttccattatctggaagcaactggattaa >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_3|131_aa MMTQVKNSTPDFTRQVTIKRQAHNTQFIQSHQGKRDRSGPFGCDVFSMQARIPQCKHTHK GNKMNQITLSRGLGITSSISAESPRVGQAHSACCYHHRCDPPICATCRSKDQPTQPVTAV TNISTLENSPA >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_3|396_bp atgatgacacaagtgaaaaattccacacctgacttcacacgacaggttacaatcaaaagg caggcacacaacacacagtttattcagagtcaccaagggaaaagagaccgttcaggtccc tttggctgcgatgtcttttccatgcaggcccggattccccaatgcaagcacacccacaaa ggtaataaaatgaaccagatcacactgtccaggggcctggggatcaccagtagcatctca gcagagtctcccagagtgggtcaggcccactcagcttgctgctaccaccaccgctgcgac ccacccatatgtgccacctgcaggtccaaggaccagcccacccaacctgtcacagccgtc acaaacatcagcactttggaaaacagcccagcataa >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_4|104_aa MAPTKGGEKKKGRSTINKHIHRVAFKKRAPQALKEIWKFAMKEMGTPDMCIDTRLNKAVW TKGIRNIPYRIRVRLSRKCKENEDSPNKLYTLITYVPVTAFKNL >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_4|315_bp atggctcctacaaagggtggcgagaagaaaaagggccgttctaccatcaacaagcacatc cacagagtggccttcaagaagcgtgcccctcaggcactcaaagagatttggaaatttgcc atgaaggagatgggaactccagatatgtgcattgataccaggctcaacaaagctgtctgg accaaaggaataaggaatatcccataccgcatccgtgtgcggttgtccagaaaatgtaaa gagaatgaagattcaccaaataaactctatactttgattacttatgtacccgttactgct ttcaaaaatctatag >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_5|89_aa MKAYKLRKQSRDRGNVLELVDPDAEVCLHVLRLVQSVVLEPEVFSKSASEFRSSLPLQRI LAMSKSRNPRLQTAAQELLEDLRTLEHNV >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_5|270_bp atgaaggcctataaactccggaaacaaagtagagatagagggaatgtcttagagcttgta gatccagatgcagaggtgtgccttcatgtactgaggcttgtccagtctgtggttctggaa cctgaagtcttctccaagtcggcctctgagttccggagctccctgcccctgcaacgcatc ctggcaatgtccaagagccgcaacccccgcctgcaaaccgcagcccaggagctcctggaa gatctccgcactctggagcataatgtgtag >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_6|193_aa MVNHISLQGHANLDVLLGEQRSEYLGNPFGTMCDVEHHVPFLQQVPCRHPQEKLEHVLTS QGSSVGSSPKPDTAQCHPRVKEHFMARCTRKPHTALATCPQSPGLGANDSPVALGLAVLG STTDPKLPDCSRYRRNAPESSSRRCPHSGTFSHLDDSNGLIQYRSVPAADNDWNLGNVSM ARQGPVEVAEATA >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_6|582_bp atggtcaaccacattagtctccagggacacgcaaacctggatgtgcttctgggggagcag cggtctgaatatttgggaaacccgtttggcactatgtgtgacgttgaacatcatgtgccc tttctacagcaggttccctgcaggcacccccaggagaagctcgagcatgtgctcacatct cagggcagttctgttgggagcagccccaagccagacacagcccagtgccatcccagagtt aaggagcacttcatggcgcgctgcacgcggaaaccccacacagctctggccacgtgtcca cagagccctggcctgggagctaacgactcccctgtagccctggggcttgctgtgctgggg agcaccacagaccccaaacttcctgattgctctcggtacagacgaaatgcgcccgagtct tccagcagacgctgccctcattcaggtaccttttcacacttggatgactccaacgggctg attcagtaccgcagtgtcccagcagctgacaatgactggaacttggggaacgtctccatg gccagacagggccctgttgaggtggctgaggccacagcttaa >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_7|400_aa MGCVSESGPDQGQQSHWQTGEDEDKALLPDTATATPDVPSQLPTGHRYGDTKCAGGKVIC VVLSCRRHTTAPEELPPGTCHRACRVPPSCWHVRCLLAEPRSCVAVALLHHCLVHRALSS GALRAVMRGHVYSLAGLSRSQRHSPWRCLVELSMAALRGNRSATQLHRLLSSHLSIPLLP PVPSGFLGHLQAPPPALGTTSPHETRAGHPSYIALFNREEQRLHTVCCSEQPETNSKRKG ETPTPNEKKEKHQHQTKRRRNTNSKQKGETPTPNEKKEKHQHQTKRRRNTNTKRKEGETP TPNEKKEKQSQHKAGPTLGVFNRALQLYNVLVIKVNVARHALLMSQLVPDSRQDLPPTKD FTWTELLGPKQLTFDSRNNCAGKNDGDGENLHVPVAGNCL >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_7|1203_bp atgggctgcgtgtcagagtcaggcccagaccagggacagcaaagccactggcagacaggc gaggatgaggacaaggccctgctgccggacacagcaacagcaactccagatgtcccatca cagctgcccaccggacacagatatggggacaccaagtgcgcagggggcaaagtcatctgc gttgtgctcagctgccgccgccacacaacagcaccagaagagctcccgccggggacatgc caccgagcatgccgggtgccaccgagctgctggcatgttcggtgcctgctggctgaacct cggagttgcgtggcggtggctttgcttcatcactgtctggttcaccgtgctctaagttcg ggagccctgcgagcagtgatgcgtggccacgtctacagcctcgctggcctgtcacgttcc cagaggcacagcccttggaggtgcctggtggaactgagcatggctgctctccgtgggaac cgctcagcaactcagctgcacagactcctttcttcccacctctccatcccactgctccct cctgtcccctcgggcttcctgggtcacctccaggccccacctccagcactggggactaca tccccacatgagaccagagcgggacatccaagctatattgccctcttcaacagggaggag cagaggctgcacactgtgtgctgctcagagcagccagagaccaacagcaaacgaaaagga gaaacaccaacaccaaacgaaaagaaggagaaacaccaacaccaaacgaaaagaaggaga aacaccaacagcaaacaaaaaggagaaacaccaacaccaaacgaaaagaaggagaaacac caacaccaaacgaaaagaaggagaaacaccaacaccaaacgaaaagaaggagaaacacca acaccaaacgaaaagaaggagaaacagagccagcacaaggcgggaccaaccctgggcgtg ttcaacagagccctgcagctctataatgttttagttattaaagtgaatgtggccaggcat gcccttctgatgagtcagctggtgccagattcacggcaggatctgcccccgacaaaggac ttcacctggaccgagctcctgggccccaagcagctgacatttgattctcgcaacaactgt gcagggaagaatgatggagacggcgagaatctgcatgtgcccgtggctggcaactgcttg tga >gi568815577r:5872956_6073333|GENSCAN_predicted_peptide_8|169_aa XPRLRALADFHELRPPATKTARAQGNKQKEFRETAPGSSSKADTKSEHLWALLPSRRKVK ETGLPSTVQAVSELLQVLLCKQKAGGCTPDQGPHTSKVLTEASSLPRISRMCLLVREESS VAQTNTITFPSSQTWSSKASCLAFRPTIKTASRNDDGLMDAGHRQPGDH >gi568815577r:5872956_6073333|GENSCAN_predicted_CDS_8|510_bp ngtccaaggctacgcgccctggcagacttccatgagctgcgaccaccagccaccaagacg gcaagagcacaaggaaacaagcagaaggaattcagagaaacagctccaggctccagctca aaggccgacaccaaaagcgaacacctctgggcgctcttgccttcaagaaggaaagtaaaa gaaacaggtcttccttccactgtgcaggcagtcagcgagctcctgcaggtcctgctctgt aagcagaaagctggaggctgcaccccagatcaaggccctcacacctcaaaggttctcacg gaggcctcgtctctgccccgcatctccagaatgtgcttgctggtcagagaagagtccagt gtggctcagacaaacacaatcacctttcccagcagccagacatggtccagcaaggccagc tgtctggctttccggccgacgatcaaaacagcaagtcggaatgatgatgggctgatggac gcgggccaccgacaaccaggtgaccactga