GENSCAN 1.0 Date run: 6-Nov-116 Time: 10:23:58 Sequence gi568815594r:71652518_71884018 : 231501 bp : 37.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1068 1242 175 2 1 62 81 131 0.848 9.36 1.02 Term + 9303 9577 275 1 2 85 49 71 0.118 -2.45 1.03 PlyA + 11580 11585 6 1.05 2.20 PlyA - 11795 11790 6 1.05 2.19 Term - 16190 16048 143 0 2 96 48 32 0.363 -2.89 2.18 Intr - 17698 17495 204 2 0 42 79 111 0.207 3.85 2.17 Intr - 23554 23429 126 2 0 52 64 117 0.529 5.43 2.16 Intr - 55035 54990 46 0 1 54 82 3 0.000 -6.64 2.15 Intr - 69404 69043 362 0 2 6 102 284 0.267 15.61 2.14 Intr - 78716 78602 115 2 1 91 30 94 0.187 3.00 2.13 Intr - 78948 78736 213 0 0 62 19 198 0.220 8.29 2.12 Intr - 100133 100001 133 1 1 47 75 117 0.893 6.03 2.11 Intr - 101991 101894 98 0 2 105 69 62 0.940 3.89 2.10 Intr - 102590 102461 130 1 1 90 71 62 0.985 4.48 2.09 Intr - 104397 104195 203 0 2 56 97 200 0.935 14.86 2.08 Intr - 105654 105525 130 2 1 53 94 107 0.967 7.58 2.07 Intr - 110985 110891 95 0 2 96 103 -10 0.735 -0.86 2.06 Intr - 111419 111287 133 1 1 129 87 -3 0.641 3.43 2.05 Intr - 113126 112915 212 2 2 123 70 190 0.978 17.49 2.04 Intr - 115916 115784 133 1 1 134 73 93 0.997 12.13 2.03 Intr - 118117 118031 87 0 0 103 89 73 0.786 7.07 2.02 Intr - 123973 123946 28 2 1 131 65 -11 0.143 -2.84 2.01 Init - 131501 131444 58 2 1 70 94 55 0.524 6.00 2.00 Prom - 139633 139594 40 -3.65 3.03 PlyA - 140391 140386 6 1.05 3.02 Term - 141683 140974 710 0 2 15 42 306 0.007 11.38 3.01 Init - 152529 152379 151 0 1 69 103 130 0.782 12.86 3.00 Prom - 153037 152998 40 -10.84 4.03 PlyA - 153364 153359 6 -0.45 4.02 Term - 153996 153839 158 1 2 64 48 168 0.698 7.51 4.01 Init - 157109 156890 220 2 1 52 103 126 0.834 9.34 4.00 Prom - 163607 163568 40 -5.65 5.04 PlyA - 164538 164533 6 1.05 5.03 Term - 171457 171410 48 1 0 109 44 61 0.262 0.13 5.02 Intr - 185687 185527 161 0 2 56 65 62 0.334 -0.51 5.01 Init - 186633 186504 130 0 1 79 91 81 0.545 7.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 141576 140974 603 0 0 41 42 277 0.966 14.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:71652518_71884018|GENSCAN_predicted_peptide_1|149_aa MGVFYRYFKIRKQKELQKKPNQDCKVDAQQFRIKTLAKLSLLDERNEKENCHGGEGLTGS LNLPTPSEIVPSLNSFKPLGCQLKNDEFLKFGQESFISHKGLQLAGWPISQARKHSLWPE AGNIHFNGGAKGTGIYAEQGGQRYIFNTL >gi568815594r:71652518_71884018|GENSCAN_predicted_CDS_1|450_bp atgggagtcttttacagatattttaagatcagaaaacaaaaagaacttcagaagaagcca aatcaggactgtaaggtggatgctcagcaatttcgcatcaaaactcttgcaaaattgtcc ttgcttgatgagagaaatgaaaaggagaattgccacggtggagaaggactcactggttcc cttaacctaccgaccccttcagaaatagtcccttccctaaattcttttaagcctttgggc tgtcaactgaagaatgatgagtttcttaaatttggacaggagagctttatttctcataaa gggctgcagcttgctgggtggccaatctcacaggctagaaagcacagcctctggccagaa gctggaaacatacacttcaacggaggggcaaagggaacaggaatttatgctgagcaaggt ggccaaagatacatatttaatacgctatag >gi568815594r:71652518_71884018|GENSCAN_predicted_peptide_2|882_aa MKRVLVLLLAVAFGHALERASSSHHFTLLANTGKEKLVREIAAGKTQLNGQKIQWAAWSL VLYSRKFPSGTFEQVSQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSPFPVH PGTAECCTKEGLERKLCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFMWEYST NYGQAPLSLLVSYTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRVCSQYAA YGEKKSRLSNLIKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPEHTVKL CDNLSTKNSKFEDCCQEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVCDPGNTKVMDK YTFELSRRTHLPEVFLSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKELSSFIDKGQE LCADYSENTFTEYKKKLAERLKAKLPDATPTELAKLVNKHSDFASNCCSINSPPLYCDSE GVCKRVGEPEVMNAETGQSLLSGRSRLCAGPTAASKCVTTNAFSALPSRDDQVPTNSAEG PGGSPCPLSTQEESGHTNGLKGSVGGGFYWVMEVALSGMGNSKKMVREEGHQPQRPKVDK SAKVRKNQHKNSENSKIQNASSLPNDCNASPARVQNETKYEMDELTQVGFRRWVITNSTE LKEHVLTQCKEAKNLDKRIQELLTRITNLERNINDLTELKNTAQELREAYTYLRPLWGTS LKNRKALGRVLYGGKQIHRHQVHDPDSQEGVTGMKEADAQSLVELVPVCPELVSSGGFLV LLTSRMKLRTLAVSVTVLKDSLSRVCSFRCSDVSGVSSSGFVVLLDFRSEATDLHSSSWF PVPAVMPQSFLILPGICPGSSKFDVQTSSRNIIWALVRNAES >gi568815594r:71652518_71884018|GENSCAN_predicted_CDS_2|2649_bp atgaagagggtcctggtactactgcttgctgtggcatttggacatgctttagagagagcc tctagtagccatcattttactcttctggctaacacaggaaaggagaagctagtaagagag atagcagcaggaaaaactcaactcaatgggcaaaaaattcaatgggcagcatggtcacta gtcctgtacagtagaaaatttcccagtggcacgtttgaacaggtcagccaacttgtgaag gaagttgtctccttgaccgaagcctgctgtgcggaaggggctgaccctgactgctatgac accaggacctcagcactgtctgccaagtcctgtgaaagtaattctccattccccgttcac ccaggcactgctgagtgctgcaccaaagagggcctggaacgaaagctctgcatggctgct ctgaaacaccagccacaggaattccctacctacgtggaacccacaaatgatgaaatctgt gaggcgttcaggaaagatccaaaggaatatgctaatcaatttatgtgggaatattccact aattacggacaagctcctctgtcacttttagtcagttacaccaagagttatctttctatg gtagggtcctgctgtacctctgcaagcccaactgtatgctttttgaaagagagactccag cttaaacatttatcacttctcaccactctgtcaaatagagtctgctcacaatatgctgct tatggggagaagaaatcaaggctcagcaatctcataaagttagcccaaaaagtgcctact gctgatctggaggatgttttgccactagctgaagatattactaacatcctctccaaatgc tgtgagtctgcctctgaagattgcatggccaaagagctgcctgaacacacagtaaaactc tgtgacaatttatccacaaagaattctaagtttgaagactgttgtcaagaaaaaacagcc atggacgtttttgtgtgcacttacttcatgccagctgcccaactccccgagcttccagat gtagagttgcccacaaacaaagatgtgtgtgatccaggaaacaccaaagtcatggataag tatacatttgaactaagcagaaggactcatcttccggaagtattcctcagtaaggtactt gagccaaccctaaaaagccttggtgaatgctgtgatgttgaagactcaactacctgtttt aatgctaagggccctctactaaagaaggaactatcttctttcattgacaagggacaagaa ctatgtgcagattattcagaaaatacatttactgagtacaagaaaaaactggcagagcga ctaaaagcaaaattgcctgatgccacacccacggaactggcaaagctggttaacaagcac tcagactttgcctccaactgctgttccataaactcacctcctctttactgtgattcagag ggagtgtgcaaacgagtgggggaaccggaggtaatgaacgctgaaactggccagtcactc ctgtctggcaggagcaggctctgtgcaggccccacagcagcatccaagtgtgttacaacc aatgctttttcagctctgccatccagggatgatcaagtgccaaccaactcagcggagggc ccaggtggcagcccctgccctctcagcacccaggaagaatcaggtcatacgaatggattg aagggtagtgtaggcgggggattttattgggtaatggaagtggctctcagcgggatgggg aattcgaagaagatggtgcgggaagaaggccatcagcctcaaagaccaaaggtagataaa tctgcaaaggtgaggaaaaaccagcacaaaaactctgaaaattccaaaatccagaatgcc tcttcacttccaaatgattgcaatgcctctcctgcaagggtgcagaatgagacaaagtat gagatggatgaattgacacaagtaggcttcagaaggtgggtaataacaaactccactgag ctaaaggagcatgttctaacccaatgcaaagaagctaagaaccttgataaaaggatacag gaactgctaactagaataaccaatttagagaggaacataaatgacctgacggagctgaaa aacacagcacaagaacttcgtgaagcatacacttatctacgccccctgtggggaacatca ctaaagaacagaaaagcactcggtagagtcctctatgggggcaagcagatccaccgccac caagtccatgatccagacagccaggaaggagtaactgggatgaaggaagcagatgctcaa tccttggtagagctagtcccagtgtgtccggaattggtttcttctggtgggttcttggtc ttactgacttcaagaatgaagctgcggaccctcgcagtgagtgttacagttcttaaagac agtttgtccagagtttgttccttcagatgttcagatgtgtccggagtttcttccagtgga ttcgtggtcttgcttgacttcaggagtgaagccacagaccttcacagcagctcatggttt ccagtccctgctgtgatgccccaatcatttctaatattaccaggcatatgccctggttcc tcaaagtttgatgtgcagaccagtagcagaaacatcatctgggcacttgttagaaatgca gaatcataa >gi568815594r:71652518_71884018|GENSCAN_predicted_peptide_3|286_aa MGTSNIFNIMGLRAALAPLDSEEPKQGREKEPSPTLLPPPPPPPSAPLLPEKEGILPNSF YEASIILIPKPGRDTTETENFRPISLMSNDAKIFNKLLANQIQQHIKKLIQHDQAGFIPG MQGWFNISKSINVIHRINRSNNKDHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLTI IRAIYHKPTANIILNGQKPEAFPLKTSTRKGCPLSPLLFSIVLEVPVGAIRQEKEIKGIQ LGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINV >gi568815594r:71652518_71884018|GENSCAN_predicted_CDS_3|861_bp atgggtaccagtaacatctttaacattatgggccttagggctgctttggccccactcgac tcagaagagcctaaacagggaagggagaaggaaccatcacctaccttactgcctcctcct cctcctcctccctcagccccgctgttaccagaaaaagagggaattctccctaactcattt tatgaggccagcatcatcctgataccaaagcctggcagagacacaacagaaacagagaat tttagaccaatatccctgatgagcaatgatgcaaaaatcttcaacaaattactggcaaac caaatccagcagcacatcaaaaagcttatccagcatgatcaagctggcttcatccctggg atgcaaggctggttcaacataagcaaatcaataaacgtaatccatcgcataaatagatcc aacaacaaagaccacatgattatctcaatagatgcagaaaaggcatttgacaaaattcaa cagcccttcatgctaaaaacgctcaataaactaggtattgatggaacgtatctcacaata ataagagctatttatcacaaacccacagccaatatcatactgaatgggcaaaaaccagaa gccttccctttgaaaaccagcacaagaaaaggatgccctctctcaccactcctattcagc atagtgttggaagttccggttggggcaatcaggcaagagaaagaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaa aaccccatcgtctcagcccaaaacctcctgaagctgataagcaatttcagcaaagtctca ggatacaaaatcaatgtataa >gi568815594r:71652518_71884018|GENSCAN_predicted_peptide_4|125_aa MLDLELWEQVGRNLKQYHAQGQWVPVTSLTLWALVRAALAPLDSEEPKQGREEEPSPTLL PPPPPPPSAPLLPAQCIKPYHDVAGTQPSTKNKGNNPAGPTTPDDAASSDNTGPGHGAEE DNSGG >gi568815594r:71652518_71884018|GENSCAN_predicted_CDS_4|378_bp atgctagacctagagctctgggaacaagtgggcagaaatcttaaacaatatcatgcacaa gggcaatgggtaccagtaacatctctaacattatgggccttagttagggctgctttggcc ccactcgactcagaagagcctaaacagggaagggaggaggaaccatcacctaccttactg cctcctcctcctcctcctccctcagccccgctgttaccagctcaatgcatcaaaccatac cacgatgtggctgggactcaacccagtaccaaaaataaaggaaataaccctgcaggaccc accaccccagatgatgcagcttcctcggacaacacaggccctggacacggtgctgaagaa gacaactcaggaggctga >gi568815594r:71652518_71884018|GENSCAN_predicted_peptide_5|112_aa MKHKFSNLTVPKNQRKMMIKMKTHGSSKSRIDSSIQHAYQAIADIETARWKSLPGKTLAR LRTGRNAHWGGATEVCAVCSQVDPGPSSSWVEPGIQSDRLEKKKLFSVDWDD >gi568815594r:71652518_71884018|GENSCAN_predicted_CDS_5|339_bp atgaagcacaagttctccaacttgacagtacctaagaatcagcggaagatgatgataaaa atgaaaactcatggatcttccaaatctaggatagattctagtattcagcatgcttaccaa gcaattgcagatattgagacagccagatggaaaagtctccctggcaaaactttagccaga ctacgcactgggaggaatgcacactggggtggagccacagaagtttgcgctgtttgcagc caggttgatcctggcccctcctcttcctgggtggaacctgggattcagtctgacagactg gagaaaaagaaactcttctcagttgactgggatgactag