GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:56:10 Sequence gi568815596r:223497432_223699282 : 201851 bp : 39.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1358 1455 98 1 2 105 62 148 0.423 13.73 1.02 Intr + 16716 16843 128 0 2 55 97 52 0.012 2.20 1.03 Intr + 28486 28629 144 2 0 75 66 47 0.112 0.53 1.04 Intr + 32216 32334 119 0 2 56 103 65 0.749 4.06 1.05 Intr + 33426 33486 61 2 1 57 81 34 0.035 -3.01 1.06 Intr + 35223 35390 168 1 0 38 62 100 0.079 1.40 1.07 Intr + 45068 45661 594 0 0 21 64 236 0.107 6.11 1.08 Term + 51786 51967 182 1 2 67 37 147 0.437 4.29 1.09 PlyA + 53766 53771 6 1.05 2.06 PlyA - 54878 54873 6 1.05 2.05 Term - 57323 57119 205 1 1 69 41 116 0.024 0.86 2.04 Intr - 60093 60013 81 2 0 74 71 64 0.012 1.13 2.03 Intr - 83010 82861 150 2 0 113 64 21 0.007 0.56 2.02 Intr - 89041 88927 115 2 1 49 48 122 0.060 2.89 2.01 Init - 91971 91917 55 0 1 72 101 54 0.658 6.60 2.00 Prom - 95437 95398 40 -3.45 3.02 PlyA - 98692 98687 6 1.05 3.01 Sngl - 101851 99998 1854 1 0 93 43 2121 0.999 199.89 3.00 Prom - 111446 111407 40 -4.85 4.02 PlyA - 111693 111688 6 1.05 4.01 Sngl - 116170 115895 276 1 0 81 55 208 0.960 11.72 4.00 Prom - 116274 116235 40 -8.45 5.00 Prom + 116462 116501 40 -5.55 5.01 Init + 117665 117820 156 1 0 74 75 127 0.995 9.96 5.02 Intr + 118044 118171 128 2 2 50 75 62 0.992 -0.34 5.03 Intr + 121461 121575 115 0 1 86 50 155 0.959 11.03 5.04 Term + 122129 122314 186 1 0 68 44 160 0.939 6.11 5.05 PlyA + 122384 122389 6 -0.45 6.00 Prom + 123124 123163 40 -9.55 6.01 Init + 123908 124010 103 1 1 99 109 13 0.576 4.95 6.02 Term + 128626 128804 179 2 2 -56 39 496 0.284 27.17 6.03 PlyA + 131534 131539 6 1.05 7.00 Prom + 135668 135707 40 -5.95 7.01 Init + 139774 139846 73 0 1 101 31 68 0.501 3.68 7.02 Intr + 150800 150973 174 0 0 78 26 135 0.033 5.29 7.03 Intr + 152055 152131 77 1 2 29 103 32 0.043 -2.88 7.04 Intr + 153764 153861 98 1 2 103 74 82 0.966 6.19 7.05 Intr + 156366 156557 192 0 0 117 61 113 0.938 9.19 7.06 Intr + 159881 159992 112 2 1 126 92 14 0.802 5.06 7.07 Intr + 189811 189918 108 0 0 48 40 95 0.025 0.26 7.08 Intr + 192111 192345 235 2 1 69 100 60 0.019 1.64 7.09 Term + 198340 198632 293 2 2 105 40 119 0.009 3.32 7.10 PlyA + 200667 200672 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 1729 1801 73 1 1 104 37 92 0.897 2.10 S.002 Term - 83784 83687 98 1 2 104 42 95 0.894 3.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_1|497_aa MAQLEGVEMTVRVDSEGVPSTGGYASISRMGGRQPLLVWPQGAKTFLYFRTKGFMGSMGL PSSCCQMEKKTNKVVGGISGPCPPVKILLNPKVLRCEDSYPLMFGVCHKVGTEGIPSTHI QSAGGGDIISIVYLLLMTIPLQAIPGQLWILKVHYILPRLQIQCSSNLILLGHWTRAQDP PSSVSNDFEASLVKNGVFSSYSSKTGTVVGKQVAVEITEEEEMESDIAEGEGPYLNLNFD LFGHKGQQQKPAIAFQERRCKMPLRSHTPALCLVRSSCFYVKFICLKYLLYVIITWLVGL CHRNCLSKHDEHQREDERIFGRTETPLQNTPVLRGTASVCENLRNKYWLHKQEHDGTKCY SSLLPSCLVFGIESQNKRPSLKSSPIEVTGITIIGVPQTLPTQPCQLLTISCNQLTQSYP LSKIIYFFFQQLFSLHILILITYQVQKLIIRFLRFQETRPTLSSFPSSDSHTKPLRQQRR TYWYQQLQADSSKNVHN >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_1|1494_bp atggcccagttagaaggtgtggaaatgacagtgcgtgtggacagtgaaggtgtgcccagt acaggagggtatgccagcatctccaggatgggtggccgccagcccctattggtatggccc caaggagccaaaacatttctatacttcagaaccaaaggattcatgggatccatgggattg ccaagttcctgttgccagatggagaaaaaaacaaacaaagtggttggaggtatatctggg ccctgtcccccagtgaaaattctgcttaatcccaaagtcttgagatgtgaggattcctac ccactgatgtttggtgtctgtcataaggtaggcactgaaggcatcccctcaacacacatc cagagtgcaggtggtggtgacatcattagtattgtctatttgctgctgatgaccatcccc ttacaggcgatccctggacagctgtggattttgaaagttcattatattctcccaaggttg caaattcagtgttccagcaacctcattcttcttggacactggacaagagctcaggaccca ccaagttcagtttcaaatgattttgaagcttctctagtcaagaatggagtcttctcttcc tattcctccaagacagggacagtggtagggaagcaagtggctgtagagataacagaagaa gaagagatggaaagtgacatagcagaaggggagggcccatatttgaatttgaattttgat ctttttggccacaaaggccaacagcaaaagccagccatcgcgttccaggaacggaggtgc aaaatgccgctcagatcacacactcctgccttgtgtttggtgagaagcagctgcttctat gtcaagtttatctgtttgaaatatttactgtatgtaattatcacttggctggtgggcctt tgccatagaaactgtcttagcaaacacgatgaacaccagcgggaagatgagcgcatattt ggaaggacagaaacccctctgcaaaacaccccagtcttgagaggaacagcttctgtgtgt gaaaacttgagaaacaagtactggttacataaacaggaacatgatggaaccaaatgctat tccagccttttgccatcatgcttggtttttggcatagagagccagaataaacgtcccagt ctaaagagctcaccaattgaggtaactggaattacaattataggagttcctcagaccctt cctactcagccctgccagcttctgaccataagctgcaaccaactaactcagtcctatcca ctttctaagataatttatttcttctttcaacaattattttcactacatatattgattttg atcacataccaagttcaaaaacttattattcggtttctaagatttcaggaaacaaggcca actttatcctcatttccatcatctgactcacacaccaagcccctgaggcagcaacgccga acatattggtatcagcaattacaggctgactcatccaagaacgtgcacaattaa >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_2|201_aa MRKLTIMAESKGGAGVSHAILGKWFVMKQQVLIPSVMGYNLTGAISRELECLEARGWCFG KMMAGEELEARSLIYFLWTCTLAQCFSHLTVHTSHLEILLECISWARTAQSKGDNGHLYH LTDFYERASNVLPSLGGCVHVKNNLAKGLCAKIRKRGMRSHWLLGGLIYASRSMDQVSSH WEGTAGTTTPSLEVQQQGKMR >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_2|606_bp atgaggaagcttacaatcatggcggaaagcaaagggggagctggtgtgtcacatgctatt ttgggaaagtggtttgtaatgaagcaacaagttctgattccatctgtcatgggatacaat ttaactggggccatttcacgggaattggaatgtttagaagcccgtgggtggtgctttgga aagatgatggcaggagaggagcttgaggctaggagcctgatttacttcctgtggacttgc accctagcccagtgcttctcacacttgactgtgcacacgagtcacctggaaatcttgtta gaatgcatatcctgggccaggactgcccagagcaaaggtgacaatggccatctttatcac cttactgatttttatgaacgtgcttcaaatgttttacctagcctgggtggatgtgtacat gtgaagaataatcttgcaaaagggctttgtgcaaaaataaggaaacgggggatgagaagt cactggctgctaggtggccttatctatgccagccgttccatggatcaagtatcaagccac tgggaagggacagcaggcacaaccacgccatccttagaggtccagcagcaaggaaaaatg agatga >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_3|617_aa MAEAKTHWLGAALSLIPLIFLISGAEAASFQRNQLLQKEPDLRLENVQKFPSPEMIRALE YIENLRQQAHKEESSPDYNPYQGVSVPLQQKENGDESHLPERDSLSEEDWMRIILEALRQ AENEPQSAPKENKPYALNSEKNFPMDMSDDYETQQWPERKLKHMQFPPMYEENSRDNPFK RTNEIVEEQYTPQSLATLESVFQELGKLTGPNNQKRERMDEEQKLYTDDEDDIYKANNIA YEDVVGGEDWNPVEEKIESQTQEEVRDSKENIEKNEQINDEMKRSGQLGIQEEDLRKESK DQLSDDVSKVIAYLKRLVNAAGSGRLQNGQNGERATRLFEKPLDSQSIYQLIEISRNLQI PPEDLIEMLKTGEKPNGSVEPERELDLPVDLDDISEADLDHPDLFQNRMLSKSGYPKTPG RAGTEALPDGLSVEDILNLLGMESAANQKTSYFPNPYNQEKVLPRLPYGAGRSRSNQLPK AAWIPHVENRQMAYENLNDKDQELGEYLARMLVKYPEIINSNQVKRVPGQGSSEDDLQEE EQIEQAIKEHLNQGSSQETDKLAPVSKRFPVGPPKNDDTPNRQYWDEDLLMKVLEYLNQE KAEKGREHIAKRAMENM >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_3|1854_bp atggctgaagcaaagacccactggcttggagcagccctgtctcttatccctttaattttc ctcatctctggggctgaagcagcttcatttcagagaaaccagctgcttcagaaagaacca gacctcaggttggaaaatgtccaaaagtttcccagtcctgaaatgatcagggctttggag tacatagaaaacctccgacaacaagctcataaggaagaaagcagcccagattataatccc taccaaggtgtctctgtcccccttcagcaaaaagaaaatggcgatgaaagccacttgccc gagagggattcactgagtgaagaagactggatgagaataatactcgaagctttgagacag gctgaaaatgagcctcagtctgcaccaaaagaaaataagccctatgccttgaattcagaa aagaactttccaatggacatgagtgatgattatgagacacagcagtggccagaaagaaag cttaagcacatgcaattccctcctatgtatgaagagaattccagggataacccctttaaa cgcacaaatgaaatagtggaggaacaatatactcctcaaagccttgctacattggaatct gtcttccaagagctggggaaactgacaggaccaaacaaccagaaacgtgagaggatggat gaggagcaaaaactttatacggatgatgaagatgatatctacaaggctaataacattgcc tatgaagatgtggtcgggggagaagactggaacccagtagaggagaaaatagagagtcaa acccaggaagaggtgagagacagcaaagagaatatagaaaaaaatgaacaaatcaacgat gagatgaaacgctcagggcagcttggcatccaggaagaagatcttcggaaagagagtaaa gaccaactctcagatgatgtctccaaagtaattgcctatttgaaaaggttagtaaatgct gcaggaagtgggaggttacagaatgggcaaaatggggaaagggccaccaggctttttgag aaacctcttgattctcagtctatttatcagctgattgaaatctcaaggaatttacagata cccccagaagacttaattgagatgctcaaaactggggagaagccgaatggatcagtggaa ccggagcgggagcttgaccttcctgttgacctagatgacatctcagaggctgacttagac catccagacctgttccaaaataggatgctctccaagagtggctaccctaaaacacctggt cgtgctgggactgaggccctaccagacgggctcagtgttgaggatattttaaatctttta gggatggagagtgcagcaaatcagaaaacgtcgtattttcccaatccatataaccaggag aaagttctgccaaggctcccttatggtgctggaagatctagatcgaaccagcttcccaaa gctgcctggattccacatgttgaaaacagacagatggcatatgaaaacctgaacgacaag gatcaagaattaggtgagtacttggccaggatgctagttaaataccctgagatcattaat tcaaaccaagtgaagcgagttcctggtcaaggctcatctgaagatgacctgcaggaagag gaacaaattgagcaggccatcaaagagcatttgaatcaaggcagctctcaggagactgac aagctggccccggtgagcaaaaggttccctgtggggcccccgaagaatgatgatacccca aataggcagtactgggatgaagatctgttaatgaaagtgctggaatacctcaaccaagaa aaggcagaaaagggaagggagcatattgctaagagagcaatggaaaatatgtaa >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_4|91_aa MSGFASEAPALVPPKPAASPASCSSPIRKRHKSIMTFKALFLSLIRAEPEDDCNERLRLC ECVMCKLLPPCPSMRGCAETLMIKTVNLALG >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_4|276_bp atgtcggggtttgcgtctgaggcccccgccctggttcctccgaaaccggctgcatcacca gcatcgtgtagttctcccatcaggaaaagacacaaaagcatcatgacatttaaagccttg ttcctgagtctcattagggcagagcccgaggatgactgtaatgaacggctgcgcctctgt gaatgtgtaatgtgcaaactgctcccgccgtgcccctccatgcggggctgtgcggagaca ctgatgattaaaaccgtgaacctggccttgggctga >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_5|194_aa MERDPVSKKNKERIEAFASSQSLSKSAALRRAFFSCPKRRAGCSRTLNVTHRDIPDLGAL ELSSHSSHQVVTIGHSANDFSGFLGCGKIKWQNEGPRSTASPGGAAAQDVDAANREMLQT GTVDLDCPPKLVKLRPWLPELREEDWGWRMGVRDSPGRQHIGSQERQMQSYLGGSLSSDT RNQECGKTGGVPHG >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_5|585_bp atggagcgagatcctgtctcaaagaaaaacaaagaaagaatagaagcatttgcttcatct caatctctcagcaagtctgcagctttacgcagagctttcttcagttgtccgaagagaaga gcaggctgctcccggaccctgaacgtgacccacagggatattcctgatcttggggctttg gaattaagttctcatagctctcatcaagtagtcacaataggacactcagctaatgacttt tcaggatttcttggatgtggaaagataaagtggcaaaacgaaggcccaaggagcacagcc agccctggaggagctgcagcccaagatgttgatgctgcaaacagggagatgctgcaaaca gggacagtcgaccttgactgtcctcccaaactagtgaagctgagaccatggctacctgag ctgagagaggaagactggggctggcggatgggagtaagggatagtcctggccgccagcac ataggcagtcaggagaggcaaatgcagtcatacctgggtggcagcctgtctagtgacact cgaaaccaggaatgtggcaaaactggtggtgttcctcatggctag >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_6|93_aa MACSHRPDDTTCMGTDLAHRWMGIQHSSKIGERPEEEEEEKGEGEGEEEEEEEEEEEEEE EEEEEEEEEEEEGEGEGEGEGEEKQRVKMKVFE >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_6|282_bp atggcctgcagtcacagaccagatgacacaacttgcatgggaacagacctggcacacagg tggatgggtatccagcactcctctaagataggggagaggccagaagaagaagaagaagaa aaaggagaaggagaaggagaggaagaggaagaggaagaggaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaaggagaaggagaaggagaaggagaa ggagaagaaaaacagagggtgaaaatgaaggtgtttgaatag >gi568815596r:223497432_223699282|GENSCAN_predicted_peptide_7|453_aa MTSLNADLGRRGHNWLLRAHASGCGCFVSGNLSDITAPDPRLLHSCDISTKHKTQRFTTT GEESVRELRGVRWSIKTMHQTEKTAMHRPCTMPGVRRTASPTFNKAYQNPELSGWSYQTM QQPYGTELEFQFVTGYYFFPRVQPAPVIKLGLFPSLSNLDHRIHPASKIQPISSSRIDTF DEDIGKIVAQRAPRTLIHRLKRHKEAWISSSLGNSSEDKVFLHLAISSISHQAGQFRVII QWQNENPPGPIQGALESAFPQGDSDVKALIKILSSLSWIRRTEEQCDVPQAKMFLSVLQM ITLLLNSYDGQPFYTSANYMQEARSLRCLSDSFMVQSRDSGSARKHLQSSTVTFAKAAGK AQHSLLQAPSQLVRATGPNLASLQRGSLPMGLRISCFFIKGLCKFHVPGYRFPLLPALKP AVMPSSDRVRARIATKGQHAKADGVKEWKNLDL >gi568815596r:223497432_223699282|GENSCAN_predicted_CDS_7|1362_bp atgacatcactgaatgctgacttgggaagacgtggacacaattggctcctaagagcccat gcaagtggctgtggctgctttgtgtctgggaatctctccgacattactgctccagatccc aggctgcttcattcttgtgacatctccaccaagcacaagactcaaaggttcaccacaaca ggagaagagagtgttagagagctgcgaggtgtcaggtggtcaattaagacaatgcaccaa actgaaaagactgcaatgcaccggccatgcactatgcctggtgtgaggaggacagcctca cctaccttcaataaggcctaccagaaccctgaattaagtgggtggtcatatcagaccatg cagcagccctatggaacagaattagagtttcagtttgtcacaggatactatttcttcccc agggtccagcccgctcctgtcatcaagcttgggctatttccttccctctccaacttagac cacaggatccaccccgcaagcaagatccagccaatatcctcaagtagaattgacacattc gatgaggacattggaaagatagtggcacaaagagcaccgaggacgttgatacatcggctc aaaaggcataaagaggcttggatctcttcctcactgggaaacagcagcgaagacaaagtc tttcttcaccttgcaatttccagcatctcccaccaggctgggcaattcagggtaattatt caatggcagaatgagaatcccccaggcccaattcagggagccttggaatctgcatttcca caaggtgattctgatgtaaaagcactgatcaagattttatcttctttatcctggattagg agaactgaagaacaatgcgatgtaccacaggcaaaaatgtttctttcagtccttcaaatg attactctccttttaaactcttatgatggccaacccttctatacatctgcaaactatatg caggaagctaggagtctgagatgtttgagtgattcatttatggtacagtcaagggacagt gggtctgctagaaaacacctgcaaagcagcacagtgacatttgctaaagctgctggaaaa gctcaacactccctcttgcaggctccctcacagctggtaagggcaacaggacccaattta gccagtcttcaaaggggtagtctgccaatgggattgagaatatcttgtttcttcataaaa ggactttgcaagttccatgttcctggctaccgttttcctcttcttcctgccttgaaaccg gctgtgatgccttcttctgaccgtgtaagggcaaggatagccacaaaaggccaacacgct aaggctgatggagtaaaagagtggaaaaacctggatctttga