GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:41:49 Sequence gi568815593f:146239113_146440441 : 201329 bp : 41.05% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12599 12763 165 0 0 130 115 185 0.977 24.41 1.02 Intr + 15831 15980 150 1 0 113 78 134 0.999 14.11 1.03 Intr + 19337 19481 145 0 1 100 69 121 0.999 9.82 1.04 Intr + 21633 21786 154 0 1 62 99 102 0.999 7.85 1.05 Intr + 22398 22694 297 2 0 62 84 209 0.886 14.05 1.06 Intr + 24379 24519 141 0 0 67 85 133 0.998 10.53 1.07 Intr + 28537 28656 120 0 0 102 92 14 0.884 2.97 1.08 Intr + 30095 30169 75 1 0 83 116 30 0.943 4.09 1.09 Intr + 30308 30472 165 1 0 82 55 104 0.973 5.74 1.10 Intr + 31842 31946 105 2 0 60 78 115 0.981 7.19 1.11 Intr + 32371 32562 192 0 0 93 89 143 0.994 13.67 1.12 Intr + 45510 45620 111 2 0 39 84 106 0.964 4.96 1.13 Term + 46835 46918 84 1 0 89 44 128 0.997 5.17 1.14 PlyA + 47815 47820 6 1.05 2.08 PlyA - 47908 47903 6 1.05 2.07 Term - 56072 55894 179 0 2 61 48 112 0.293 1.37 2.06 Intr - 72800 72739 62 1 2 69 100 63 0.095 3.06 2.05 Intr - 81212 81086 127 0 1 48 61 109 0.067 3.02 2.04 Intr - 93931 93774 158 0 2 -18 107 137 0.674 3.73 2.03 Intr - 94191 94023 169 1 1 118 44 72 0.626 3.88 2.02 Intr - 94675 94638 38 0 2 100 116 3 0.894 1.29 2.01 Init - 95215 94983 233 1 2 69 47 153 0.485 5.38 2.00 Prom - 95712 95673 40 -13.01 3.00 Prom + 96436 96475 40 -5.25 3.01 Init + 96896 96932 37 1 1 78 89 34 0.829 2.72 3.02 Intr + 98308 98498 191 2 2 75 72 103 0.677 5.78 3.03 Intr + 99293 99379 87 1 0 49 41 126 0.360 3.25 3.04 Intr + 99548 99636 89 1 2 64 47 59 0.474 -2.75 3.05 Intr + 99991 100120 130 1 1 48 98 100 0.810 6.78 3.06 Term + 100436 101332 897 1 0 123 38 1109 0.973 100.78 3.07 PlyA + 102540 102545 6 1.05 4.05 PlyA - 102604 102599 6 1.05 4.04 Term - 103401 103239 163 2 1 14 42 192 0.295 3.63 4.03 Intr - 104316 104159 158 0 2 47 26 172 0.198 4.79 4.02 Intr - 113671 113571 101 2 2 90 57 42 0.013 0.21 4.01 Init - 137900 137603 298 2 1 48 103 157 0.191 10.63 4.00 Prom - 140531 140492 40 -5.25 5.00 Prom + 145372 145411 40 -7.55 5.01 Init + 156048 156187 140 2 2 40 -16 254 0.352 9.86 5.02 Intr + 161496 161783 288 0 0 93 101 135 0.358 10.74 5.03 Term + 168083 168134 52 2 1 128 54 31 0.191 -0.38 5.04 PlyA + 168226 168231 6 1.05 6.00 Prom + 185264 185303 40 -3.75 6.01 Init + 189373 189468 96 0 0 64 40 130 0.679 6.16 6.02 Intr + 193691 193921 231 1 0 64 93 200 0.952 15.15 6.03 Intr + 195617 195719 103 1 1 74 33 95 0.165 1.43 6.04 Term + 200185 200294 110 2 2 84 54 56 0.125 -0.51 6.05 PlyA + 200751 200756 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 140688 140909 222 2 0 73 38 155 0.836 4.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:146239113_146440441|GENSCAN_predicted_peptide_1|634_aa XQPMYSREHGAAASERLQLGTPPPLLAARLVPPRNLMGSSIGYHTSVSSPTPLVPDTYEP DGYNPEAPSITSSGRSQYRQFFSRTQTQRPNLIGLTSGDMDVNPRAANIVIQTEPPVPVS INSNITRVVLEPDSRKRAMSGLEGPLTKKPWLGKQGNNNQNKPGFLRKNQYTNTKLEVKK IPQELNNITKLNEHFSKFGTIVNIQVAFKGDPEAALIQYLTNEEARKAISSTEAVLNNRF IRVLWHRENNEQPTLQSSAQLLLQQQQTLSHLSQQHHHLPQHLHQQQVLVAQSAPSTVHG GIQKMMSKPQTSGAYVLNKVPVKHRLGHAGGNQSDASHLLNQSGGAGEDCQIFSTPGHPK MIYSSSNLKTPSKLCSGSKSHDVQEVLKKKQEAMKLQQDMRKKRQEVLEKQIECQKMLIS KLEKNKNMKPEERANIMKTLKELGEKISQLKDELKTSSAVSTPSKVKTKTEAQKELLDTE LDLHKRLSSGEDTTELRKKLSQLQVEAARLGILPVGRGKTMSSQGRGRGRGRGGRGRGSL NHMVVDHRPKALTVGGFIEEEKEDLLQHFSTANQGPKFKDRRLQISWHKPKVPSISTETE EEEVKEEETETSDLFLPDDDDEDEDEYESRSWRR >gi568815593f:146239113_146440441|GENSCAN_predicted_CDS_1|1905_bp ngacagcccatgtactctcgtgaacatggtgctgctgcatctgagcgacttcagttgggg acaccgcctcctctgttggcagctcgtttggtgccacctcgaaacctcatgggatcctcc attggataccatacctcagtctccagccctacccctctggttccagatacatatgaacca gatggttacaacccagaagctcctagtattactagttctggtagatctcagtacagacag ttcttttcaagaactcagacacagcgtcccaatctgattggcctaacatctggagatatg gatgtaaatccaagagctgctaacattgtgatccagactgaaccaccagttcctgtttcg attaatagcaacataaccagagtagttcttgaaccagatagtcgaaaaagagctatgagt ggtttggaagggccactcacaaagaaaccttggctgggaaagcaaggaaataacaatcaa aataaaccagggttcttacgaaagaatcagtatacaaacaccaaattagaagtcaagaaa atccctcaggaattgaacaacattaccaagctcaatgaacacttcagcaaatttggaact attgttaatatccaggttgcttttaagggtgacccagaagcagccctaatccaatatctt accaatgaggaggccaggaaagccatttctagcacagaagcagttctaaacaaccgattc attcgagtcttgtggcatagggaaaataatgagcaaccgacactacagtcctcagcacag ctgctcctgcaacaacagcaaacacttagtcacctctcacagcagcaccatcacctgcca cagcatctacatcagcagcaggtgctagtggcccagtctgctccttcaacagtgcacgga ggtatccagaagatgatgagcaaaccacagacatcaggtgcatatgttcttaacaaagtt cctgttaaacatcgtcttggacatgcaggtggtaaccagagtgatgcatcacatttgttg aatcagtctggtggtgctggagaagattgccagatattttcaactccaggccatccaaaa atgatttacagctcctcaaacttaaagacaccttcaaagctctgttcagggtctaaatct catgatgttcaagaagtgcttaaaaaaaaacaggaagcaatgaagttacaacaagatatg aggaaaaaaagacaggaagtgttagaaaagcaaatagaatgccaaaagatgttaatatcc aagttagaaaaaaacaaaaacatgaaaccagaagaaagagcaaatataatgaagactttg aaagagcttggagagaagatctcacaattaaaagatgaattaaaaacatcttctgcagtc tccacaccatctaaagtgaagacaaaaacggaggcccagaaggagttattagatactgaa ctggacctccacaagaggctgtcctcaggagaagacaccacagaattacggaaaaaactc agtcagttacaggttgaggctgcacggttaggtattttacctgtgggtcgaggaaagacc atgtcctctcaaggtcgaggaagaggccgagggcgtggaggaagaggaaggggctcacta aatcacatggtggtggaccatcgtcccaaagcactaacagttggaggattcattgaggaa gaaaaagaagacttgcttcagcatttctcaaccgcaaaccaagggccaaaatttaaagac cgtcggctacagatatcatggcacaagcccaaggtaccatctatatccactgagactgaa gaagaagaagtcaaggaggaggaaacagaaacctcagatttgtttttgcctgatgatgac gatgaagatgaagatgaatatgagtctcgctcatggcgaagatga >gi568815593f:146239113_146440441|GENSCAN_predicted_peptide_2|321_aa MPSRSEAAGGAAWPGVRSRLTCLPLPFSGRLSQTSHMYSPPAQLLRDSSVGRCLALSAKW LRGSETPRGIQGPEAERPPSYPHGAEASAEGPAVCSSLSSRLENICVISSLASSVLSGLN ESILEDENWFCSPLLQFRDILKFKAVRGGGQSTGGVDSNMGTWLTCWVTLDKSLMGSESQ FPYHYKRVRETGSIVPSGGVATPLSDVDDHDEDEDSYGYCARHCARGFTEISLFNPSNSP MRSGSPSLQMMQILVPTTDLAEGHAERKTSEAIYCYHFCYPKSMKHKTASDDNQNLKMNK KAILHGCKTDEAKYAMLFLQY >gi568815593f:146239113_146440441|GENSCAN_predicted_CDS_2|966_bp atgccatcgcgcagcgaggcagctggcggggctgcctggccaggtgttcgcagccgcctg acttgccttcctctccctttctctggtcgcctttcacaaacttctcacatgtactcgccg cctgcgcagctgctgcgggactcgtctgttggccgctgccttgcattgtctgcaaagtgg ctcagaggatccgagacgcccagagggattcaggggccagaggccgagcgtcccccctca tacccccatggagctgaggcttctgcagaaggtcctgctgtgtgttcatctttgagctct agacttgagaatatctgtgtcatcagcagcttggctagctctgtgctatctggactcaat gagtccatcctggaagatgagaactggttttgctctcctcttctgcaattcagagacatc ttaaagttcaaagccgtaagaggtggtggacagagcactggaggtgtggattcaaatatg ggtacatggctaacctgctgggtgaccctagacaagtcactcatgggctctgagtctcag tttccttatcactacaagagggttagggagacaggctccatcgtcccctctggtggagtg gctactcctctcagtgatgttgacgatcatgatgaagatgaagatagttatggatactgt gccaggcactgtgctagaggctttactgagatttctttatttaatccttccaacagtcct atgagatcaggttctccttctctgcagatgatgcagattttagttccaaccacagaccta gctgaaggccatgctgaaagaaagacaagtgaagccatctactgttaccacttctgctac cctaagtctatgaagcataaaactgccagtgatgataatcaaaacctcaaaatgaataaa aaggcaatcctccatggttgtaaaacagacgaggcaaaatatgctatgttgtttttacaa tattga >gi568815593f:146239113_146440441|GENSCAN_predicted_peptide_3|476_aa MGKTLKHMGPDTGREPAEGAGSDGEKSNWRAPRGRGSPDLSGRPLGLRDVPLNHTTPPRS DSGKPGTHIPCTITGRPLTLSRARLGAWRRLLAAPRDRTLVDSLGAVRAGTGRRGWDFWG GMGEGKSTKKKESERSAKMMAMNSKQPFGMHPVLQEPKFSSLHSGSEAMRRVCLPAPQLQ GNIFGSFDESLLARAEALAAVDIVSHGKNHPFKPDATYHTMSSVPCTSTSSTVPISHPAA LTSHPHHAVHQGLEGDLLEHISPTLSVSGLGAPEHSVMPAQIHPHHLGAMGHLHQAMGMS HPHTVAPHSAMPACLSDVESDPRELEAFAERFKQRRIKLGVTQADVGAALANLKIPGVGS LSQSTICRFESLTLSHNNMIALKPVLQAWLEEAEAAYREKNSKPELFNGSERKRKRTSIA APEKRSLEAYFAIQPRPSSEKIAAIAEKLDLKKNVVRVWFCNQRQKQKRMKYSAVH >gi568815593f:146239113_146440441|GENSCAN_predicted_CDS_3|1431_bp atgggaaaaaccctaaaacacatgggaccagacacagggcgggaaccagcggagggggct ggctcggatggggagaaaagcaactggagggcgccgaggggaagagggagcccggatctg tcagggcgtcctcttggactaagggatgttcccctaaaccacaccaccccacctcgttca gattctgggaaacccggcacgcacataccctgcacaataacaggcaggcccctcaccctc tccagggcccgtctgggcgcttggaggcgcctcctcgctgcgccgcgggaccggactctg gtggacagcttgggcgcagttcgagctgggacaggacggagaggttgggacttttggggt ggcatgggggaagggaagtccacgaagaagaaagaatcggaaaggagcgcgaagatgatg gccatgaactccaagcagcctttcggcatgcacccggtgctgcaagaacccaaattctcc agtctgcactctggctccgaggccatgcgccgagtctgtctcccagccccgcagctgcag ggtaatatatttggaagctttgatgagagcctgctggcacgcgccgaagctctggcggcg gtggatatcgtctcccacggcaagaaccatccgttcaagcccgacgccacctaccatacc atgagcagcgtgccctgcacgtccacttcgtccaccgtgcccatctcccacccagctgcg ctcacctcacaccctcaccacgccgtgcaccagggcctcgaaggcgacctgctggagcac atctcgcccacgctgagtgtgagcggcctgggcgctccggaacactcggtgatgcccgca cagatccatccacaccacctgggcgccatgggccacctgcaccaggccatgggcatgagt cacccgcacaccgtggcccctcatagcgccatgcctgcatgcctcagcgacgtggagtca gacccgcgcgagctggaagccttcgccgagcgcttcaagcagcggcgcatcaagctgggg gtgacccaggcggacgtgggcgcggctctggctaatctcaagatccccggcgtgggctcg ctgagccaaagcaccatctgcaggttcgagtctctcactctctcgcacaacaacatgatc gctctcaagccggtgctccaggcctggttggaggaggccgaggccgcctaccgagagaag aacagcaagccagagctcttcaacggcagcgaacggaagcgcaaacgcacgtccatcgcg gcgccggagaagcgttcactcgaggcctatttcgctatccagccacgtccttcatctgag aagatcgcggccatcgctgagaaactggaccttaaaaagaacgtggtgagagtctggttc tgcaaccagagacagaaacagaaacgaatgaagtattcggctgtccactga >gi568815593f:146239113_146440441|GENSCAN_predicted_peptide_4|239_aa METFTTGDSKLNTGDGKGLVKSPDVDLKSSGISSKGSARSHLTGYDQYWGLYSSKGSLSN FWTTLMVSEAFCRLSSNNSLLVVLGQQCVATEIKSNSSSSTPLGPHKTKVSLLSHKSPSD FGLKNTSPFSSTGTSGKGDYTILGKPHLQLLPEASRRVRRRRRSGALRRRERVERFADRA AAPACTKAFLQLANKKRERKENRELASMEHLLCTGHYARRFAAVIAVPPQRSRGGPQSE >gi568815593f:146239113_146440441|GENSCAN_predicted_CDS_4|720_bp atggaaaccttcaccactggtgactccaagctgaacacaggggatggcaaaggcctagtg aagagtccagatgtagacttaaagagctctggaatctcctccaagggttcagccagaagc catctgactggctatgaccagtattggggactttattcttccaaaggcagtctatccaat ttttggaccactctgatggtgtcagaggccttctgtagattgagctcaaataactcactt ctagttgtcttaggtcaacaatgtgtggccacagagattaagtcaaactcctcttcatct acacccctggggccccataaaacaaaagtaagccttctttcacataaaagcccttcagat ttcggccttaaaaacacaagtccattttcttctacaggaacgagtggcaaaggcgactac acaatcctggggaaaccacatctccagttgcttcctgaagcaagccgaagagtgaggcga cggcgccgttccggcgcactcaggaggagggaaagggtggagcgcttcgctgaccgggct gcagctccggcctgcacgaaggcgtttctacagctcgcaaataaaaagagggagcgtaaa gagaacagggagctggcgtctatggagcacctactatgtacagggcactatgctaggcgc tttgcagcagttatagcagtacccccacaaagaagccgaggtggaccccagagtgaataa >gi568815593f:146239113_146440441|GENSCAN_predicted_peptide_5|159_aa MTKYGPSAKERQYGAILDATQEDHLKRQRDLDRQKLEVESEMPGGYGFLLRAFPQLIICT RIPSQPLSLENLRQAPFMDSVLGMGDGGGHPEKKRTGLKLGQGPQPCRLTLPHPSHPDST VQMPWMCHLECHQLLKKFQKKGINQESTGLWLRTVGRHN >gi568815593f:146239113_146440441|GENSCAN_predicted_CDS_5|480_bp atgaccaaatatggaccaagcgcaaaagaacgccagtatggagccatcttagatgctacc caagaggaccacctgaagaggcaacgggaccttgatagacagaagctggaagtggaatct gagatgcccgggggctatgggtttcttctcagggctttccctcaattaatcatttgcaca agaattccatctcagcctctgagtctagagaacctaagacaagcccctttcatggacagt gtgctgggcatgggagatggaggagggcatccagaaaagaagagaacagggctgaagcta ggacagggaccacaaccatgtcgccttactttgccccatccttcccaccctgattctact gtgcagatgccatggatgtgccacttagaatgccatcaattattgaagaagttccaaaag aagggaataaatcaagaaagcacaggactctggctcagaactgttggcaggcacaactga >gi568815593f:146239113_146440441|GENSCAN_predicted_peptide_6|179_aa MNGYEVALVVSPSGPSGGLDVEDSDTERAAKAKGYTRQQFCHESTLNKGDQVIYNLTCLP YCCVRFPLAGTGPHILYLSRVASNLELFKRGKGRGEQRKEEVTCGMLRKFPPSQRSLPPN TVPASVFLPNYKEYDIKIQSKCSRNTRVRGSNQTDPEGKQSWSSASPYYDKRNCSRLKA >gi568815593f:146239113_146440441|GENSCAN_predicted_CDS_6|540_bp atgaatggatatgaggtggctttggtggtctcaccttcaggaccctctggtgggttggat gtggaggacagtgacacggagagagctgccaaggctaaagggtacactcgccagcagttt tgccatgagagtacactgaacaaaggagaccaggtcatttacaacctgacgtgtctaccc tactgctgtgtccggtttccactggctggaacgggacctcacattctatatttgtcccga gtggctagcaacttagaactttttaaaagaggcaaaggcagaggagaacaaaggaaggag gaagtaacttgtggaatgctgagaaagtttccaccttcacaacggagtctccctccaaac acagtccctgcttctgtgtttctgcctaactacaaagagtatgacataaagattcagagc aaatgcagtagaaacacaagggttagaggaagcaatcagacagatccagaggggaaacag tcctggtcttcagcaagtccatattatgataaaaggaactgttctagattaaaagcttga