GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:59:54 Sequence gi568815593f:126677509_126936261 : 258753 bp : 43.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 100001 100359 359 1 2 116 89 862 0.440 83.68 1.02 Intr + 127268 127424 157 2 1 61 110 155 0.995 15.01 1.03 Intr + 128063 128188 126 1 0 71 78 87 0.988 6.88 1.04 Intr + 132672 132842 171 2 0 98 61 147 0.926 13.14 1.05 Intr + 134265 134390 126 2 0 100 84 86 0.958 10.28 1.06 Intr + 141414 141634 221 2 2 24 94 248 0.874 16.10 1.07 Intr + 143402 143627 226 2 1 91 90 210 0.999 19.49 1.08 Intr + 145273 145377 105 0 0 99 98 4 0.880 2.91 1.09 Intr + 148480 148599 120 0 0 97 64 126 0.952 11.79 1.10 Intr + 155186 155293 108 1 0 63 67 129 0.951 8.78 1.11 Term + 159057 159065 9 2 0 78 55 0 0.095 -6.11 1.12 PlyA + 159079 159084 6 1.05 2.00 Prom + 159151 159190 40 -2.26 2.01 Init + 177200 177620 421 1 1 65 47 262 0.307 14.35 2.02 Intr + 179495 179579 85 0 1 61 12 78 0.122 -3.62 2.03 Intr + 181138 181201 64 1 1 86 87 53 0.286 3.72 2.04 Intr + 183365 183434 70 1 1 46 100 49 0.378 0.65 2.05 Term + 186704 186858 155 0 2 30 39 140 0.413 1.38 2.06 PlyA + 186932 186937 6 1.05 3.08 PlyA - 186948 186943 6 1.05 3.07 Term - 193283 193125 159 2 0 122 41 121 0.997 8.74 3.06 Intr - 200886 200677 210 0 0 102 94 112 0.525 12.31 3.05 Intr - 202491 202363 129 0 0 110 54 36 0.504 3.29 3.04 Intr - 211745 211625 121 1 1 43 63 37 0.103 -2.80 3.03 Intr - 214531 213733 799 2 1 15 91 338 0.265 17.32 3.02 Intr - 215607 215110 498 1 0 46 81 293 0.353 17.36 3.01 Init - 217820 217622 199 2 1 58 -8 261 0.410 12.56 3.00 Prom - 218905 218866 40 -1.16 4.03 PlyA - 221005 221000 6 1.05 4.02 Term - 226617 226581 37 2 1 93 37 40 0.520 -3.79 4.01 Init - 227908 226965 944 1 2 86 53 272 0.742 16.63 4.00 Prom - 228151 228112 40 -8.96 5.02 PlyA - 228299 228294 6 1.05 5.01 Sngl - 229925 228831 1095 2 0 44 35 372 0.991 24.50 5.00 Prom - 230018 229979 40 -4.96 6.04 PlyA - 230189 230184 6 1.05 6.03 Term - 231525 230417 1109 1 2 29 43 568 0.931 38.63 6.02 Intr - 237626 237418 209 1 2 105 89 116 0.970 12.12 6.01 Init - 240663 240476 188 0 2 74 52 257 0.994 19.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:126677509_126936261|GENSCAN_predicted_peptide_1|575_aa MATATPVPPRMGSRAGGPTTPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKCKAEHDQLLLN YAKKESDLNGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLAAA KKQLADETLLKVDLENRCQSLTEDLEFRKSMYEEEINETRRKHETRLVEVDSGRQIEYEY KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI ESLSSQLSNLQKESRACLERIQELEDLLAKEKDNSRRMLTDKEREMAEIRDQMQQQLNDY EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV SYKYTSRYVLKAGQTVTIWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA QRSTVFKTTIPEEEEEEEEAAGVVVEEELFHQQFL >gi568815593f:126677509_126936261|GENSCAN_predicted_CDS_1|1728_bp atggcgactgcgacccccgtgccgccgcggatgggcagccgcgctggcggccccaccacg ccgctgagccccacgcgcctgtcgcggctccaggagaaggaggagctgcgcgagctcaat gaccggctggcggtgtacatcgacaaggtgcgcagcctggagacggagaacagcgcgctg cagctgcaggtgacggagcgcgaggaggtgcgcggccgtgagctcaccggcctcaaggcg ctctacgagaccgagctggccgacgcgcgacgcgcgctcgacgacacggcccgcgagcgc gccaagctgcagatcgagctgggcaagtgcaaggcggaacacgaccagctgctcctcaac tatgctaagaaggaatctgatcttaatggcgcccagatcaagcttcgagaatatgaagca gcactgaattcgaaagatgcagctcttgctactgcacttggtgacaaaaaaagtttagag ggagatttggaggatctgaaggatcagattgcccagttggaagcctccttagctgcagcc aaaaaacagttagcagatgaaactttacttaaagtagatttggagaatcgttgtcagagc cttactgaggacttggagtttcgcaaaagcatgtatgaagaggagattaacgagaccaga aggaagcatgaaacgcgcttggtagaggtggattctgggcgtcaaattgagtatgagtac aagctggcgcaagcccttcatgagatgagagagcaacatgatgcccaagtgaggctgtat aaggaggagctggagcagacttaccatgccaaacttgagaatgccagactgtcatcagag atgaatacttctactgtcaacagtgccagggaagaactgatggaaagccgcatgagaatt gagagcctttcatcccagctttctaatctacagaaagagtctagagcatgtttggaaagg attcaagaattagaggacttgcttgctaaagaaaaagacaactctcgtcgcatgctgaca gacaaagagagagagatggcggaaataagggatcaaatgcagcaacagctgaatgactat gaacagcttcttgatgtaaagttagccctggacatggaaatcagtgcttacaggaaactc ttagaaggcgaagaagagaggttgaagctgtctccaagcccttcttcccgtgtgacagta tcccgagcatcctcaagtcgtagtgtacgtacaactagaggaaagcggaagagggttgat gtggaagaatcagaggcgagtagtagtgttagcatctctcattccgcctcagccactgga aatgtttgcatcgaagaaattgatgttgatgggaaatttatccgcttgaagaacacttct gaacaggatcaaccaatgggaggctgggagatgatcagaaaaattggagacacatcagtc agttataaatatacctcaagatatgtgctgaaggcaggccagactgttacaatttgggct gcaaacgctggtgtcacagccagccccccaactgacctcatctggaagaaccagaactcg tggggcactggcgaagatgtgaaggttatattgaaaaattctcagggagaggaggttgct caaagaagtacagtctttaaaacaaccatacctgaagaagaggaggaggaggaagaagca gctggagtggttgttgaggaagaacttttccaccagcagttcttatga >gi568815593f:126677509_126936261|GENSCAN_predicted_peptide_2|264_aa MARAGRLCWGTWRPPLQLLAQVLSPSLPPGWGRRPAARSAGACRAHAHPELTLACKCAPN PGSHRRLSLHTSPQAEGASSSLGQPRVELPQCSGRLKGSSSAARVGAEVEEAPRASEGCQ HAVTSHYEQLAGQSWSGPKEELPQCKDSVELQRLTRPELPTTLAYCPECLLLFKGELTAI QGDGTSARMETKSKREDGNKTSNEEEDNCRLNEGALGRCHCHFTESYKQLPRGEAGHMEQ HCAIGLCKTPGRGTAQLWPDQPVE >gi568815593f:126677509_126936261|GENSCAN_predicted_CDS_2|795_bp atggcacgggcaggccggctgtgctgggggacctggcgcccccctctgcagctgctggcc caggtgctaagcccctcactgcccccgggctgggggcgcaggccggctgctcggagtgcg ggggcctgccgagcccacgcccacccggaactcacgctggcctgcaagtgtgcgcccaac cctggttcccaccggcgcctctccctccacacctccccacaagcagagggagccagctcc agcctcggccagcccagagtggagctcccacagtgcagcggcaggctgaagggctcctca agcgcggccagagtgggcgcggaggtcgaggaggcgccaagagcgagcgagggctgccag cacgctgtcacctctcactatgaacaattggcaggacagtcatggtctggacctaaggaa gagctcccacaatgcaaagattctgtggagctgcaaaggttgacccggccagagctgccc acgaccctggcgtactgtccagaatgccttctcctcttcaaaggcgagctgactgcaatt cagggagatggaacttccgccaggatggagacaaaaagtaaaagagaggatggaaataaa acatcaaatgaagaagaggataactgcaggctgaatgaaggagccttgggccgctgccac tgccatttcacggagtcctacaaacagctgccacgtggagaggcgggccacatggagcag cactgtgccatcggactgtgtaagacacctggcagaggtactgcccagctgtggccagat caacccgtggaataa >gi568815593f:126677509_126936261|GENSCAN_predicted_peptide_3|704_aa MENDFEELREEGFRRSNYSELQEDIQTKGKEVENFEKTLEECITGITNTEKCLKELMELK TKARELQIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIV AIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIDKEGILPNSFYEASIILIRK PGRDITKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQENKIPRNP TTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKL PMPFFTELEKTTLKFIWNQKRARVAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQ NRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPF LTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWD LIKLKSFCTAKETTIRVNSVSHDHVLLLEVIIQMFRNPDSNSSPTEWSLDTSNLPVWGPW FCRIRCPGHLGCKWPVLLFGKPAVWHCWARISRQSMLGPCLLWLRNPGPQHEKRTLFGDM VCFLFITPLATISGWLCLRGAVDHLHFSSRLEAVGLIALTVALFTIYLFWTLVSFRYHCR LYNEWRRTNQRVILLIPKSVNVPSNQPSLLGLHSVKRNSKETVV >gi568815593f:126677509_126936261|GENSCAN_predicted_CDS_3|2115_bp atggagaatgactttgaggagctgagagaagaaggcttcagacgatcaaattactctgag ctacaggaggacattcaaaccaaaggcaaagaagttgaaaactttgaaaaaactttagaa gaatgtataacaggaataaccaatacagagaagtgcttaaaggagctgatggagctgaaa accaaggctcgagaactacaaatacaaactaccatcagagaatactacaaacacctctac gcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacaccctccca agactcaaccaggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtg gcaataatcaatagcttaccaacgaaaaagagtccaggaccagatggattcacagctgaa ttctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatcaata gacaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacgaaag ccaggcagagacataacaaaaaaagagaattttagaccaatatccttgatgaacattgat gcaaaaattctcaataaaatactggcaaaacgaatccagcagcacatcaaaaagcttatc caccatgatcaagtgggcttcatccctgggatgcaagagaataaaatacctaggaatcca actacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatc gtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagcta ccaatgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcccgcgtcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacacta cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcagtggaacagaacagagccctcagaaataacgccgcatatctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaa accataaaaacgctagaagaaaacctaggcattaccattcaggacataggcatgggcaag gacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagtgta tcccatgatcatgtacttctgctggaagtaattattcaaatgttcagaaacccagactcc aacagcagtccaacggagtggtcattagacaccagtaaccttccagtctgggggccatgg ttttgtaggattcgctgccctggccacttgggctgcaagtggccagtgctcctgtttggc aagccagctgtgtggcactgctgggcacgcatcagcagacagtcaatgctaggtccctgt ttgttgtggctgagaaaccctggcccccagcatgagaagcggactctgtttggcgacatg gtgtgcttcttgtttataactcccctggccaccatctcgggctggctgtgcctgcggggc gccgtggaccacctgcactttagtagtcggctggaagccgtcggactgattgcactcact gtcgcactcttcactatttacctcttttggacactagtgtcatttaggtaccactgtcga ttgtacaacgagtggcgtcggaccaatcagagggtgattctcctcattccaaagtctgtc aatgtaccttctaaccagccgtccttgctgggcctccattcggtcaagaggaactcaaag gagacagttgtttga >gi568815593f:126677509_126936261|GENSCAN_predicted_peptide_4|326_aa MAILPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDF MSKTPKAMATKAKIDKWDLIELKSFCTAKETTIRVNRQPTEWEKIFPTYSSDKGLISRIY NELKQIYKKKTNNPIKKWAKDMNRHFSKEDIHAAKKHMKKCSPSLAIRETQIKATMRYHL TPVRMAIIKKSGNNRDVDEIGNHHSQ >gi568815593f:126677509_126936261|GENSCAN_predicted_CDS_4|981_bp atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg cctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt gaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggagaaaattttcccaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag gacatgaacagacacttctcaaaagaagacattcatgcagccaaaaaacacatgaaaaaa tgctcaccatcactggccatcagagaaacacaaatcaaagccacaatgagataccatctc acaccagttagaatggcaatcattaaaaagtcaggaaacaacagggacgtggatgaaatt ggaaatcatcattctcagtaa >gi568815593f:126677509_126936261|GENSCAN_predicted_peptide_5|364_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNCLSDHSAIKLELRVKNLTQNRSTTWKLNNLLLNDYWV RNEMKAEIKMFFETNDNKDTTYQHLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEMQEQTHSKASRRQEITKIRAELKEIETQKTFQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTMREYYKHLYANKLENLEEIDKFLDTYTLPRLN QEEVESLHRQMTGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELGTIPSETIPINRQR GNPP >gi568815593f:126677509_126936261|GENSCAN_predicted_CDS_5|1095_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcagggttaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cgtaacgaaatgaaggcagaaataaagatgttctttgaaaccaacgacaacaaagacaca acataccagcatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaatgcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacctttcaaaaaattaacgaatcc aggagctggttttttgaaaggatcaacaagattgatagaccactagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccact gatcccacagaaatacaaactaccatgagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaattgataaattcctggacacatacactctcccaagactgaac caggaagaagttgaatctctgcatagacaaatgacaggagctgaaattgtggcaataatc aatagcttaccaacgaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactgggtaccattccttctgaaactattccaatcaatagacaaaga gggaatcctccctaa >gi568815593f:126677509_126936261|GENSCAN_predicted_peptide_6|501_aa MTTSRCSHLPEVLPDCTSSAAPVVKTVEDCGSLVNGQPQYVMQVSAKDGQLLSTVVRTLA TQSPFNDRPMCRICHEGSSQEDLLSPCECTGTLGTIHRSCLEHWLSSSNTSYCELCHFRF AVERKPRPLVEVKGKLTNRKDIHTKNPSVHHHHQRPKVDKTTNLGKKHSRKTGNSKKQST SPPPKERRSSPATEQSWMENDFEELREEGFRRWNYSELQEDIQTKGKEVENFEKTLEECI TGITNTEKCLKELMELKTKAREQHEECRSLRSRCDQLEERIPLMEDEMNEMKQEGKFREK RIKRNEQSLQEIWDYVKRPNLCLIGVPESDRENGTKLENILQDIIQENLPNLARQANIQI QEIQRTPQRYSSRRATPRHIIVRFTKVAMKEKILRAAREKGRVTLKEKPIRLTADLSAGT LQARRGWGPIFNILKEKNFQPRISYPAKLTFISEGEIEYFTDKQMLRDFVTTRPALKELL KEVLNMERNNRYQPLQNHAKM >gi568815593f:126677509_126936261|GENSCAN_predicted_CDS_6|1506_bp atgacaaccagccgctgcagtcacctgcccgaagtcctgccagactgcaccagctcagct gcacccgtggtgaagacggtggaggattgtggcagcctagtgaatgggcagccgcagtat gtcatgcaagtttcagccaaggacgggcagctgctgtcaacagtagtgcggactcttgcc acccagagccccttcaatgaccggccgatgtgcaggatctgccacgagggcagcagccaa gaggacttgctctctccatgtgaatgtacagggaccttggggacaattcatcggagctgc ctggagcactggctgtcatcctcaaacaccagctactgtgaactctgccacttcaggttt gcagtcgagcgcaaacccaggccgttagtggaggtcaaaggaaaactaacaaacagaaag gacatccacaccaaaaacccatctgtacatcaccatcatcaaagaccaaaagtagataaa accacaaacttggggaagaaacacagcagaaaaactggaaactctaaaaagcagagcacc tctcctcctccaaaggaacgcaggtcctcaccagcaacagaacaaagctggatggagaat gactttgaggagctgagagaagaaggcttcagacgatggaattactctgagctacaggag gacattcaaaccaaaggcaaagaagttgaaaactttgaaaaaactttagaagaatgtata acaggaataaccaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggct cgagagcaacatgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagg ataccattgatggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaa agaataaaaagaaacgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaat ctatgtctgattggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacatt ctgcaggatattatccaggagaacctccccaatctagcaaggcaggccaacattcagatt caggaaatacagagaacaccacaaagatactcctcgagaagagcaactccaagacacata attgtcagattcaccaaagttgcaatgaaggaaaaaatcttaagggcagccagagagaaa gggcgggttaccctcaaagagaagcccattagactaacagcagatctgtcggcaggaact ctacaagccagaagagggtgggggccaatattcaacattcttaaagaaaagaattttcaa cccagaatttcatatccagccaaactaaccttcataagtgaaggagaaatagaatacttt acagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctg aaggaagtgctaaatatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaa atgtaa