GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:14:48 Sequence gi568815597r:217510320_217730971 : 220652 bp : 36.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13050 13170 121 2 1 68 74 79 0.483 4.90 1.02 Intr + 19706 19743 38 0 2 93 103 17 0.264 0.76 1.03 Intr + 43692 43802 111 2 0 33 37 115 0.041 0.56 1.04 Intr + 44847 44889 43 2 1 67 121 21 0.336 0.19 1.05 Intr + 50324 50404 81 0 0 75 70 59 0.322 1.49 1.06 Intr + 61554 61709 156 1 0 13 75 114 0.294 1.66 1.07 Term + 62647 62771 125 2 2 31 47 92 0.217 -3.03 1.08 PlyA + 64023 64028 6 1.05 2.06 PlyA - 64932 64927 6 1.05 2.05 Term - 71903 71763 141 2 0 84 48 48 0.113 -2.65 2.04 Intr - 100081 100002 80 2 2 59 66 68 0.283 0.05 2.03 Intr - 100752 100570 183 1 0 58 78 160 0.800 10.84 2.02 Intr - 103883 103822 62 1 2 98 79 48 0.980 2.36 2.01 Init - 110164 109464 701 1 2 99 57 578 0.980 50.30 2.00 Prom - 111319 111280 40 -5.85 3.00 Prom + 115925 115964 40 -9.05 3.01 Init + 116190 116198 9 2 0 65 89 0 0.162 -1.68 3.02 Term + 120582 120755 174 2 0 119 49 190 0.986 14.98 3.03 PlyA + 120875 120880 6 1.05 4.02 PlyA - 122175 122170 6 1.05 4.01 Sngl - 133105 132917 189 1 0 71 42 199 0.685 8.46 4.00 Prom - 147536 147497 40 -3.95 5.04 PlyA - 149985 149980 6 1.05 5.03 Term - 174969 174922 48 0 0 103 49 75 0.147 1.43 5.02 Intr - 182361 180859 1503 0 0 6 13 516 0.032 24.84 5.01 Init - 183788 182751 1038 2 0 44 41 495 0.041 34.43 5.00 Prom - 183881 183842 40 -6.15 6.03 PlyA - 184050 184045 6 1.05 6.02 Term - 184830 184278 553 2 1 -48 43 334 0.761 8.10 6.01 Init - 185294 184909 386 2 2 88 44 420 0.974 33.96 6.00 Prom - 195537 195498 40 -0.95 7.02 PlyA - 195654 195649 6 1.05 7.01 Sngl - 202629 202441 189 0 0 45 48 179 0.780 3.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 183788 182373 1416 2 0 44 42 572 0.859 43.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_1|224_aa MTLNEHAAFKRLFNKAQLAPPLIHLTLSGHSTCFREHRVGGYKINSDKDITFVMGPTGRF SANNSSFFGPSCPALELAPADNDEYQGYSKVLFLKTNVKSNAAKGASFTAEFLNPCMIDI WGQVNSLLWETAWATRDSEKQKDRKKDRQTKEERKKREEGRKEGKKERRKEGRKEGRKEG RKGDTLGQNPDVLQSKTLEVQEERKSDTKVHNPFIYNMLFKQIK >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_1|675_bp atgactcttaacgagcatgctgccttcaagcgtctgtttaacaaggcacaacttgcaccg cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg ggttacaaaatcaacagtgacaaagacataacctttgtgatgggaccaactggacgcttc agcgcaaacaactcatccttcttcggaccttcttgtccagctcttgagctagctccagct gataatgatgaataccaaggctacagtaaagtcctctttttaaaaaccaacgtgaaaagt aatgcagcaaaaggtgcttcatttacagcagaatttctcaacccctgcatgattgacatc tggggccaggttaattctttgctgtgggagactgcctgggccacaagagattctgagaaa cagaaagacagaaagaaagacagacagacaaaagaagaaagaaagaaaagggaggaagga aggaaggaaggaaagaaagaaagaaggaaggaaggaaggaaagaaggaaggaaggaagga aggaagggagatacgttaggacaaaatccagatgtcctgcaatccaaaacattagaagtg caagaagaaaggaagtccgacacaaaagttcataaccccttcatctacaatatgcttttt aagcaaataaaatga >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_2|388_aa MEELVHDLVSALEESSEQARGGFAETGDHSRSISCPLKRQARKRRGRKRRSYNVHHPWET GHCLSEGSDSSLEEPSKDYRENHNNNKKDHSDSDDQMLVAKRRPSSNLNNNVRGKRPLWH ESDFAVDNVGNRTLRRRRKVKRMAVDLPQDISNKRTMTQPPEGCRDQDMDSDRAYQYQEF TKNKVKKRKLKIIRQGPKIQDEGVVLESEETNQTNKDKMECEEQKVSDELMSESDSSSLS STDAGLFTNDEGRQGDDEQSDWFYEKESGGACGITGVVPWWEKEDPTELDKNVPDPVFES ILTGSFPLMSHPSRRGFQARLSRLHGMSSKNIKKSGGTPTSMLNKRCSGEWHRKRVTEVY RRKEVGESEQLHNQVIIPNPIQASCFYV >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_2|1167_bp atggaggagctggttcatgaccttgtctcagcattggaagagagctcagagcaagctcga ggtggatttgctgaaacaggagaccattctcgaagtatatcttgccctctgaaacgccag gcaaggaaaaggagagggagaaaacggaggtcgtataatgtgcatcacccgtgggagact ggtcactgcttaagtgaaggctctgattctagtttagaagaaccaagcaaggactataga gagaatcacaataataataaaaaagatcacagtgactctgatgaccaaatgttagtagca aagcgcaggccgtcatcaaacttaaataataatgttcgagggaaaagacctctatggcat gagtctgattttgctgtggacaatgttgggaatagaactctgcgcaggaggagaaaggta aaacgcatggcagtagatctcccacaggacatctctaacaaacggacaatgacccagcca cctgagggttgtagagatcaggacatggacagtgatagagcctaccagtatcaagaattt accaagaacaaagtcaaaaaaagaaagttgaaaataatcagacaaggaccaaaaatccaa gatgaaggagtagttttagaaagtgaggaaacgaaccagaccaataaggacaaaatggaa tgtgaagagcaaaaagtctcagatgagctcatgagtgaaagtgattccagcagtctcagc agcactgatgctggattgtttaccaatgatgagggaagacaaggtgatgatgaacagagt gactggttctacgaaaaggaatcaggtggagcatgtggtatcactggagttgtgccctgg tgggaaaaggaagatcctactgagctagacaaaaatgtaccagatcctgtctttgaaagt atcttaactggttcttttccccttatgtcacacccaagcagaagaggtttccaagctaga ctcagtcgccttcatggaatgtcttcaaagaatattaaaaaatctggagggactccaact tcaatgttaaataagagatgcagcggggaatggcacaggaagagggtgacagaagtgtat agacgaaaagaagtaggggagagtgaacagttgcataatcaagtaataattcccaaccca attcaggctagttgtttctatgtttga >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_3|60_aa MQMGRQPHQLFPAAGAPIGWRPAAPNINDTREPGLEAQAREQTPTTTAPATSKEQFSILR >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_3|183_bp atgcaaatgggccgccagcctcaccagctgttcccggctgctggagctccgatcggttgg cgcccggcggccccgaacattaacgacacccgggagcctggcctcgaagctcaggcccgt gaacagactccaactacaacagcaccggcgacttccaaagagcagttcagcattttgaga tga >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_4|62_aa MKKESAHEGSQYRESEPKSDEGNIQVGGEAMVQHEEVEPALGLREEAVVVTIVWLHKEDL IR >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_4|189_bp atgaaaaaagaatctgcacatgaaggtagccagtatagagaatcagagcccaagagtgat gagggaaacatccaagttggtggggaggcgatggttcagcatgaggaggtagagcccgca ctgggtctaagagaagaggcagttgtggtgacaatagtttggttacataaagaggatctg atcagataa >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_5|862_aa MGDFNTPLSTLDRSTRQTVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTVTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIKTQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN QEEAESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELKLGIDGTYFKIIRA IYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGK EEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQ IMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINI VKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITL PDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHTYNYLIFDKPEKNKQWGKDSLF NKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGK DFMSKTPKAMAAKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISR IYNELKQIYKKKTYNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRY HLTPVRMGSDKSEASKAPEVQN >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_5|2589_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagacagtcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccgtaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaataaagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactaaac caggaagaagctgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aacagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactgaaattaggtattgatgggacgtatttcaaaataataagagct atctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccct ttgaaaactggcacaagacagggatgccctctctcaccgctcctatttaacatagtgttg gaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaa gaggaagtcaaattgtccctgtttgcagacgacatgattgtttatctagaaaaccccatc gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatc gtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacacta cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcaatggaacagaacagagccctcagaaataacgccgcatacctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctgttt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaa accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaag gacttcatgtccaaaacaccaaaagcaatggcagcaaaagccaaaattgacaaatgggat ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacaacatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccaga atctacaatgaactcaaacaaatttacaagaaaaaaacatacaaccccatcaaaaagtgg gcaaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatg aagaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatat catctcacaccagttagaatgggctcagacaagagtgaagcaagtaaggcccctgaagtg cagaattga >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_6|312_aa MGKKQNRKTGNSKTQSASPPPKERSSSPAMEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKPNLRLIGVPESDAENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQR YSSRRATPRHIIVRFTKVEMKEKMLRATREKGRVTLKGKPIRLTADLSAETLQARREWGP IFNILKEKNFQPRISYPAKLSFISEGEIKSFTHKQMLRDFVTTRPALKELLKEALNMERN NRYQPLQNHAKM >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_6|939_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaatggaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaaaccaaatctacgtctgattggtgtacctgaaagt gatgcggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttc cccaacctagcaaggcaggccaacgttcagattcaggaaatacagagaacgccacaaaga tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatg aaggaaaaaatgttaagggcaaccagagagaaaggtcgggttaccctcaaagggaagccc atcagactaacagcggatctctcggcagaaaccctacaagccagaagagagtgggggcca atattcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaacta agcttcataagtgaaggagaaataaaatcctttacacacaagcaaatgctgagagatttt gtcaccaccaggcctgccctaaaagagctcctgaaggaagcgctaaacatggaaaggaac aaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815597r:217510320_217730971|GENSCAN_predicted_peptide_7|62_aa MAKVRGVNVLALTVPFMLFLALSTSTLPVGTPSECCWSLDAKTSPAEPSVTLFEPPEVDS AF >gi568815597r:217510320_217730971|GENSCAN_predicted_CDS_7|189_bp atggccaaggttcggggagtgaacgtcttggctctgactgttccattcatgctatttctt gcccttagtacttcaactctgccagtaggaacaccttccgagtgctgctggtctctggat gccaaaacatcccctgctgagccctctgtaactctatttgagccacctgaggtcgattct gctttctga