GENSCAN 1.0	Date run: 5-Nov-116	Time: 07:01:06

Sequence gi568815594r:89147783_89350110 : 202328 bp : 38.35% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +    425   1108  684  1  0   88   47   249 0.810  16.74
 1.02 PlyA +   2076   2081    6                               1.05

 2.00 Prom +   9738   9777   40                              -5.75
 2.01 Init +  27102  27265  164  2  2   50   54   128 0.088   4.85
 2.02 Intr +  27452  27513   62  2  2   73   50    66 0.057  -1.24
 2.03 Intr +  27939  28120  182  1  2   61   74    72 0.021   1.77
 2.04 Term +  38336  38401   66  1  0   92   39   102 0.057   2.66
 2.05 PlyA +  38476  38481    6                               1.05

 3.02 PlyA -  38582  38577    6                              -0.45
 3.01 Sngl -  39668  39333  336  2  0   71   44   166 0.894   6.28
 3.00 Prom -  51496  51457   40                              -3.85

 4.00 Prom +  54020  54059   40                              -3.75
 4.01 Sngl +  64960  65232  273  0  0   96   41   135 0.837   4.68
 4.02 PlyA +  66037  66042    6                               1.05

 5.00 Prom +  70266  70305   40                              -4.65
 5.01 Init +  71972  72065   94  1  1   62   97    67 0.669   5.59
 5.02 Intr +  87138  87244  107  1  2  110   37    12 0.135  -2.69
 5.03 Intr +  90481  90510   30  0  0  117   83    34 0.338   3.21
 5.04 Term +  95627  95731  105  1  0   72   38    94 0.170   0.23
 5.05 PlyA +  95998  96003    6                               1.05

 6.02 PlyA -  96514  96509    6                               1.05
 6.01 Sngl - 102328  99998 2331  1  0   40   48  2115 0.816 195.15
 6.00 Prom - 107624 107585   40                              -7.35

 7.00 Prom + 113142 113181   40                              -3.85
 7.01 Init + 114728 114816   89  1  2   86   60   118 0.995   8.96
 7.02 Intr + 116517 116630  114  0  0  114   74    69 0.122   6.74
 7.03 Term + 141906 142035  130  0  1  107   44    60 0.008   0.17
 7.04 PlyA + 142997 143002    6                               1.05

 8.10 PlyA - 143739 143734    6                               1.05
 8.09 Term - 145594 145469  126  1  0   76   42    86 0.690   0.10
 8.08 Intr - 145825 145669  157  0  1   50   78   126 0.502   6.89
 8.07 Intr - 155658 155520  139  1  1   86   56    19 0.201  -2.90
 8.06 Intr - 159148 158917  232  1  1   57   64   170 0.058   7.92
 8.05 Intr - 159675 159604   72  0  0   91   94    14 0.025   0.98
 8.04 Intr - 165593 165494  100  1  1  104   11    38 0.011  -3.11
 8.03 Intr - 173155 172990  166  2  1   57   84   100 0.181   4.60
 8.02 Intr - 200575 200493   83  0  2   55  100    58 0.291   2.06
 8.01 Init - 201806 201388  419  2  2   71   53   179 0.398   8.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 116517 116634  118  0  1  114   48    74 0.869   3.03

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_1|227_aa
MSSSSGLQNWHPSPRLQAYPSLKVDFTGDPPPSTQEHVCLPSHPWGPGCWHQGATAGQCP
AILSAPSASLPMFLHAQSLKRTEASGGWHVSTALNVCTPIWAVTTSRLGPNPTWRCPRSA
GIPGFANGLQLLAGRRGGGQLQLCRAWLLLILSFQEHRYAQLQLQLGQLQLHWGSSLSAN
LEEAGPLLVPGSHQDLWSMAPSRNPFQPGMGRQVLTGPELVSGAGQC

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_1|684_bp
atgagttcctcctctggtctgcagaactggcatcccagccccaggcttcaggcctatcct
agcctgaaggtagacttcactggggacccacctccttccacccaggagcatgtctgtctc
cccagtcatccatggggcccaggctgctggcaccaaggggcaactgcaggccagtgccca
gccatcctcagtgccccctcagcttcccttcccatgttcctacatgcccaaagtctgaag
aggactgaggcatcagggggctggcatgtcagcactgccctgaacgtgtgcacacccatc
tgggctgtgacaacatccaggcttggccccaaccctacttggagatgcccaaggagtgcg
ggtatccctgggtttgccaatgggctgcagctgcttgcagggaggcggggtggggggcag
ctgcagctatgcagggcttggctcctgcttattctcagcttccaagagcacaggtatgcc
caattgcagctgcagcttgggcagctgcagttgcactgggggagctccctctctgccaac
ttggaagaggcagggcccctgcttgtccccgggtcccaccaagatctatggagcatggca
cctagccgcaacccctttcagcctgggatggggcgccaggtcctcactgggcctgagctg
gtatctggggcagggcaatgttga

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_2|157_aa
MTFRGHSTGKGKKASDGPEYNFLNTLAVSPPKLRRALRMRAAHSKGRIMGKRRKIQILLR
RQELRLLQTSMDSPPGLETSPHREKEQVASHTYYPHLVKNPGSRYRRTWQMAQKGFGIGS
QQAQNHKGYTEEVETQPTQCEDNKDEDLYDGPLLLNE

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_2|474_bp
atgaccttccggggccactctactggaaaagggaagaaagcctcagatgggcctgagtac
aacttcctgaacacactggcagtctcacctcccaagctaaggagggcactgcgcatgcgg
gcagcccactctaagggaagaatcatgggaaagaggcgcaaaattcaaattcttctgaga
aggcaagaactgaggttgctgcagaccagtatggattcgccccccggattagaaacatct
ccacacagagaaaaggaacaggtggcttcccatacctactatccccacctagtcaagaac
cctgggtcaaggtacagaagaacatggcagatggcccagaaaggatttggaatagggagc
caacaagctcagaatcacaaaggatacacagaggaagtagagactcagcctactcaatgt
gaagacaacaaggatgaagacctttatgatggtccacttctacttaatgaatag

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_3|111_aa
MGKDFMTKISKAIATKAKTDKWDLIKLKSFCTARVTIIRVNRQPTEWEKIFAIYPSDKGL
ISRIYKERKQIYKKKTNNPIKKWMKDMNRHLSKEDIYAANEHEKKLIITGH

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_3|336_bp
atgggcaaagacttcatgactaaaatatcaaaagcaattgcaacgaaagccaaaactgac
aaatgggatctaattaaactaaagagcttctgcactgcaagagtaactatcatcagagtg
aacaggcaacctacagaatgggagaaaatttttgcaatctatccatctgacaaaggtcta
atatctagaatctacaaggaaagaaaacaaatttacaagaaaaaaacaaacaaccccatc
aaaaagtggatgaaggatatgaacagacacttgtcaaaagaagacatttatgcagccaac
gaacatgaaaaaaagctcatcatcactggtcattag

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_4|90_aa
MALGELNLFSAKEIPKAAYSRRLSQQPGGMSAVQHNIHFNYPFNNLAFRSFSNAIHPITS
KHSLALCQCRDLLYIRNLPVHVLPQLLWLS

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_4|273_bp
atggccttgggagagctgaatctcttttcagccaaggaaattccaaaggcagcttacagc
cgaaggctatctcagcagccggggggaatgtcggcagtgcagcacaacatccacttcaac
tatccattcaacaatttagcctttaggtctttttcaaatgccattcatccaataacctca
aaacattccttggcactttgtcaatgcagggacctgctttatattagaaatcttcctgtc
catgtcctcccacaactgctgtggctgagctga

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_5|111_aa
MRVCSLGVIGYTRGPQLLDHRPVQVLGRTAVEIPFLPNNLVEDQGGRGRKRVWKGHTFLS
AALCLWQILDELKIFKEVAAADPNLERINQKGPRTTSGSIANRKQNAQIHM

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_5|336_bp
atgcgggtttgtagcctaggagtaataggctataccaggggtcctcaactcctggaccat
agaccagtacaagtactgggccgcacagcagtcgagattccatttctgccaaataatctg
gttgaagaccagggaggaagaggaagaaagagggtctggaaaggtcacactttcctgagt
gctgcactctgcctgtggcagatccttgatgaattgaagatattcaaggaggtggcagca
gctgacccgaatttggagagaatcaaccaaaaagggccacgcacaacatcaggaagcatt
gctaacagaaaacaaaatgctcagatacatatgtaa

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_6|776_aa
MGTVPDPLRSAKTSLIAASGKEDDLGEPQAASPRHRPALLCKNANGFSGAPAEPDLSPRA
AAEALMQVCEHETTQPDMSSPGVFNEVQKAPATFNSPGNPQLPGSSQPAASAPSSAAGRD
LIHTPLTMPANQHTCQSIPGDQPNAITSSMPEDSLMRSQRTSNREQPEKPSCPVGGVLSS
SKDQVSCEFPSPETIQGTVQTPVTAARVVSHSSSPVGGPEGERQGAICDSEMRSCKPLTR
ESGCSENKQPSVTASGPQGTTSVTPQPTPLTSEPSACPPGPEKVPLPAQRQMSRFKEAST
MTNQAESEIKEVPSRAWQDAEVQAVASVESRSVSTSPSILTAFLKESRAPEHFEQEQLRV
ICHSSGSHTLELSDSTLAPQESSQCPGIMPQVHIQAAAAESTAFQRENKLASLPGGVLKT
SSINLVSSNAQHTCKEDGRLAGMTPVREESTAKKLAGTNSSSLKATAIDQISISACSQAE
TSYGLGKFETRPSEFAEKTTNGHKTDPDCKLSDSCGSISKADHSGSLDPTNKGDAREKKP
ASPQVVKEKESTGTDTSDAKTLLLNPKSQESGGTESAANPTPSPIRKNQESTLEENRQTK
TATSLSLPSDPMGDSSPGSGKKTPSRSVKASPRRPSRVSEFLKEQKLNVTAAAAQVGLTP
GDKKKQLGADSKLQLKQSKRVRDVVWDEQGMTWEVYGASLDAESLGIAIQNHLQRQIREH
EKLIKTQNSQTRRSISSDTSSNKKLRGRQHSVFQSMLQNFRRPNCCVRPAPSSVLD

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_6|2331_bp
atggggactgtacctgaccctctgagatcagctaaaacttccctgattgcagcttccgga
aaagaagacgatctaggagagccacaggctgcctcacctcggcatcgaccagctctcctg
tgtaagaatgccaatggcttttcaggtgcccctgcagaaccagacctcagccccagggca
gctgccgaagccctgatgcaggtttgtgagcatgagaccacccaaccagatatgtcttct
cctggtgtgttcaatgaagtgcagaaagcacctgccacattcaactctcccggcaatccc
cagctgccagggagcagccagcccgcagcatcagccccgagttctgcagcaggaagggat
cttatacacacaccattgacaatgcccgccaatcagcacacctgccagtccatcccaggt
gatcagcccaatgccatcacctcatccatgcctgaagattccctgatgagatcacagaga
acctcaaatagagagcaacctgagaaaccaagttgtcctgtgggaggcgtcctcagtagc
agcaaagatcaggtgtcctgtgagtttccttctccagaaacaatccagggaacagtgcag
actccagtgacagcagccagggtggtcagtcactcatcctctcctgtaggtggacctgaa
ggggaaaggcagggagccatctgtgactctgaaatgaggtcctgtaaacctctaactaga
gaatctggatgttcagagaacaagcagccctctgtcactgcctcgggcccccaaggcaca
acttctgtgacacctcaaccaacccccctcactagcgaaccttcggcatgtcccccaggt
ccagagaaggtgccgctgccagcacagcgtcagatgtcaaggttcaaagaagccagtacg
atgaccaaccaagctgaaagtgaaatcaaggaagttcccagcagggcttggcaagatgcg
gaggtgcaggcagtggcgagtgtcgagagcagatccgtctccaccagccccagtatcctc
actgcatttctgaaggaaagccgtgctcctgagcattttgaacaagagcagctgcgtgtc
atttgccacagcagtgggagccacacactggagctctctgacagcacgctagccccccag
gagtccagccagtgccctggcatcatgccacaggtgcacattcaggcagctgcagctgag
tctacagctttccaacgggaaaataaacttgcgagcctaccaggtggggtccttaaaacc
tcatcaatcaatttggtctccagtaatgcccagcatacgtgtaaagaagatgggaggtta
gcaggaatgactccagtgagggaagagtcaactgctaaaaagctcgcaggtactaattct
agctccctgaaagctaccgccattgaccagatttctatcagtgcatgcagtcaagctgaa
acaagttatggattggggaaatttgaaaccaggccatctgagtttgcagagaaaacgaca
aacggccacaaaacagacccagattgcaaactatctgactcttgtggctctatcagcaaa
gctgatcattctgggagcttggatcccactaataaaggagatgcaagggaaaagaagcct
gcatctcctcaggtagtaaaagaaaaagagtctactggcactgatacctcggatgccaaa
accctactgctcaatcctaaatcccaagaaagtggaggcacagaatcagctgctaatcct
acaccctccccaattaggaagaaccaggagagcaccttagaagaaaacagacagaccaag
acagccaccagcctgagcctgccatctgatcccatgggtgactccagcccaggttctggc
aagaagaccccatctcgctccgtcaaagccagcccacgcaggcccagccgcgtcagcgag
ttcctcaaggagcaaaagttaaatgtgacagcagctgctgctcaggtaggactcactcca
ggagataagaaaaagcagcttggcgcagactccaagctccagctgaaacagtccaagcgt
gtcagggacgtcgtgtgggatgagcagggaatgacctgggaagtgtatggtgcatccttg
gacgcagagtccctgggaatcgcgatccagaaccatttgcaaagacaaatcagggaacat
gagaaattaatcaaaactcaaaatagccagacccggagatccatttcctcagatacttct
tcaaataagaagctcagaggaaggcagcacagtgttttccagtccatgctgcagaacttc
cgacgccccaactgctgcgtccgtcctgccccgtcttctgtgttagattga

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_7|110_aa
MHADVSVSDDGFVLVYASTPFPSALCCAGSLISAHPGSLLPSTMSGSSLKPSPEADTGPR
LFVQPAELCNSPTTACTFMHEAQLPQTQFFHSSNAAYVITNSDCLIIIVK

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_7|333_bp
atgcacgctgacgtttctgtcagtgatgatggattcgtgctggtctacgcttcaacaccc
tttccatctgctctttgctgtgcagggagtttgatctctgcacaccctggctcccttttg
ccttctaccatgagtggaagcagcctgaagccttccccagaagcagatactggtcccagg
ctttttgtacaacctgcagaactgtgtaatagtccaacaactgcctgtaccttcatgcat
gaggcacagctgcctcagacacagtttttccactcctccaatgcagcttatgtaatcact
aatagtgactgtctaataatcattgttaaatga

>gi568815594r:89147783_89350110|GENSCAN_predicted_peptide_8|497_aa
MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKEAIIRVNRQPTEWQKIFSIYPSDKRL
ISRIYKELKQIYKKKSNNPIKKWAKAINRHFSKEDIYAANRHMKKCSSSLAIREMQIKTT
MRYHLTPVRVAIIKKSGNNRRPARASNHWRHHEFLKEDISLRTARDKEASKGAGKYLLNN
FVLYMKYLQESLEMTTDSEIFGQIFRERTSHKQQLLSSVEDARTLNNTSDYEEREFQEEL
GWGTNNEFMLLWNNILEGQSFSQVMNFSARSLLSLALALKADGNDVGIISDAIMDSERGK
KGEVLNMPLGSEFWVMGRHRIQTRFQGLSGGPLGEITAWGVGSSRLRRCECPQSSRGVIY
WLGLVQLNVTVELLTLRDTPHCGVIISFLRMQLQSLAQKAEVVILNDPDLNSPKGTGTLH
CKAIRAPGPETKRGDLWGAKCPAAWLLPAMDKVIWEVTQISFVANWNEERFRKGNSGNEF
PSDLRQWYKTTIEDASP

>gi568815594r:89147783_89350110|GENSCAN_predicted_CDS_8|1494_bp
atgggcaaggatttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaagctatcatcagagtg
aacaggcaacctacagaatggcagaaaattttttcaatctacccatctgacaaaaggcta
atatccagaatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatc
aaaaagtgggcaaaggctataaacagacacttctccaaagaagacatttatgcagccaac
agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca
atgagatatcatctcacaccagttagagtggcgatcattaaaaagtcaggaaacaacagg
agacctgcccgtgcctcaaaccattggaggcatcatgagttcctgaaggaagacatatcc
ctgaggacagccagagacaaagaggcttctaaaggtgcaggtaaatatcttctaaataac
tttgtgctctacatgaaatacctacaggaaagcctagagatgactacagacagtgagata
tttggccagatatttagagaacgtacttcacacaagcaacagttactgagttcagttgag
gatgcgagaaccctgaataacacaagtgactatgaagaaagggaatttcaggaggagtta
gggtggggcacaaataacgaattcatgttgttgtggaacaatatattagaggggcagtct
ttctcgcaggtgatgaatttctctgcaaggtcattgctctctttggctctggctttgaag
gcagatggaaacgatgtaggcatcatttcagacgcaataatggacagtgaaagagggaaa
aaaggagaggtacttaatatgccactgggaagtgaattttgggtgatgggaaggcatcga
atccagaccaggtttcaagggctttctggagggccattgggagaaatcaccgcttggggt
gtaggcagttcaaggttaagacgatgtgaatgtccacaaagcagccgaggagtaatttac
tggttaggccttgttcagttgaatgttactgttgaactattgacattacgagatacacca
cattgtggcgtgattatttcatttctcagaatgcaactacagtcgttggcacagaaggcc
gaggtggtcatcctcaatgaccctgacttgaacagcccgaaagggactggcactcttcac
tgcaaagccattcgtgcacctggcccagaaaccaaaagaggagatctgtggggagccaaa
tgtccggcagcctggctgctgcctgccatggacaaggtgatctgggaggtcacacaaatt
tcctttgtagccaactggaatgaagaacgatttaggaaagggaattctgggaatgagttt
ccatcagatctaagacagtggtacaaaaccaccatagaggatgctagcccataa