GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:55:36 Sequence gi568815583f:32630691_32831242 : 200552 bp : 40.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3243 3351 109 0 1 93 86 89 0.996 8.57 1.02 Intr + 5087 5225 139 1 1 105 83 137 0.999 14.02 1.03 Term + 5567 7155 1589 0 2 58 32 903 0.966 71.10 1.04 PlyA + 7186 7191 6 1.05 2.00 Prom + 7823 7862 40 -9.25 2.01 Init + 12903 13128 226 2 1 62 63 232 0.705 14.78 2.02 Intr + 21453 21655 203 1 2 -23 103 86 0.002 -2.92 2.03 Intr + 28624 28728 105 0 0 93 73 88 0.861 7.29 2.04 Term + 32961 33290 330 2 0 63 37 158 0.495 1.87 2.05 PlyA + 33994 33999 6 1.05 3.00 Prom + 34036 34075 40 -2.05 3.01 Init + 35588 35664 77 1 2 87 92 19 0.820 2.81 3.02 Intr + 39415 39624 210 1 0 48 48 115 0.055 0.61 3.03 Intr + 41208 41379 172 0 1 104 38 82 0.319 3.82 3.04 Intr + 42330 42451 122 2 2 49 94 58 0.303 0.87 3.05 Intr + 47014 47171 158 1 2 64 76 77 0.751 2.83 3.06 Intr + 49076 49225 150 0 0 112 92 163 0.993 18.31 3.07 Intr + 53867 53979 113 0 2 70 106 108 0.943 9.98 3.08 Intr + 55572 55720 149 2 2 -8 76 111 0.368 -1.59 3.09 Intr + 60279 60484 206 0 2 105 20 71 0.165 -0.08 3.10 Intr + 61020 61073 54 1 0 104 54 96 0.096 5.83 3.11 Term + 75414 75553 140 1 2 99 37 122 0.487 5.34 3.12 PlyA + 75805 75810 6 1.05 4.08 PlyA - 76079 76074 6 1.05 4.07 Term - 86911 86690 222 1 0 7 47 267 0.020 10.43 4.06 Intr - 87235 87193 43 0 1 42 41 47 0.017 -7.28 4.05 Intr - 87464 87336 129 1 0 34 38 222 0.045 10.69 4.04 Intr - 87674 87488 187 0 1 118 73 50 0.489 4.33 4.03 Intr - 88088 87729 360 0 0 44 -2 308 0.182 11.67 4.02 Intr - 88459 88301 159 2 0 40 45 116 0.368 1.54 4.01 Init - 95023 94459 565 1 1 71 15 201 0.157 6.35 4.00 Prom - 98734 98695 40 -7.05 5.00 Prom + 99086 99125 40 -8.65 5.01 Sngl + 100001 100555 555 1 0 20 46 733 0.991 56.17 5.02 PlyA + 101821 101826 6 1.05 6.08 PlyA - 102114 102109 6 1.05 6.07 Term - 106351 106144 208 0 1 76 41 117 0.251 1.63 6.06 Intr - 123872 123801 72 1 0 84 67 106 0.165 5.70 6.05 Intr - 130343 130090 254 2 2 48 61 146 0.059 3.21 6.04 Intr - 146229 146145 85 2 1 122 85 60 0.652 8.00 6.03 Intr - 168263 168114 150 1 0 88 116 61 0.799 7.26 6.02 Intr - 173642 173591 52 0 1 60 121 47 0.892 2.25 6.01 Init - 176156 175949 208 2 1 54 80 169 0.933 11.73 6.00 Prom - 181653 181614 40 -3.85 7.02 PlyA - 181836 181831 6 1.05 7.01 Term - 195712 195507 206 2 2 84 44 105 0.414 2.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 25588 25416 173 2 2 120 38 145 0.915 9.71 S.002 Init - 26173 26035 139 1 1 89 -33 120 0.806 0.35 S.003 Intr - 86969 86631 339 0 0 118 76 207 0.923 17.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_1|612_aa XVESGKAGCFSPKISHKEKVRRSLRLKFNLGKNGREVNGCSGVNRYESVGWRLANQQSLK NRIESVKTGLLFSPDVDEKLPKKGSEKISKSEETLLTPERLVGTNYRMSWTGPNNSSFQE VDANEASSMVENLEVENSLEPDIMVEKSPATSCELTPSNLNNKHNSNITSSPLSGDENNM TKETLVKVQKAFSESGSNLHALMNQRQSSVTNVGKVKLTEPSYLEDSPEENLFETNDLTI VESKEKYEHHTGKGEKCFSERDFSPLQTQTFNRETTIKCYSTQMKMEHEKDIHSNMPKDY LSKQEFSSDEEIKKQQSPKDKLNNKLKENENMMEGNLPKCAAHSKDEARSSFSQQSTCVV TNLSKPRPMRIAKQQSLETCEKTVSESSQMTEHRKVSDHIQWFNKLSLNEPNRIKVKSPL KFQRTPVRQSVRRINSLLEYSRQPTGHKLASLGDTASPLVKSVSCDGALSSCIESASKDS SVSCIKSGPKEQKSMSCEESNIGAISKSSMELPSKSFLKMRKHPDSVNASLRSTTVYKQK ILSDGQVKVPLDDLTNHDIVKPVVNNNMGISSGINNRVLRRPSERGRAWYKGSPKHPIGK TQLLPTSKPVDL >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_1|1839_bp nnagtggaatcaggaaaagcaggctgcttttctcctaaaatcagccataaagaaaaggtt cgaagatctctgcgtttgaaattcaatctagggaaaaatggcagagaagtaaatggatgt tctggtgtcaatagatatgaaagtgttggttggcgacttgcaaatcaacaaagtttaaaa aatcgaattgaatctgtaaaaacaggtttgctttttagcccagatgttgatgaaaagtta ccaaagaaaggttcagaaaagatcagtaagtctgaggaaaccttactaactccagagcga ctagttggaacaaattaccggatgtcttggacaggacctaataattcaagttttcaagaa gtagatgcaaatgaagcttcttcaatggtggaaaatcttgaggtagaaaactctttggag cctgatattatggtagaaaagtcacctgctacttcatgtgaactcaccccttccaattta aacaataagcataatagcaacataacaagtagccctcttagcggggatgaaaataacatg accaaagagactttggtgaaagttcaaaaagcgttttctgaatctggaagtaatcttcac gcattgatgaatcagaggcagtcatcagtaactaatgtggggaaagtaaaattaactgaa ccatcttatttagaagatagcccagaggaaaatctatttgaaactaatgatttgactata gtagaatcaaaggagaaatatgaacaccacactggtaaaggtgaaaaatgtttttcagag agggacttttcaccccttcaaactcaaacatttaatagagaaacaactataaaatgttat tcaactcagatgaagatggaacatgaaaaagacattcattcaaatatgccaaaagattat ttaagcaagcaagaattctccagtgatgaagaaataaagaaacagcagtccccaaaggat aaactaaataataaattaaaagagaatgagaatatgatggaaggtaacttaccgaagtgt gcagcacatagcaaggacgaggctagatcctctttctcacagcagagtacatgtgttgta acaaacttgtcaaaacctaggcctatgagaattgctaaacagcagtcattggaaacatgt gagaaaacagtttctgaaagttcacaaatgacagaacatagaaaggtttctgatcacata cagtggtttaacaagctttctttaaatgaaccaaatagaataaaagtcaagtcacctctt aagtttcagcgtactcctgttcgtcagtccgtcagaagaattaattctttgttggagtat agcagacaacctacagggcataagttggcgagtcttggtgatacagcttctcctttggtc aaatcagtgagctgtgacggtgctctttcctcttgtatagaaagtgcatcaaaagattcc tctgtttcatgtatcaaatcaggtcctaaagaacagaagtccatgtcatgtgaagagtca aatattggtgcaatttcaaagtcaagcatggagttaccctcgaaatctttcttaaagatg aggaagcacccagattcagtgaatgcttctcttaggtctactacagtttataaacagaag atcttatctgatggccaagttaaggttcccttggatgatctgactaatcatgatatagta aaaccagttgtaaataacaacatgggcatttcttctgggataaataacagggtccttagg agaccatcagaaagaggaagggcctggtacaaaggttctccaaaacatcctatcggaaaa actcaattactaccaacaagtaaacctgtagatttgtaa >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_2|287_aa MVSRMVSTMLSGLLFWLASGWTPAFAYSPRTPDRVSEADIQRLLHGVMEQLGIARPRVEY PAHQAMNLVGPQSIEGSAERSHEIVKTTGCQVFTGGSNTEVTALWGLWAEPALGSSGGPP YLQGIEGINKEEMQRINVFPECQVIVHLDIFDYKLPECQTRTSHGKLDDPRCPVGGSAEF KYGSSRESELLGIFFSATSDSNDCLAVSSGTQVLNRGMRRKWHFPADGLYQLQLQVQCLL WERSHSLVIRHIYTATLVYVRLSSLTRHDNSAYLKAIKPMILALFTV >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_2|864_bp atggtctccaggatggtctctaccatgctatctggcctactgttttggctggcatctgga tggactccagcatttgcttacagcccccggacccctgaccgggtctcagaagcagatatc cagaggctgcttcatggtgttatggagcaattgggcattgccaggccccgagtggaatat ccagctcaccaggccatgaatcttgtgggcccccagagcattgaaggatcagcagagagg tcacatgaaatagtaaaaaccactgggtgtcaagtgttcactgggggtagcaacacagaa gttactgctttgtgggggctttgggcagagccggctctggggagtagtgggggcccacca tacctgcagggcattgaggggataaacaaggaagaaatgcaacgtataaatgtgttccca gaatgtcaggtgattgttcacttggacatatttgattataagctcccagagtgtcagaca cggacttctcatggcaagttagacgatcccaggtgcccagtgggtggcagtgcagaattc aaatatggtagttccagggagtctgagctgcttgggatttttttcagtgccaccagtgat tctaatgattgtctagctgtaagcagtgggactcaagttttgaacagaggcatgagaaga aaatggcatttccctgcagatgggctgtaccagctgcagcttcaagtccagtgcctcttg tgggaaaggagccactcccttgtcattcgccacatctacactgcaacgctggtttatgtc agactatcatctttgactcggcatgacaacagtgcatatttgaaggccatcaaaccaatg atattagccttgtttacggtttag >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_3|516_aa MPALIKLLVKELQKKHIYWAFSHWLRSLSFLDPSMRGKSRRRKRRNENKCLSRQNIRMGF ILLICQQAQPNLDLSYKDRIVPARVSALAGYVGQSLLGSLAASPPSLLVSSGALALAPSA LSESEGKPKEARRCLECPIRLLQAPRPLTKGKLPQPGEPLHRGGTSSGSPTDIAYTMRTH VQGRCLPLIPTELAGTVSVGLLCPHGVGKYEILLRNDLLWSWKRKLHEPYRMLRLQGKKV QKLTYIGGAHEGLQHLGPFGNIPNIVAELTGDNIPKDFSEDQGYPDPPNPCPVGKTADDG CLENTPDTAEFSREFQLHQHLFDPEHDYPGLGKWETEIIKAFYKRSYFDGILKDEVKGEE TNPRQEENMGSHFSYDDCEHREEGLSVLSFIISIEGKSVEHVSDILELIEAHYVFEIQQC VETLTQTVVTEPCPPTQHNFPFCQEVFWQFLKEQETPLREDEGRRETKAEVTLIHSEFGE TLHTIATRNGVRMGEYKSPGGKVVEHRDSTALREAC >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_3|1551_bp atgcctgctcttataaagcttctagtaaaggaactacaaaagaaacacatctactgggca ttctcacactggctgaggagcctgagttttttagatcctagtatgagggggaaaagcaga agaagaaagagaaggaatgaaaacaaatgcctttcaagacaaaatatccgaatgggtttc attttgcttatctgccaacaggctcaacccaatttagatctctcctacaaggacaggata gtgccagccagggtgtctgcccttgccggctatgtggggcagtccctcctggggagcctt gccgcctctccaccttctctcctggtctcctcgggcgctctcgcgctggcaccttcagct ctgtcagagtctgaggggaaaccaaaggaggccaggcggtgtctggaatgccccatccga ctgcttcaggctcccagaccgctgacaaaaggaaaactgccccagccaggagagccgttg catcgtggtgggacatcatcaggctctccaacggacatcgcttatactatgagaactcac gttcaagggaggtgtttacctctaattcccacagaactagcgggaactgtgagtgtgggg ctcctctgtcctcatggagttggtaagtatgaaatccttctaagaaatgatctgctgtgg tcctggaaaaggaagcttcatgagccttatagaatgttaaggctacaaggaaagaaagtc cagaaattaacatacattggtggagctcatgaaggacttcagcatttgggtccttttggc aacatccccaacatcgtggcagagttgactggagacaacattcctaaggactttagtgag gatcaggggtacccagaccctccaaatccctgtcctgttggaaaaacagcagatgatgga tgtctagaaaacacccctgacactgcagagttcagtcgagagttccagttgcaccagcat ctctttgatccggaacatgactatccaggcttgggcaagtgggagactgaaataataaaa gccttctataaaagaagttattttgatggcatattaaaggatgaagtaaaaggagaagag actaaccctagacaggaagaaaatatggggagtcatttcagttatgatgactgtgaacac cgtgaagaagggttaagtgttctgtcttttataatctcaatagaagggaaatctgttgag catgtgtctgatattctagagctcattgaagcccactatgtgtttgaaatacagcaatgt gtggaaacacttacacaaactgtggtaacagagccctgccccccgacccaacataacttt cctttctgccaagaagttttttggcagtttttaaaagaacaagaaactcctttacgagaa gatgaagggaggagagagacgaaagcggaggtgactctgatacacagtgaatttggggaa acattgcacacaattgcaaccagaaatggtgtgagaatgggtgaatacaaaagccccgga gggaaggttgtagaacacagggactccactgctctacgagaagcctgctaa >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_4|554_aa MGKDFMTKTPKAMATKAKTDKLDLIKLNSFCMAKETTIRVNRQPTEWEKIFAVYPSDKGL ISRIYQELKQIYKKKANNPIKKWAKDMNRHFSKEDIYGASKHMRKSSSSVVIREMQIKTT MRYHFTPVRMAVIKESGNNRCWRGCGEIGMLLHCWWECKLVQPLWKTVWQFLKDLEPEIP FDPGILLLAVEGPAEPARQRERTKDAQLVAAANPPQPPVPSPQELVRRCKARLGPLEHSL RAVGLPTVRFSAVAEHSGSALASLHLTSGIGKCTHHGFSGCSNPRTPMGAPRRPQTQRGA PSTWDPLKATRRAVSGRLAGPDAPALWRPGLRALAKRARLPGAAATSTRSVFGGLREVET GYPEQRGHGVRGLSRKDPAMVLHNSPPRTTHPSLRARRAAARGTERVLGWFAALFPPPTH IPAGAAGPRGLRFRRVALSAAAARAAGARRGVSGTESDAAAVHWVARRRRRRQTRYPPPF PRRLRWEERDLRGIVPDSAILPLWRERDCGMLWSSGRRGSPAAPPEAPNFEGADPGAPVN LAGSLAASRSLSPT >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_4|1665_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaactgac aaattggatctaattaaactaaacagcttctgcatggcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcagtctatccatctgacaaagggcta atatccagaatctaccaggaactcaaacaaatttacaagaaaaaagcaaacaaccccatc aaaaagtgggcaaaggatatgaacagacatttctcaaaagaagacatttatggggccagc aaacatatgagaaaaagctcatcatcggtggtcattagagaaatgcaaatcaaaaccaca atgagataccatttcacgccagttagaatggccgtcattaaagagtcaggaaacaacaga tgttggagaggatgtggagaaataggaatgcttttacactgttggtgggagtgtaaatta gttcaaccgttgtggaagacagtgtggcaattcctcaaggatctagaaccagaaatacca tttgacccaggaatcctattactggctgtggaggggccagcggagccagccaggcagcga gagagaacgaaagacgcccagctcgttgcagccgccaacccgcctcagccgccagtccct agtcctcaggagctagtgaggcgctgcaaggcgaggctggggcctctagaacacagttta agggctgtcggattacccaccgttcggttttccgctgtggctgaacacagcggctctgcc cttgccagtctccatctcacctcggggatcggcaaatgcacacatcatggattctccgga tgctccaacccgcggactccaatgggcgccccaagacgcccccagacccagcgcggcgcc ccaagcacatgggaccctctcaaggcaactcgccgcgcagtcagcggccgactagcaggt ccggatgctcctgcgctctggcggcctggacttcgcgccctcgctaagcgggcgcgtctg ccaggcgctgctgccaccagcaccaggagcgtgttcgggggcctccgggaggtggaaacg ggatacccagagcagcgaggtcacggtgtccgcggcctcagcaggaaggaccccgcgatg gttcttcacaattcacccccgcgaacaacgcacccgagcctgcgcgcacggagggctgcc gcgcgtggaactgagcgggtcctgggttggtttgcggccctcttccctccgcccactcac atccctgccggtgcggcgggtcctcggggcctgcgctttcgacgcgtggcgctgagtgcg gccgcggccagagccgccggggctcggcgcggggtcagcgggaccgagagtgacgcggcg gccgtgcactgggtcgcccggcgccgccgccgccgccagacccgctatccgccccctttc ccgcggagactgcgctgggaagagcgagatctccgtgggatcgtgccagattcggccatt cttcccctctggagggaacgggactgcgggatgctctggtcgagcggccgtcgtggctcc ccggcggccccgcccgaggcccccaacttcgagggcgctgaccccggcgcccccgttaac ctggcgggctcgctggccgccagcaggagcctgtcgccaacatga >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_5|184_aa MSRTAYTVGALLLLLGTLLPAAEGKKKGSQGAIPPPDKAQHNDSEQTQSPQQPGSRNRGR GQGRGTAMPGEEVLESSQEALHVTERKYLKRDWCKTQPLKQTIHEEGCNSRTIINRFCYG QCNSFYIPRHIRKEEGSFQSCSFCKPKKFTTMMVTLNCPELQPPTKKKRVTRVKQCRCIS IDLD >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_5|555_bp atgagccgcacagcctacacggtgggagccctgcttctcctcttggggaccctgctgccg gctgctgaagggaaaaagaaagggtcccaaggtgccatccccccgccagacaaggcccag cacaatgactcagagcagactcagtcgccccagcagcctggctccaggaaccgggggcgg ggccaagggcggggcactgccatgcccggggaggaggtgctggagtccagccaagaggcc ctgcatgtgacggagcgcaaatacctgaagcgagactggtgcaaaacccagccgcttaag cagaccatccacgaggaaggctgcaacagtcgcaccatcatcaaccgcttctgttacggc cagtgcaactctttctacatccccaggcacatccggaaggaggaaggttcctttcagtcc tgctccttctgcaagcccaagaaattcactaccatgatggtcacactcaactgccctgaa ctacagccacctaccaagaagaagagagtcacacgtgtgaagcagtgtcgttgcatatcc atcgatttggattaa >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_6|342_aa MVFYLRKLEKEILLLRPQCSEGAHRADCGQGVPERRNSKCKDPPMCTLSALKPEAAASEA GKEGVWERGAKKEHKMEESHLENAQKSFETTVRYFGMKPKSGEKEITPSYVFMVWYEFCS DFKTIWKRESKNISKERLKMAQESVSKLTSEKKVETKKINPTASLGVVLKYQGHLAMSRD IFDSQDWEGMLLASSGERPGKLLNFLQGIGHPFHNKQLSDRNFNSPEIRRYQMASALKCL EQQQHNPFWRLYKKHGTSTCIWREPQAVSTYGGRCGGHLHKQDEMSPKFGLDAETDDVTH TPREYETFITHMRLFVESSSRLLTWSENDLREPGKETGLGFL >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_6|1029_bp atggtcttttatttacggaagctggagaaggagatacttctgctgagacctcaatgcagt gagggagcccatcgtgcagactgtggtcagggtgtcccagagaggaggaacagcaagtgc aaagatcctccaatgtgcacgctcagtgcgctcaagccagaggcagctgccagcgaagct ggaaaggaaggggtgtgggaaagaggagccaaaaaagagcataagatggaagaaagtcac ttggagaatgcacagaaaagttttgaaacaacagtacgatattttgggatgaagccaaag tctggtgagaaggagatcacacccagctacgtgtttatggtgtggtatgagttctgcagt gacttcaagacaatttggaaacgggagagtaaaaacatatctaaagaaagattgaaaatg gctcaggaatcagtcagcaagttgacttcagagaagaaagtggagacaaagaaaatcaat cccactgctagcctgggagtggttctcaagtaccagggacatttggcaatgtctagagac atctttgatagtcaggactgggaagggatgctgttagcatctagtggggagaggccaggg aaactcctaaacttcctacaaggcataggacatcccttccacaataaacaattatcagac agaaatttcaatagtcctgaaataaggcgataccagatggctagtgctctcaagtgcctg gaacaacagcagcacaatcctttctggaggctgtacaagaagcatggcaccagcacctgc atctggcgagagcctcaggctgtttccacttatggtggaaggtgtggtggacatctgcat aaacaagatgaaatgtcaccaaaatttggtttagatgctgagactgatgatgttacacac accccaagagagtatgaaacatttattactcacatgaggctttttgtggagagcagctca cgtctcctaacctggtctgaaaatgacttgagagagccaggaaaggagactggtttgggc tttttatga >gi568815583f:32630691_32831242|GENSCAN_predicted_peptide_7|68_aa XQRFSPREFDLSVFQHLHCWPVASQAALSRALESLPLSLESCFSMGCGTLDLTCLPTTLV KEAAIRPS >gi568815583f:32630691_32831242|GENSCAN_predicted_CDS_7|207_bp nggcagcgtttttcccccagggaattcgatctcagtgtctttcaacaccttcactgttgg cctgtagcctctcaggctgcgctaagcagggccctggagagtcttccactctccctggag agctgcttctctatgggctgtgggacacttgacctcacgtgtctccccaccacccttgtg aaagaagcagccattaggccctcttga