GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:59:20 Sequence gi568815581r:60347584_60625931 : 278348 bp : 42.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1357 1352 6 1.05 1.02 Term - 43888 43707 182 2 2 81 44 122 0.931 3.89 1.01 Init - 44356 44299 58 1 1 85 78 111 0.965 11.32 1.00 Prom - 60586 60547 40 -5.25 2.00 Prom + 68825 68864 40 -5.75 2.01 Init + 73801 73950 150 0 0 84 63 119 0.927 9.18 2.02 Intr + 74529 74948 420 2 0 132 32 106 0.589 2.61 2.03 Intr + 78200 78301 102 1 0 103 74 87 0.885 8.25 2.04 Intr + 78578 78753 176 1 2 112 28 177 0.985 11.92 2.05 Intr + 79135 79234 100 1 1 94 81 122 0.533 11.29 2.06 Intr + 81779 81974 196 1 1 30 90 198 0.476 12.27 2.07 Term + 83598 83683 86 1 2 121 48 70 0.493 3.14 2.08 PlyA + 83798 83803 6 1.05 3.03 PlyA - 86120 86115 6 1.05 3.02 Term - 86696 86276 421 1 1 -22 42 378 0.239 15.98 3.01 Init - 87855 87551 305 0 2 116 22 363 0.452 29.29 3.00 Prom - 96139 96100 40 -6.15 4.23 PlyA - 97398 97393 6 1.05 4.22 Term - 100251 99998 254 1 2 118 55 331 0.998 27.62 4.21 Intr - 104462 104297 166 2 1 77 76 171 0.956 13.41 4.20 Intr - 106909 106719 191 2 2 68 70 68 0.839 1.38 4.19 Intr - 108798 108713 86 2 2 53 63 54 0.899 -2.06 4.18 Intr - 113204 113080 125 2 2 62 93 168 0.957 13.16 4.17 Intr - 114332 114227 106 1 1 83 91 23 0.631 1.40 4.16 Intr - 114478 114411 68 1 2 77 90 34 0.591 -0.72 4.15 Intr - 116527 116438 90 1 0 52 94 79 0.756 4.17 4.14 Intr - 118876 118708 169 0 1 36 107 91 0.054 4.83 4.13 Intr - 131688 131565 124 1 1 97 107 -13 0.011 0.22 4.12 Intr - 136907 136729 179 1 2 34 86 114 0.531 4.44 4.11 Intr - 140174 139845 330 1 0 70 44 298 0.033 17.42 4.10 Intr - 140528 140417 112 0 1 20 47 77 0.031 -4.68 4.09 Intr - 144884 144741 144 0 0 38 47 122 0.100 2.43 4.08 Intr - 147034 146883 152 0 2 120 97 42 0.435 7.19 4.07 Intr - 152904 152816 89 0 2 80 94 72 0.539 4.85 4.06 Intr - 165593 165489 105 2 0 63 23 98 0.005 0.29 4.05 Intr - 178635 178211 425 1 2 72 91 378 0.157 29.16 4.04 Intr - 181593 181460 134 2 2 77 71 84 0.461 4.97 4.03 Intr - 209606 209527 80 2 2 115 27 67 0.010 0.73 4.02 Intr - 210621 210514 108 0 0 105 92 -8 0.014 0.86 4.01 Init - 213192 213082 111 0 0 40 105 99 0.297 6.93 4.00 Prom - 215333 215294 40 -5.25 5.04 PlyA - 215428 215423 6 1.05 5.03 Term - 218043 217849 195 0 0 42 40 186 0.628 5.63 5.02 Intr - 221018 220922 97 1 1 69 84 37 0.165 0.49 5.01 Init - 237561 237527 35 0 2 113 89 8 0.342 2.92 5.00 Prom - 241619 241580 40 -5.65 6.00 Prom + 250413 250452 40 -7.55 6.01 Init + 252832 253303 472 0 1 107 106 481 0.710 46.03 6.02 Intr + 266080 266154 75 2 0 29 45 121 0.060 0.47 6.03 Term + 267086 267360 275 0 2 -32 45 313 0.221 9.65 6.04 PlyA + 267936 267941 6 1.05 7.00 Prom + 269789 269828 40 -5.95 7.01 Init + 275955 276166 212 2 2 59 77 297 0.824 24.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 55378 55170 209 2 2 132 49 166 0.976 13.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_1|79_aa MGAKESRIGFLSYEEALRRALAGRGRGGGCKALLSATGSPFVSGESLPFPRAYACLLRAV AIRFCHPSPFERLTLIRTI >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_1|240_bp atgggtgccaaggagtcacggatcggattcctcagctacgaggaggcgctgaggagagcc ttagctggaaggggacgaggaggcggctgcaaagcgctcctctctgctactggaagcccc ttcgtatctggggagtccctgccttttcccagagcctatgcctgtctcttgagggctgtg gccatccgcttctgccacccttcaccgtttgagcggctgactttgataagaacgatctga >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_2|409_aa MPLSCARGGRWVAGEARADGGPTALSRRQAKNRATLSGDNWARAQRTRRKLSLDLCPKSE KNFTNCAFLSGSPHHGVTQQAGMAHVQENEFNSLTRWCQDQHLTVFLGEGAPASPGLLCA LPKSSLDPAHLAPFPSPDPSWRGQLQPPLRMRKWERGGGLGSVLAVAVALLRVLEPNRPL PETARSQKQWVTNVSCLETSSSASPARDSLMRHAKGLDQDTFKTCKEYLRPLKKFLRKLH LPRDLPQKKKLKYMKQSLVVLGDHINTFLQHYCQAWEIKHWRKMLWRFISLFSELEAKQL RRLYKYTKSSQPAKFLVTFCASDAPERSLLADREDSLPKLCHAWGLHSNISGMKERLSNM QTPGQGSPLPGQPRSQDHVKKDSLRELSQKPKLKRKRIKEAPETPETEP >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_2|1230_bp atgcccctctcctgtgcccgaggtggccgctgggtggcaggggaggcccgggccgacggc ggtcccaccgcactgtcgcgaaggcaggcgaagaacagggccacactgagtggggacaac tgggcccgggcccagaggacacgaaggaagctctccctagacctctgtcccaaatcagaa aagaattttacaaactgtgctttcctctcggggagcccccaccacggcgtaacccagcag gcaggaatggcacacgtgcaggaaaatgagttcaacagccttaccaggtggtgtcaggat cagcatctgacagtattcttgggagaaggggcgccagcctctcctggcctgctgtgcgct ctgccaaagtcttcgctcgaccccgcacatcttgctccgttccctagtccagatcccagc tggaggggccagctgcagccacccctaagaatgaggaaatgggagcggggtggggggctg ggttctgtgttggcggttgccgtggctttgctccgagttctggagccaaaccgcccactc cctgagacggctaggtcacagaagcagtgggtgacaaatgtgtcatgcctggagacaagc tccagcgccagccctgctagagactcgctcatgcgccatgccaagggcctggatcaggac accttcaaaacttgtaaagaatacctaagaccgctgaagaagttcctgcgaaagttgcac ctgcccagggaccttccccagaagaagaagctgaagtacatgaagcagagccttgtggtc ctaggggaccacatcaacacctttctgcagcattactgccaagcctgggaaatcaaacac tggagaaagatgctctggcggttcatctccctcttctcggagctggaagcaaagcagctt cgccgactttacaagtacaccaagagcagccagccggccaagttcctggtgacattctgc gcctcggatgcgccggagaggtccttgctggccgaccgggaagacagtctgcccaagctc tgccatgcatgggggctgcacagcaacatcagcggcatgaaggagcggctgtccaacatg cagaccccaggtcaagggagccccctgcctgggcagccaagatcccaggaccatgtcaag aaagattctttaagggagctttctcaaaaaccaaaactcaagaggaagaggataaaggaa gccccagaaactccagagactgaaccgtaa >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_3|241_aa MTKRKAEGDAKGDKAKTKDEPQKRSARLSAKPAFPKPEPKPKMAPAKKGEKVPKGKKGKA DAGKEGIALQKMKMPKQIRHRKLKVLEMPSEVCAFLITVYIWRPRCNFLRLSRIRVHLTP AASTMPPKFDPNEIKVIYLRCTGDATSALAPKIGPLGLSPKNVGGDIAKATGDWKGLRIT VKLTIQNRQAQIEVVPSASALIIKALKEPPETERNRKTLNTVGISLLMRSSTLLDRWSTD P >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_3|726_bp atgaccaagagaaaggctgaaggggatgctaaaggagataaagccaagacgaaggacgaa ccacagaaaagatccgcgaggttgtctgctaaacctgcttttccaaagccggagcccaag cctaaaatggcccctgcaaagaagggagagaaggtacccaaagggaaaaagggaaaagct gatgctggcaaggagggaatagccctgcagaaaatgaagatgccaaaacagatcaggcac agaaagctgaaggtgctggagatgccaagtgaagtgtgtgcatttttgataactgtgtac atctggaggccaaggtgcaactttcttcggttgtcccgaatccgggttcatctgacacca gccgcctccaccatgccgccaaagttcgaccccaacgagatcaaagtcatatacctgagg tgcaccggagatgccacttctgccctggcccccaagatcggccccctgggtctgtctcca aaaaacgttggtggtgacattgccaaggcaacgggtgactggaagggcctgaggattaca gtgaaactgaccattcagaacagacaggcccagattgaggtggtgccttctgcctctgcc ctgatcatcaaagccctcaaggaaccaccagagacagaaagaaacagaaaaacattaaac acagtgggaatatcacttttgatgagatcgtcaaccttgctcgacagatggagcaccgat ccttag >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_4|1115_aa MAGHPVFFLLIHLLPLDFSMGWTQTPGSNNWRRGWKERQAVAMFPGLTSNSRAQGILLCK EHGCNAARQAWAKGLSLSASSMATHHTQGETGQQNAAALGPQGVLPDFHGCFLKAHGLSS QCVVNAVWAETQPSGQKALLWTRAEISAGTEPGFLSNRDCDAFGLAGRCFLSHFLLGPAS AAAVPQFAPGGGRAAVPSAVRPRGCHRPSESSGAAEGFATEGGGGLREEEAEEAEEEGRK MAAVELEWIPETLYNTAISAVVDNYIRSRRDIRSLPENIQFDVYYKADEVTVANTEILLD TFRARWVSGAAQHQGWDINTQLYQQGRLCQLGSEFCELEVFAKVLRALDKRHLLHHCFQA LMDHGVKVASVLAYSFSRRCSYIAESDAAVKEKAIQVGFVLAAFTDWRRVSVVFSGTRCK LSVDLPFCGLEHSGPLLIAPLGSAPVGTLSRCPSETKLPEEGSGSNICCSAIFAVLQLSL VIPQQVRDCSSSPAMEQSQMENDFDKLTEVGFRRLVITDFSKLKEDVRTHRKEAKNLEKR LDKWLTRINSVEKTLNDLMELKTMARELHDTCTCFSSRFDLVEERVSVIEDQINEMNCFR ENKIPRNPTYKGCEGPLQGELQTTAQQNKRGHKQMEEHSMLMNRKNQYHENGCTVQGGFL SDAGWYSDAEKVFLSCLQLCTLHDEMLHWFRAVECCVRLLHVRNGNCKYHLGEETFKLAQ TYMDKLSKHGQQANKAALYGELCALLFAKSHYDEAYKWCIEAMKEITAGLPVKVVVDVLR QASKACVVKREFKKAEQLIKHAVYLARDHFGSKHPKYSDTLLDYGFYLLNVDNICQSVAI YQAALDIRQSVFGGKNIHVATAHEDLAYSSYVHQYSSGKFDNALFHAERAIGIITHILPE DHLLLASSKRVKALILEEIAIDCHNKETEQRLLQEAHDLHLSSLQLAKKAFGEFNVQTAK HYGNLGRLYQSMRKFKEAEEMHIKAIQIKEQLLGQEDYEVALSVGHLASLYNYDMNQYEN AEKLYLRSIAIGKKLFGEGYSGLEYDYRGLIKLYNSIGNYEKVFEYHNVLSNWNRLRDRQ YSVTDALEDVSTSPQSTEEVVQSFLISQNVEGPSC >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_4|3348_bp atggccggccacccagtgttctttctgctcatccacctactgcccttagacttcagcatg ggctggacccagaccccaggatctaacaactggcgacgaggatggaaggagagacaggct gttgctatgttccctgggctgacctcaaactcccgggctcaagggatcctcctgtgtaag gaacatggctgcaatgcagcgaggcaggcatgggccaagggtcttagcttgtcagccagt tccatggccactcatcatactcagggtgaaactggccagcagaatgctgcagccttgggg ccacagggagtactgccagacttccacggatgtttccttaaggcccacggcctctcaagt cagtgtgtggtgaatgctgtctgggcagagactcaaccgtcagggcagaaggctctactc tggaccagggcagagatcagcgctgggacggaacccgggttcctctcgaaccgggattgt gacgcttttggcctggctggccgctgttttctgtcccactttttactcgggcctgcgtcc gctgccgccgtccctcagtttgcccccggaggaggcagggcggccgtgccttctgccgtg cgcccgcgtggctgccaccgcccctccgaatcctccggggccgcagaggggttcgctacg gagggaggtgggggccttcgggaggaggaggcggaggaggcggaggaggagggaaggaag atggcggccgtggaactagagtggatcccagagactctctataacaccgccatctccgct gtcgtggacaactacatccgctcccgccgagacatccgctccttgcccgagaacatccag tttgatgtttactacaaggctgatgaggttactgtagcaaatactgaaattcttctggac accttccgagctcgctgggtttcaggcgctgcacagcatcaggggtgggacatcaacacc caactttaccaacagggacgcttatgtcaactgggcagtgaattttgtgaattggaagtt tttgctaaagtactgagagctttggataaaagacatttgcttcatcattgttttcaggct ttgatggatcatggtgttaaagttgcttcagtcttggcctactcattcagtaggcggtgc tcttatatagcagaatcagatgctgcagtaaaggaaaaagccattcaggttggctttgtt ttagctgctttcacggactggcgtcgagtgtctgtggttttttcaggcacacggtgcaag ctttcagtggatctgccattctgtggtctggagcacagtggccctcttctcatagctcca ctaggcagtgccccagtggggactctgtccaggtgcccctctgagacgaagcttccagag gaaggatcaggcagcaatatttgctgttctgcaatatttgctgttctgcagctttcgctg gtgataccccagcaagtaagggattgcagctcctcgccagcaatggaacaaagccagatg gagaatgactttgacaagttgacagaagtaggcttcagaaggttggtaataacagacttc tccaagctaaaggaggatgttcgaacccatcgcaaggaagctaaaaaccttgaaaaaaga ttagacaaatggctaactagaataaacagtgtagagaagaccttaaatgacctgatggag ctgaaaaccatggcgcgagaactacatgacacatgcacatgcttcagtagccgattcgat ctagtggaagaaagggtatcagtgattgaagatcaaattaatgaaatgaattgcttcaga gagaataaaatacctaggaatccaacctacaagggatgtgaaggacctcttcaaggagaa ctacaaactactgctcaacaaaataaaagaggacacaaacaaatggaagaacattccatg ctcatgaataggaagaatcagtatcatgaaaatggctgtactgtgcaaggtggctttctt tcagatgcaggctggtacagtgatgctgagaaagtttttctgtcctgccttcagttgtgt actctacacgatgagatgcttcattggtttcgtgcagtagaatgttgtgtgaggttgctt catgtgcgaaatggaaactgcaaatatcatttgggtgaagaaacatttaaattagctcag acatatatggataaactatcaaaacatggccagcaagcaaataaagctgcactctatgga gaactgtgtgcactcctatttgcaaaaagtcactatgatgaggcatacaaatggtgcatc gaggcaatgaaagaaattacagcaggcttaccagtgaaagttgtggtggatgtcttaaga caagcttctaaggcttgtgtagtaaaacgtgaatttaagaaggcagaacagttaattaaa catgcagtgtatttggcacgggatcattttggatccaaacacccaaaatattctgataca ctgctagattatgggttctacttactcaatgtagataatatctgtcagtctgttgcaatt tatcaggcagcccttgacattagacagtcagtgtttggtggcaaaaatatccacgtagca acagctcatgaagatttggcctactcttcttatgtccaccagtatagctctgggaaattt gacaatgcactatttcatgcagaaagagctattggtatcattacccacatcctacctgaa gatcatcttcttttggcttcttcaaagagggtgaaagcacttattttagaggagattgca attgattgtcataataaggaaactgaacagaggctgcttcaagaagctcatgatttgcac ctgtcttcactccaactagctaaaaaagcttttggggaatttaatgtacagactgcaaaa cactatggaaaccttggaagactttatcagtcaatgagaaaatttaaggaagctgaagaa atgcacatcaaagcaattcagattaaagaacaacttcttggtcaagaagattatgaagta gccctttcagtgggacatctggcttctttatataattatgacatgaatcagtatgaaaat gctgagaaactttatttgcgatctatagcaattgggaagaaactttttggtgagggctac agtggactagaatatgattatcgaggtctcattaaactttacaactccattggaaattac gagaaagtgtttgaatatcacaatgttctgtctaactggaaccggttgcgagatcggcaa tattcagtgacagatgctcttgaagatgtcagcaccagcccccagtccactgaagaagtg gtgcagtccttcctgatttctcagaatgtcgagggaccgagctgctga >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_5|108_aa MANIVKCCREVKLYSGMIKVHCILNLLGSSDPPTSACQAAGGPLIQETFWEPQLLVVIDS NNPTTTTGLSQRQLMLILSTIALCNTDSPLRYGDIAIPCNNKGAYLRV >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_5|327_bp atggccaatatcgtcaaatgctgtagagaggtcaagttgtacagtggcatgatcaaagtt cactgcatccttaacctcctgggctcaagtgatcctcccacatcagcgtgccaagcagct gggggaccactgatccaggaaaccttctgggagccacagcttctggtggttattgattcc aacaatccaacaacgaccaccggcctgtcacagaggcaacttatgttaatcttatctacc atcgctctgtgcaacacagattctcctttgcgctatggggacattgccatcccatgcaac aacaaaggagcttacttacgggtttga >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_6|273_aa MAGLYSLGVSVFSDQGGRKYMEDVTQIVVEPEPTAEEKPSPRRSLSQPLPPRPSPAALPG GEVSGKGPAVAAREARDPLPDAGASPAPSRCCRRRSSVAFFAVCDGHGGREAAQFAREHL WGFIKKQKGFTSSEPAKVCAAIRKGFLACHLAMWKKLARVPGGHGLRRPCTRSGQLALLA AGTVTLTAKVCSFTPEARETTNPPEGTNNSRRAALKAVTLTGKVCSFIPEASETTNPPEG RNSEHIRTSEGTNSGHTAFKNCNTRGKGLRFHS >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_6|822_bp atggcggggctgtactcgctgggagtgagcgtcttctccgaccagggcgggaggaagtac atggaggacgttactcaaatcgttgtggagcccgaaccgacggctgaagaaaagccctcg ccgcggcggtcgctgtctcagccgttgcctccgcggccgtcgccggccgcccttcccggc ggcgaagtctcggggaaaggcccagcggtggcagcccgagaggctcgcgaccctctcccg gacgccggggcctcgccggcacctagccgctgctgccgccgccgttcctccgtggccttt ttcgccgtgtgcgacgggcacggcgggcgggaggcggcacagtttgcccgggagcacttg tggggtttcatcaagaagcagaagggtttcacctcgtccgagccggctaaggtttgcgct gccatccgcaaaggctttctcgcttgtcaccttgccatgtggaagaaactggcgcgagtt ccaggtgggcatgggctccgcaggccctgcactcggagcggccagctggcgctgctggcc gcaggcactgtaacactcactgcgaaggtctgcagcttcactcctgaagccagggagacc accaacccaccagaaggaacgaacaactccagacgcgcagcattaaaagctgtaacactc accgggaaagtctgcagcttcattcctgaagccagcgagaccacgaacccaccagaagga agaaactccgaacacatccgaacatcagaaggaacaaattctggacacactgcctttaag aactgtaacactcgtggcaagggtctgcggtttcattcttga >gi568815581r:60347584_60625931|GENSCAN_predicted_peptide_7|71_aa MTGLPSTSGTTASVVIIRGMKMYVAHVGDSGVVLGIQDDPKDDFVRAVEVTQDHKPELPK ERERIEGLGGS >gi568815581r:60347584_60625931|GENSCAN_predicted_CDS_7|213_bp atgacgggtcttcctagcacatcagggacaactgccagtgtggtcatcattcggggcatg aagatgtatgtagctcacgtaggtgactcaggggtggttcttggaattcaggatgacccg aaggatgactttgtcagagctgtggaggtgacacaggaccataagccagaacttcccaag gaaagagaacgaatcgaaggacttggtgggagn