GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:06:10

Sequence gi568815592r:154059831_154365947 : 306117 bp : 39.81% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +  11341  11412   72  0  0   66  100    75 0.729   7.62
1.02  Term +  23709  23825  117  2  0   29   44   118 0.292  -0.94
1.03  PlyA +  24386  24391    6                               1.05

2.00  Prom +  28157  28196   40                              -5.85
2.01  Init +  30006  30348  343  2  1   89  113   283 0.012  28.34
2.02  Intr +  31122  31642  521  1  2  112  101   441 0.803  39.54
2.03  Intr +  46617  46650   34  2  1   76  107    -3 0.003  -2.72
2.04  Intr +  58853  59068  216  0  0  136  106    25 0.090   6.65
2.05  Intr +  75788  75861   74  0  2   82   77    40 0.278   0.51
2.06  Term +  76762  76956  195  0  0   54   49   153 0.674   4.43
2.07  PlyA +  77553  77558    6                               1.05

3.03  PlyA -  77617  77612    6                               1.05
3.02  Term - 100210  99998  213  1  0   80   53   293 0.997  21.25
3.01  Init - 102884 102783  102  2  0   71   83    70 0.366   5.09
3.00  Prom - 104421 104382   40                              -1.85

4.03  PlyA - 107051 107046    6                               1.05
4.02  Term - 108283 107877  407  2  2   78   47   272 0.669  16.46
4.01  Init - 108637 108493  145  1  1   92   11    28 0.204  -4.17
4.00  Prom - 108956 108917   40                              -8.75

5.00  Prom + 111278 111317   40                              -6.55
5.01  Init + 113375 113520  146  1  2   77   98   129 0.869  12.44
5.02  Intr + 113882 114096  215  2  2  -16   53    82 0.110  -8.26
5.03  Term + 115679 116721 1043  0  2   43   48   331 0.287  15.90
5.04  PlyA + 116753 116758    6                               1.05

6.09  PlyA - 118020 118015    6                               1.05
6.08  Term - 120780 120644  137  1  2   21   42   116 0.163  -2.50
6.07  Intr - 121911 121837   75  1  0   80   39    72 0.121   0.07
6.06  Intr - 140095 139838  258  2  0   53   92   189 0.933  12.31
6.05  Intr - 147096 146973  124  0  1   76   43    97 0.561   3.24
6.04  Intr - 153025 152940   86  2  2   86   76   104 0.726   7.62
6.03  Intr - 154446 154388   59  2  2   62   95    68 0.814   2.51
6.02  Intr - 163413 163340   74  0  2   65   92    95 0.583   4.89
6.01  Init - 186919 186761  159  1  0   74  101    71 0.383   6.87
6.00  Prom - 208102 208063   40                              -4.95

7.00  Prom + 211226 211265   40                              -6.25
7.01  Init + 213350 213515  166  1  1   36   57   123 0.340   3.84
7.02  Term + 218407 218639  233  2  2   75   42   179 0.933   7.65
7.03  PlyA + 219295 219300    6                               1.05

8.05  PlyA - 219662 219657    6                               1.05
8.04  Term - 228287 228201   87  2  0  101   47    49 0.048  -1.22
8.03  Intr - 235716 235658   59  1  2   71   85    29 0.014  -1.42
8.02  Intr - 245908 245867   42  2  0   96   98    44 0.367   3.49
8.01  Init - 251507 251411   97  2  1   58   88    90 0.746   6.52
8.00  Prom - 280404 280365   40                              -4.65

9.00  Prom + 283274 283313   40                              -4.45
9.01  Sngl + 284443 284847  405  0  0   29   52   280 0.443  14.53
9.02  PlyA + 286466 286471    6                               1.05

10.02 PlyA - 289535 289530    6                               1.05
10.01 Sngl - 294841 294536  306  1  0   55   43   364 0.598  24.22
10.00 Prom - 297830 297791   40                              -9.65

11.00 Prom + 298953 298992   40                              -4.65
11.01 Init + 302305 302419  115  0  1   50  103   118 0.845   9.92
11.02 Term + 303530 303642  113  0  2   97   29    77 0.758   0.44
11.03 PlyA + 305521 305526    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  29996  30348  353  2  2  126  113   302 0.988  30.54
S.002 Init - 141288 141234   55  0  1   92   70    46 0.807   4.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_1|62_aa
MVKIQLKAQCVVIAAEGRKQLIKKILMVCHRGTRRKKKAEKIGRGQKSERMWRERVTCYQ
ED

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_1|189_bp
atggtgaagatccagctaaaggcacagtgtgtagtcatagctgctgaaggaaggaagcag
ctcattaaaaagatactcatggtttgtcatcgggggacccgaaggaagaagaaagcagag
aagattgggagaggacaaaagagcgagaggatgtggagagaaagggtgacatgctaccaa
gaagactga

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_2|460_aa
MKTATNIYIFNLALADALATSTLPFQSVNYLMGTWPFGTILCKIVISIDYYNMFTSIFTL
CTMSVDRYIAVCHPVKALDFRTPRNAKIINVCNWILSSAIGLPVMFMATTKYRQGSIDCT
LTFSHPTWYWENLLKICVFIFAFIMPVLIITVCYGLMILRLKSVRMLSGSKEKDRNLRRI
TRMVLVVVAVFIVCWTPIHIYVIIKALVTIPETTFQTVSWHFCIALGYTNSCLNPVLYAF
LDENFKRCFREFCIPTSSNIEQQNSTRIRQNTRDHPSTANTVDRTNHQTLPSSKAGKLYA
RKSGSRNCSVALTGSHAIPTFTKLRSHHVCGSRLLQECVGGSNSLGKCLLLGHPTSFLSG
HSALHIRGTAKMAGSLANLKRLYKRLGPAMVSNFSKPLRLVPGPSSCTGAASPQTVPKTV
SEIQVQQKYTSPLLQIHGESGENWQLLSMTCPYDVDGAGI

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_2|1383_bp
atgaagactgccaccaacatctacattttcaaccttgctctggcagatgccttagccacc
agtaccctgcccttccagagtgtgaattacctaatgggaacatggccatttggaaccatc
ctttgcaagatagtgatctccatagattactataacatgttcaccagcatattcaccctc
tgcaccatgagtgttgatcgatacattgcagtctgccaccctgtcaaggccttagatttc
cgtactccccgaaatgccaaaattatcaatgtctgcaactggatcctctcttcagccatt
ggtcttcctgtaatgttcatggctacaacaaaatacaggcaaggttccatagattgtaca
ctaacattctctcatccaacctggtactgggaaaacctgctgaagatctgtgttttcatc
ttcgccttcattatgccagtgctcatcattaccgtgtgctatggactgatgatcttgcgc
ctcaagagtgtccgcatgctctctggctccaaagaaaaggacaggaatcttcgaaggatc
accaggatggtgctggtggtggtggctgtgttcatcgtctgctggactcccattcacatt
tacgtcatcattaaagccttggttacaatcccagaaactacgttccagactgtttcttgg
cacttctgcattgctctaggttacacaaacagctgcctcaacccagtcctttatgcattt
ctggatgaaaacttcaaacgatgcttcagagagttctgtatcccaacctcttccaacatt
gagcaacaaaactccactcgaattcgtcagaacactagagaccacccctccacggccaat
acagtggatagaactaatcatcagaccttacctagcagtaaagcaggcaagttgtacgct
agaaaatctggaagcagaaactgctccgttgccctaacagggtctcatgccattccgacc
ttcaccaagcttagaagccaccatgtatgtggaagcaggttgcttcaagaatgtgtagga
ggctctaattctctaggaaagtgcctgcttttaggtcatccaacctctttcctctctggc
cactctgctctgcacattagagggacagccaaaatggcaggctccctggccaacttgaaa
aggctatataaaaggcttggtccagccatggtgtccaacttctccaagcccctccgcttg
gttcctgggccctcttcctgcactggcgcagcatccccacagacagtgcccaagacagtg
tctgaaattcaggtacaacaaaaatatacatctcctttacttcaaatacatggggagagt
ggtgaaaactggcagctgctttccatgacctgcccatatgatgtggatggagctgggatt
tga

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_3|104_aa
MNIDRAENRIQAGALEHSSEKRWRDKEESATKVGCKEHDLAMINQLLDDPKLTARKYREW
KVMNTLLIQDIYQQQRASPAPDDTDDTPQELKKSPSSPSVENSI

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_3|315_bp
atgaacatagatagagcagagaacaggatacaagctggggctttagaacactccagtgag
aagaggtggcgtgacaaggaggaatcagcaactaaggtgggatgtaaagaacatgatctg
gccatgattaaccagttgctggatgacccgaagctgacagccaggaaatacagagagtgg
aaagtcatgaacaccctgctgatccaggacatctatcagcagcagcgggcttcgcctgcc
cctgatgacactgatgacaccccccaggaactcaagaaatcaccttcttctccctctgtt
gaaaattccatttga

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_4|183_aa
MTDTLINRGSLDTETCMQGTPHVNIGVILPQTKKPLEARREAWNNSFPEETKVSEDDEME
KLYKSLEQASLSPLGDRRPSTKKELRKSFVKRCKNPSINEKLHKIRTLNSTLKVTKFQIV
DELYGCGARNGWFCREILANHSMFVWLQGRAVKVNITFIGQDDLDVLKALGHSHSSAQGS
FSQ

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_4|552_bp
atgactgataccctcataaatagggggagtttggacacagagacatgcatgcagggaaca
ccccatgtgaatattggagttatactgccacaaaccaaaaaaccacttgaagccaggaga
gaggcttggaataactccttccctgaagagacaaaagtgtctgaagatgatgaaatggag
aagctgtacaaatcattagagcaagctagtctatctcctcttggggaccgacgaccttcg
actaaaaaggagttgagaaaatcctttgttaagcggtgtaaaaatccatctataaacgag
aaactccacaaaatccgaacattgaatagcacattaaaggtaacaaaatttcaaattgtg
gatgaactctatggatgtggggcaaggaacggctggttctgcagagagattcttgctaat
cacagcatgtttgtgtggttgcagggaagggctgtaaaagtaaatataacctttataggg
caagatgatcttgacgttcttaaagcactaggacattcacacagtagcgcccaaggcagt
ttttcacaatag

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_5|467_aa
MGRNQSGKAENSKNQNTSSPPKEHNSSPAREQNWMENEFDELTEVGFRSDGENGTKLENT
LQDIIQENFPNLARQVNIQIQEIQRTPQRYSLRRATPRHITVRFIKVEMKEKMLRAVREK
EIQTTIREYYKHLYTNKLENLEETEKFMDTYTLPKLNQEEVESLNRLKIASEIEAIINSL
PTKKNPGPDGFAAEFYQRYKEELVPFLLKLFQSIEKQGILPNSFYEASIILIPKPGRDTT
KKGNFRPISLMNIDVKILNKILANQIQQQIKKLIHHNQVGFIPGMQDCFNVPKSIDIIYH
INRTNDKNHMIMSIDAEKAFNKNQQPFVLKTLNKLGIDGMYLKIIRAIYDKPTANIILNG
QKLEAFPLKTGIRQGCPLSPLPFNIVMEVLAKAIRQEKEIKCIQLGKEEVKLSLFVDHMI
VYLENPIISAQNLLKLISNFSKVSGYKINVQKSQAFQYTNNRQRAKS

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_5|1404_bp
atggggagaaaccagagtggaaaggctgaaaattccaaaaaccagaacacctcttctcct
ccaaaggaacacaactcctcaccagcaagggaacaaaactggatggagaatgaatttgat
gagttgacagaagtaggcttcagaagtgacggggagaatgggaccaaattggaaaacact
cttcaggatattatccaggagaacttccccaacctagcaaggcaggtcaacattcaaatt
caggaaatacagagaacgccacaaagatactccttgagaagagcaaccccaagacacata
actgtgagattcatcaaggttgaaatgaaggaaaaaatgttaagggcagtcagagagaaa
gaaatacaaactaccatcagagaatactataaacacctctacacaaataaactagaaaat
ctagaagaaactgagaaattcatggacacatacaccctcccaaaactaaaccaggaagaa
gttgaatctctgaatagactaaaaatagcttctgaaattgaggcaataattaatagccta
ccaaccaaaaaaaatccaggaccagatggattcgcagccgaattctaccagaggtacaaa
gaggagctggtaccattccttctgaaactattccaatcaatagaaaaacagggaatcctc
ccaaactcattttatgaggccagcatcatcctgataccaaagcctggcagagacacaaca
aaaaaagggaattttaggccaatatccctgatgaacattgatgtaaaaatcctcaataaa
atactggcaaaccaaatccagcagcaaatcaaaaagcttatccaccacaatcaagtcggc
ttcatccctgggatgcaagactgcttcaacgtacccaaatcaatagacataatctatcac
ataaacagaaccaatgataaaaaccacatgattatgtcaatagatgcagaaaaggccttc
aacaaaaatcaacagcctttcgtgctaaaaactctcaataaactaggtattgatggaatg
tatctcaaaataataagagctatttatgacaaacccacagccaatatcatactgaatggg
caaaaactggaagcattccctttgaaaactggcataagacaaggatgccctctctcacca
ctcccattcaacatagtaatggaagttctggccaaagcaatcagacaagagaaagaaata
aagtgtattcaattaggaaaagaggaggtcaaattgtctttgtttgtggatcacatgatt
gtatatttagaaaaccccatcatctcagcccaaaatctccttaagctgataagcaacttc
agcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattccaatacaccaat
aacagacagagagccaaatcatga

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_6|323_aa
MSRRRISCKDLGHADCQGWLYKKKEKGSFLSNKWKKFWVILKGSSLYWYSNQMAEKADGF
VNLPDFTVERASECKKKQWLNKLGSAVIHQESTTKDEECYSESEQEDPEIAAETPPPPHA
SQTQSLALGLSSGCHCSLAANFALIVSPTNGLLFPKSVPDIPALVPVERQSLPDTVNSLS
AAEDEGQPITFAVQVHSPVPSEAGIHKALENSFVTSESGFLNSLSSDDTSSLSSNHDHLT
VPDKPAGSKIMDKGLVPCGPVETEFEVQNLGTSAAEAPAKFWAYWGMPVLSEQTIDWGME
EEEKGAAHSFGRVLERSELITVG

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_6|972_bp
atgagtcggaggaggatatcgtgtaaagatctgggccatgctgactgccaagggtggctg
tataagaaaaaggaaaagggaagtttcctaagcaacaaatggaaaaagttctgggtgata
ctgaaggggtcgtcactgtactggtatagcaatcaaatggcagagaaagctgatggattt
gtcaacctgcctgatttcactgtggaaagagcatctgaatgcaagaaaaagcagtggtta
aataaacttggatcggctgtaatccatcaggaatccactacaaaggatgaagaatgttac
agtgaaagtgaacaggaagatccagaaatagctgcggagacaccaccccctcctcacgct
tcccagactcagtctttggcccttggcctgtcttcaggctgccattgcagtcttgctgcc
aattttgcccttatcgtcagcccaacaaacggcctgctgttccccaaaagcgttccagac
attcctgccctcgtacctgtagagagacaatccttgcctgacacagttaacagtttgtct
gctgctgaagatgagggacaaccaataacgtttgctgtgcaagttcattcacctgtaccc
tcagaggcaggcatccacaaggccctggaaaacagttttgtcacatcagaaagtggattt
ttgaactctttatctagtgatgatacttcttcattgagtagcaatcatgaccatcttact
gtcccagataagcctgctggatcaaagatcatggacaaaggtctcgtgccctgcggccct
gtggagacagaatttgaggtacaaaacctgggaacctcagcagcagaggctccggccaaa
ttctgggcctactggggcatgcctgtgctgtcagagcagacaatagactggggtatggaa
gaggaggagaagggagcagcacattcatttggcagggtcttggaaaggagtgaacttata
actgttggctaa

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_7|132_aa
MLAFSKTLSQQQVDLYKVHKHRQTMCVILKPEVRAKRKQKNACYPRGRSLDRRQVGTTSW
DFTSERGTCALAEARLTFTRSRSPDTAVLQRAEPCGQVRENETPSREWESKRRQGSRRMD
AKVILKVEILRS

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_7|399_bp
atgctggcattcagcaagacattatcacagcagcaagtggatttatacaaagtccacaaa
cacaggcaaacaatgtgcgttattcttaagccagaagtcagagctaagaggaaacaaaaa
aatgcttgttatcctcgtggcagaagtcttgatcgtcgacaagtaggcaccacttcatgg
gattttacatctgaaagaggaacatgtgctctagcagaagccagactgacattcaccagg
tccaggtctcctgacactgctgtgctacaaagagctgagccctgtggacaggtccgggag
aatgagacgccaagtagagaatgggaatccaagagacgccaaggctccagacgtatggat
gcaaaagtgatcttgaaagtagagatcctccgatcctag

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_8|94_aa
MSLRRDERRLAESGVLKAKRKRVLGWDPVLQGAPLAADWMVPTRIEAMLTAKCVPLWKVL
INEQKELLKPVVLVVQLMDIQISFGAYLSCIDVF

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_8|285_bp
atgagcttgagaagagatgaaagaagactggcagaaagtggagtactgaaagccaagaga
aaaagagtattaggatgggatcctgtgctgcagggagctccactggcagctgattggatg
gtgcccacccgtattgaagcaatgctcacagcaaagtgtgtgccactatggaaagtcctc
atcaatgagcagaaagagttattaaaacccgttgtcctggttgtccaactcatggacata
caaattagctttggtgcataccttagctgtatagacgtcttctaa

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_9|134_aa
MSQDPWCSSTRRSHIMTTSLSFHVLVLCFSSICISSKSLSPKILTSCFSTEKSHSHTLSY
RNPSDFGACKDMRLPVSQMEKLRHEENNPMFTGLATGSIKIFPPVVQDSSRYRSAQYAGA
RIPTTITDGARGEL

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_9|405_bp
atgagtcaagatccctggtgttcctccactcgtcggagccatatcatgactacttcactg
tcctttcatgtcctggttttgtgcttttcttccatctgtatatccagtaaatcattatca
ccaaagatacttacatcctgtttttcaactgaaaaatctcattcacacacattatcttat
agaaaccccagtgactttggggcatgtaaagatatgaggctccccgtttcacaaatggag
aaactaagacatgaagaaaacaaccccatgttcacaggtttagcaactggcagcataaag
atcttccctcctgtggtccaggattcttcccggtataggagtgctcaatatgccggagca
cgaataccaactaccatcacagatggcgccaggggagagctctga

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_10|101_aa
MVVEVEVEIKGVVLEVVVVGEVMVEVTVEMVEVVVEAMVEVVMEMVEMVIGVVEVMVEVT
VEMVEVVVEEMVEVVVEEMVEVVVEVVVEMVVGGDGGGDRW

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_10|306_bp
atggtggtggaggtggaggtggagataaagggggtggtgttggaagtggtggtcgtaggg
gaggtgatggtggaagtgacggtggagatggtagaggtggttgtggaggcgatggtggag
gtggtaatggagatggtagaaatggtgattggagtggtagaggtgatggtggaagtgacg
gtagagatggtggaggtggttgtggaggagatggtggaggtggtggtggaggagatggtg
gaggtggtggtggaggtggtggtggagatggtggttggaggtgacggtggaggtgacaga
tggtag

>gi568815592r:154059831_154365947|GENSCAN_predicted_peptide_11|75_aa
MEVVESGKQKTQPGSVAKGNPWMGIRQLNHTANISMELASRACLQCTLSCVIIMMVNQGM
RHAFSRSRSLEGPCA

>gi568815592r:154059831_154365947|GENSCAN_predicted_CDS_11|228_bp
atggaagtggtggaatccgggaaacagaagactcaacctgggagcgttgcaaagggaaat
ccttggatgggaatcaggcagctgaaccacacagcaaacatcagcatggagctcgcatcc
cgtgcatgtctccagtgcacattgtcctgtgttattataatgatggtgaatcagggcatg
cgacatgcattctcaaggtcccggtccttagaagggccctgtgcttag