GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:24:14

Sequence gi568815596r:196037450_196263383 : 225934 bp : 38.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   7603   7878  276  0  0   39   42   280 0.561  13.63
 1.02 PlyA +   8334   8339    6                               1.05

 2.04 PlyA -   9771   9766    6                               1.05
 2.03 Term -  10050   9899  152  1  2   50   38   156 0.854   3.89
 2.02 Intr -  10955  10847  109  2  1   94   95   100 0.996  10.24
 2.01 Init -  14757  14701   57  0  0   51   41    86 0.512   1.56
 2.00 Prom -  17358  17319   40                              -7.35

 3.00 Prom +  20911  20950   40                              -7.65
 3.01 Init +  22131  22298  168  2  0   67   68    85 0.187   3.99
 3.02 Term +  31018  31554  537  0  0   57   41   382 0.576  23.66
 3.03 PlyA +  32744  32749    6                               1.05

 4.00 Prom +  41130  41169   40                              -5.55
 4.01 Init +  46378  46581  204  0  0   50   28   174 0.273   6.50
 4.02 Term +  52808  52975  168  1  0   65   42   163 0.750   6.30
 4.03 PlyA +  53628  53633    6                               1.05

 5.00 Prom +  53635  53674   40                             -13.78
 5.01 Sngl +  54003  54917  915  2  0   63   41   346 0.955  23.46
 5.02 PlyA +  54975  54980    6                               1.05

 6.00 Prom +  55223  55262   40                              -9.85
 6.01 Sngl +  55434  56447 1014  2  0   43   38   394 0.953  26.66
 6.02 PlyA +  57907  57912    6                               1.05

 7.00 Prom +  58829  58868   40                              -6.35
 7.01 Init +  66818  66910   93  1  0   66   69    80 0.002   4.33
 7.02 Intr +  86410  86589  180  0  0   70   86   118 0.439   8.94
 7.03 Term +  98936  99043  108  1  0   71   28   148 0.243   4.63
 7.04 PlyA +  99159  99164    6                               1.05

 8.08 PlyA -  99689  99684    6                              -0.45
 8.07 Term - 100280  99998  283  1  1   66   42   139 0.708   1.01
 8.06 Intr - 102350 102171  180  1  0  123   89    68 0.860   8.46
 8.05 Intr - 103848 103800   49  1  1   82   83    54 0.821   1.12
 8.04 Intr - 106237 106111  127  1  1   48   82    75 0.188   2.23
 8.03 Intr - 108606 108462  145  2  1   63   92    34 0.194   0.66
 8.02 Intr - 119202 118990  213  2  0   98   92    96 0.487   7.91
 8.01 Init - 125934 125813  122  0  2   71   94    29 0.438   1.51
 8.00 Prom - 130244 130205   40                              -7.85

 9.17 PlyA - 131029 131024    6                               1.05
 9.16 Term - 133473 133289  185  1  2   37   34   176 0.470   3.82
 9.15 Intr - 134178 133884  295  0  1   52   49   246 0.580  12.76
 9.14 Intr - 143358 143162  197  1  2   15   86   135 0.000   4.21
 9.13 Intr - 146054 145864  191  1  2   67  110    42 0.005   2.61
 9.12 Intr - 161189 160988  202  0  1   62  111    35 0.016   0.62
 9.11 Intr - 163698 163522  177  1  0   90   43    89 0.055   3.57
 9.10 Intr - 178613 178416  198  0  0   84   98    58 0.784   4.80
 9.09 Intr - 182704 182590  115  1  1  134   52   119 0.940  12.00
 9.08 Intr - 183492 183346  147  0  0   57   88   194 0.945  15.81
 9.07 Intr - 184891 184762  130  0  1   59   89   171 0.733  14.08
 9.06 Intr - 188421 188323   99  2  0  118   64    44 0.357   3.31
 9.05 Intr - 190805 190653  153  1  0   63  115    42 0.159   2.67
 9.04 Intr - 203113 203000  114  0  0   79   96   127 0.873  11.24
 9.03 Intr - 204755 204635  121  0  1   85  106    80 0.986   8.13
 9.02 Intr - 216580 216471  110  0  2   96  109    69 0.810   8.81
 9.01 Intr - 220457 220374   84  1  0   63   73    72 0.340   1.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  63470  63445   26  2  2   98   94    20 0.933  -0.49
S.002 Intr -  63796  63591  206  2  2   94   43   138 0.954   7.90
S.003 Init -  70727  70718   10  2  1   89   74    14 0.838   0.76
S.004 Sngl - 143377 142892  486  1  0   50   39   287 0.944  15.81
S.005 Term - 177398 177259  140  0  2   45   42   126 0.838   0.84

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_1|91_aa
MRFCLKKERERRKEGEGEGGGGGGRGDGGGGEGGGEEGGGGGREEEKEEEKEEEEEKEKK
KEREGRKRRREGNEREGGRRKKRKKKKKRKE
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_1|276_bp
atgagattctgtctcaaaaaagagagggagagaagaaaagaaggagaaggtgaaggagga
ggaggaggaggaagaggagatggaggaggaggagaaggaggaggagaagaaggaggagga
ggaggaagagaagaggagaaggaggaggaaaaggaggaggaggaggagaaggagaagaag
aaagagagggagggaagaaaaagaagaagggaagggaatgaaagggaaggagggaggaga
aagaaaagaaaaaagaaaaaaaagaggaaagaataa
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_2|105_aa
MGDVAVAHWMTKTFAEELQVSTKPHWQQAAPSFHLSVKQDDESPEPFSVKNEQSHAEYME
RFGKKGKLPHQVDDSYVGPSTSKSKGKSPHKERENFRSTLVNVIM
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_2|318_bp
atgggagatgtggctgtggcccactggatgacaaagacatttgctgaagagctgcaggtg
agtacaaagccccactggcagcaggcagctccatcattccatttgagtgtaaagcaggat
gatgagagtccagaaccatttagtgttaaaaatgaacagtcccatgctgaatacatggaa
cgttttggaaaaaagggcaaattaccccaccaagttgatgatagttatgttggaccatct
acttccaaatcaaagggcaaatctccacataaagaacgagaaaactttagaagtactctt
gttaatgtcattatgtaa
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_3|234_aa
MVWLPAVSNLVNLWCEANSEADTLQQVSPASLWAAFHKFHLHHRGRLPCKLSGSQQAARF
GCYDDQGQTAGKGTYKGIHGKRSVTQATSGVKEAVHRGVTAGEFARQEGLPPLPKRLGSV
AAAASLAKSSPLTCSLLMAARTRWPHRCFWVAPARGTPRTIEAGPRDLQRSQLPPHQSRL
ASGQRLLLSNGPAQVPFPVICSFWGWTRKLARGPARGAFWGAAASVVRSKVCVE
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_3|705_bp
atggtgtggctgccagcagtcagcaaccttgtcaatctgtggtgtgaggctaactcagag
gcagacaccctccagcaagtttcgccagcatccctgtgggcagctttccacaagtttcac
ttacatcacagagggagacttccttgcaaactttcaggatcccagcaggcggctaggttt
ggttgttatgacgaccaagggcaaacagctgggaagggaacttataagggcattcacgga
aagcgctcagtaacccaggccacaagtggggtgaaggaagctgtacaccgcggagtcaca
gctggggagttcgctaggcaggaggggcttccaccacttccgaaacgcctgggaagcgtc
gcggcggcggcgagcctggcaaagagcagccctctcacctgctcactgctcatggctgcg
aggacgcgctggcctcaccggtgcttctgggttgctcctgcccgcggaacccctaggacg
atagaggcagggccccgggacttgcagcggtctcagctccctccgcaccagagccgtcta
gcgtccgggcagcgtttgttgctaagcaacgggcccgcgcaggttccatttccggtcatc
tgttccttctggggctggacccggaagctggcgcgcggtcccgcaagaggagctttctgg
ggcgctgctgccagtgttgtgaggtctaaggtgtgcgttgaataa
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_4|123_aa
MWDQSPPTHRVPTGAPPSGAMRRGPPSSRPQNGRSTNNVHCMPGKAADTQHQPVKAARSG
ALPCKATEEHSSSPAMEQSWMENDFDELREDSFRQSIITNFSELKEDVRTHRKEAKNLGK
KIR
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_4|372_bp
atgtgggatcagagcccccccacacacagagttcctactggggcaccacctagtggagct
atgagaagagggccaccatcctccagaccccagaatggtaggtccaccaacaacgtgcac
tgtatgcctggaaaagctgcagacactcaacatcagcccgtgaaagcagccaggagtggg
gctttaccctgcaaagccacagaggaacacagctcctcaccagcaatggaacaaagctgg
atggagaatgactttgatgagttgagagaagacagcttcagacaatcgataataacaaac
ttctctgagctaaaggaggatgttcgaacccatcgcaaagaagctaaaaaccttggaaaa
aagattagatga
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_5|304_aa
MEEDLPSKWKTKQNKKAGIAILVSDKTDFKPTKIRRDKEGHYIMVKGSIQQEELTILNIY
APNTGAPRFIKQVLRDLQRDLDSHTILMGDFNTPLSTLDRSTRQKVNKDIQELNSALHQA
DLIDIYRTLHPRSTEYTFFSASHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKL
ELRIKKLTQNHSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVC
RGKFIALNAHKRKQERSKIDILTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKP
FENQ
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_5|915_bp
atggaggaagatttaccaagcaaatggaaaacaaaacaaaacaaaaaagcagggattgca
atcctagtctctgataaaacagactttaaaccaacaaagatcagaagagacaaagaaggc
cattacataatggtaaagggatcaattcaacaagaagagctaactatcctaaatatatat
gcacccaatacaggagcaccaagattcataaagcaagtccttagagacctacaacgagac
ttagactcccacacaatactaatgggagactttaacaccccactgtcaacattagacaga
tcaacgagacagaaagttaacaaggatatccaggaattgaactcagctctgcaccaagcg
gacctaatagacatctacagaactctccaccccagatcaacagaatatacattcttttca
gcatcacaccacacctattccaaaattgaccacatagttggaagtaaagcactcctcagc
aaatgtaaaagaacagaaattataacaaactatctctcagaccacagtgcaatcaaacta
gaactcaggattaagaaactcactcaaaaccactcaactacatggaaactcaacaacctg
ctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaa
acaaacaagaacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgt
agagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatcgaaaattgac
atcctaacatcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagc
agaaggcaagaaataactaagatcagagcagaactgaaggaaatagagacacaaaaaccc
ttcgaaaatcaatga
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_6|337_aa
MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQVWFNICKSKNVIHHINRIKDENHM
IISIDAEKAYNKIQQHFMLKTLNKLGIDGTYLKIIRPIYDKPTANIILNGQKLEAFPLKT
GTGQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEKVKLSLFADDMIVYLENPIVSA
QNLLKLISNFSKISGYKINVQKSQAFLYTNNRQTESQIMGELPFTIASKRIKYLGIQLTR
DVKDLFKENYKSLLNKIKEDTNKCKNIPCSWIGRNNIVKMAILPKVIYRFNTIPIKLLMT
FFTELEKTIKVHMEPKKSPHCQDNPKPKEQSWRHHTT
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_6|1014_bp
atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaagtctggtttaac
atatgcaaatcaaaaaatgtaatccatcatataaacagaatcaaagacgaaaaccacatg
attatctcgatagatgcagaaaaggcctacaacaaaattcaacagcacttcatgctaaaa
actctcaataaactaggtattgatgggacgtatctcaaaataataagacctatttatgac
aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact
ggcacaggacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg
gccagggcaatcaggcaagagaaagaaataaagggtattcaattaggaaaagagaaagtc
aaattgtccctgtttgcagatgacatgattgtatatctagaaaatcccatcgtctcggcc
caaaatctccttaagctgataagcaacttcagcaaaatctcaggatacaaaatcaatgtg
caaaaatcacaagcattcctctacaccaataacagacaaacagagagccaaatcatgggt
gaacttccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg
gatgtgaaggacctctttaaggagaactacaaatcactgctcaacaaaataaaagaggac
acaaacaaatgcaagaacattccatgctcatggataggaagaaacaatattgtgaaaatg
gccatactgcccaaggtaatttatagattcaataccatccccatcaagctactaatgact
ttcttcacagaattggaaaaaactattaaagttcatatggagccaaaaaagagcccacat
tgccaagacaatcctaagccaaaagaacaaagctggaggcatcacactacctga
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_7|126_aa
MAQLANSNVCYTPSSRFLVSGQPYQTEQSSKLPYLSVPKGKGMCLLRPTVFTGAHCMSVK
FSDYPGDSPPPSVTELFICVLQPDLIFLSVQEFHTRTSDEPDVMFYDHISLATADSGTVL
IDLPVL
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_7|381_bp
atggcacagttagcaaatagcaatgtctgttatacaccctcttcaaggttccttgtgagc
ggtcaaccttaccagacagaacaatcttccaagcttccctatcttagtgtgcctaaaggg
aaaggaatgtgcttattaaggcccactgtttttactggggcccactgtatgagtgtgaag
tttagtgattatccaggagactcgccccctccttctgtgaccgagttgtttatctgtgtt
ttacagcctgacctgatcttcttatctgtccaggaatttcacacaagaacttcagacgag
cctgatgtcatgttttatgatcacatcagtcttgctaccgctgactcgggcactgtgtta
atagacctaccggtgctgtaa
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_8|372_aa
MSRRRFDCRSISGLLTTTPQIPIKMENFNNFYILTSKELGRGKFAVVRQCISKSTGQEYA
AKFLKKRRRGQDCRAEILHEIAVLELAKSCPRVINLHEVYENTSEIILILEYAAGGEIFS
LCLPELAEMVSENDVIRLIKQILEGVYYLHQNNIVHLDLKPQNILLSSIYPLGDIKIVDF
GMSRKIGHACELREIMGTPEYLAPEILNYDPITTATDMWNIGIIAYMLLTHTSPFVGEDN
QETYLNISQVNVDYSEETFSSVSQLATDFIQSLLVKNPEKRPTAEICLSHSWLQQWDFEN
LFHPEETSSSSQTQDHSVRSSEDKTSKSSCNGTCGDREDKENIPEDSSMVSKRFRFDDSL
PNPHELVSDLLC
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_8|1119_bp
atgtcgaggaggagatttgattgccgaagtatttcaggcctactaactacaactcctcaa
attccaataaaaatggaaaactttaataatttctatatacttacatctaaagagctaggg
agaggaaaatttgctgtggttagacaatgtatatcaaaatctactggccaagaatatgct
gcaaaatttctaaaaaagagaagaagaggacaggattgtcgagcagaaattttacacgag
attgctgtgcttgaattggcaaagtcttgtccccgtgttattaatcttcatgaggtctat
gaaaatacaagtgaaatcattttgatattggaatatgctgcaggtggagaaattttcagc
ctgtgtttacctgagttggctgaaatggtttctgaaaatgatgttatcagactcattaaa
caaatacttgaaggagtttattatctacatcagaataacattgtacaccttgatttaaag
ccacagaatatattactgagcagcatataccctctcggggacattaaaatagtagatttt
ggaatgtctcgaaaaatagggcatgcgtgtgaacttcgggaaatcatgggaacaccagaa
tatttagctccagaaatcctgaactatgatcccattaccacagcaacagatatgtggaat
attggtataatagcatatatgttgttaactcacacatcaccatttgtgggagaagataat
caagaaacatacctcaatatttctcaagttaatgtagattattcggaagaaactttttca
tcagtttcacagctggccacagactttattcagagccttttagtaaaaaatccagagaaa
agaccaacagcagagatatgcctttctcattcttggctacagcagtgggactttgaaaac
ttgtttcaccctgaagaaacttccagttcctctcaaactcaggatcattctgtaaggtcc
tctgaagacaagacttctaaatcctcctgtaatggaacctgtggtgatagagaagacaaa
gagaatatcccagaggatagcagcatggtttccaaaagatttcgtttcgatgactcatta
cccaatccccatgaacttgtttcagatttgctctgttag
>gi568815596r:196037450_196263383|GENSCAN_predicted_peptide_9|839_aa
XEKIQFIRTEGTPGLVRLSSDADLVMLLSLFEEEIMSYVPPHALLHPSYCQSPRGSPVSS
PQNSPGTQRANARAPAPYKRDFEAKLRNFYRKLETKGYGQGPGKLKLIIRRDHLLEDAFN
QIMGYSRKDLQRNKLYVTFVGEEGLDYSGPSREFFFLVSRELFNPYYGLFEYSANDTYTV
QISPMSAFVDNHHEWFRFSGRILGLALIHQYLLDAFFTRPFYKALLRILCDLSDLEYLDE
EFHQSLQWMKDNDIHDILDLTFTVNEEVFGQITERELKPGGANIPVTEKNKKEYIERMVK
WRIERGVVQQTESLVRGFYEVVDARLVSVFDARELELVIAGTAEIDLSDWRNNTEYRGGA
LPVNDVMPFLAVGLWLHLLLLFSWVDSLLQAHPAFPMKDLLHSEGVTAQEDSVWRNGGKS
LLFPGTSALSHLGTGDESSWKDLGELDAQGTTQPSFNQQFLTAKLFPFVMFQDKDEPIHD
QLHEDTLNTSYMYAEHLLDERTSTSLAFPLCQFDVCLTMELQESKSAFEKLSSQCSRVDS
KVQKSVFNKICSWFAILSSVLNLITLGGKSFLIFLNPHLKPPQAIWATAGSECGPARWGD
VNTVLGLLVLHTPAGEDDDTIAQTQDSTLQLVPWTLEMPLDLLLPGPDCSFRLSILHGPD
LLAPDSKVQVHMCDWLIEPKGEGSGERRCPRRGCSFQTATASFGSPLRCCRQESLHEKPG
HNRRPLSGKVKVDPATFGAPWRLARLELENSSCGGPRRGAGQRPPGGGSPGRTCGGRSAS
PESQLRRGRVPIGTRGRVLLPAAFRLEKVRVHLVGRFQLLLGNSVLKPARDQDGLQLFP
>gi568815596r:196037450_196263383|GENSCAN_predicted_CDS_9|2520_bp
nnggagaagatccaatttatccgaactgaagggactccaggattggtgcgcctttcaagc
gatgcagaccttgttatgttgctgagcttatttgaagaagagataatgtcgtatgtgcct
ccacatgccttactccaccccagctactgtcagtccccacgtggctctcccgtgtcatct
cctcagaactcgccaggtacccagcgtgccaatgcccgggctccagccccttacaagcgg
gatttcgaagccaaactgaggaacttttacaggaagttagagactaaaggatatggacaa
ggcccagggaagttaaagttaattatccgaagagatcacttactagaagatgcttttaat
cagattatgggctactccagaaaagacctgcagagaaataagctatatgtcaccttcgtt
ggggaggaagggctggattacagtgggccttctagagagtttttcttcctggtatccaga
gaactctttaacccatattatggcttatttgaatattcagccaatgacacatacacagta
caaataagtcctatgtctgcttttgtagacaatcaccatgaatggttccgattcagtggt
aggatccttggtcttgcactaatacaccagtatttgttggatgccttcttcacacggccc
ttttataaggctcttctcagaattctatgtgacctgagtgacctagaataccttgatgaa
gagttccatcagagcctgcagtggatgaaagacaatgatatccatgacatcctagacctc
acgttcactgtgaacgaagaagtatttgggcagataactgaacgagaattaaagccaggg
ggtgccaatatcccagttacagagaagaacaagaaggagtacatcgagaggatggtgaag
tggaggattgagaggggtgttgtacagcaaacagagagcttagtgcgtggcttctatgag
gtggtggatgccaggctggtatctgtttttgatgcaagagaactggaattggtcatcgca
ggcacagctgaaatagacctaagtgattggagaaacaacacagaatatagaggaggagct
ctgcccgtgaatgacgtgatgcctttcctggctgttggtctctggcttcaccttctcctt
ctgttttcttgggttgacagtttgttacaggcacatccagcattccctatgaaggatttg
cttcactccgagggagtaacggcccaagaagattctgtgtggagaaatgggggaaaatca
ctgctcttcccaggaacaagtgctctgtcacatttggggactggagatgagtcctcttgg
aaggatttgggtgagcttgatgcccagggaacaacccaaccgtctttcaatcaacagttc
ttgactgccaaactttttccatttgttatgttccaagacaaagatgaacccatacatgat
cagctccacgaagacactcttaatacatcatatatgtatgcagaacatctcttagatgag
agaacaagcacatcattggcattccctctgtgccagtttgatgtttgtttgacgatggag
ctacaagagtctaaatctgcctttgaaaagctcagctcacagtgtagtagagtagactcc
aaggttcaaaaaagtgtttttaataaaatttgctcttggtttgcaattcttagttctgtg
ctaaatcttattactttaggtggcaaatcattcttgattttcctaaatccccacctgaaa
cctcctcaggcaatctgggccacagcagggagtgagtgtggcccggccagatggggagat
gtgaacactgttttggggcttctggtgctgcacacacctgcaggggaagatgacgacact
attgcccaaactcaggattctacactgcagcttgtgccgtggacactggagatgccacta
gacctgctgcttccagggcctgattgctccttccgtctcagcattcttcatggcccagac
ttactagctccagattcaaaggttcaagtgcatatgtgtgattggttgatagaacctaag
ggggagggatctggcgaacggcgatgccccagacgcggctgcagttttcaaaccgcgact
gcaagcttcggtagtcctctccgctgctgtcgccaggagtcacttcacgagaagccaggt
cacaaccgtcggcccttgtctggaaaagtaaaagtggatcctgccacgttcggagctccc
tggcgcctcgcccggctggagctagagaactcgtcctgtggcggcccccggcgtggggcg
ggacagcggccccctggagggggcagtcccgggagaacctgcggcggccggagcgcctct
cccgagtcccagctgcggcgtgggcgcgttcccatcgggactcgtggacgcgtcctgctc
ccagctgcctttcgtctagagaaagttcgtgttcatcttgtgggacgttttcagttactg
cttgggaacagtgttttaaaaccagcgagagatcaagacgggctacagctgtttccgtga