GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:40:12

Sequence gi568815591r:50492955_50803910 : 310956 bp : 45.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    316    422  107  2  2   64   55   108 0.891   3.67
 1.02 PlyA +    594    599    6                               1.05

 2.15 PlyA -   3026   3021    6                               1.05
 2.14 Term -   3979   3852  128  2  2   41   42   111 0.606   0.24
 2.13 Intr -   5517   5427   91  0  1   45   54    38 0.247  -4.33
 2.12 Intr -   6288   6194   95  1  2   26   92   106 0.549   4.48
 2.11 Intr -   7644   7533  112  0  1   71   91    45 0.746   3.05
 2.10 Intr -  10285  10020  266  2  2  134   51    94 0.430   7.53
 2.09 Intr -  34672  34660   13  1  1  103   94     7 0.158  -2.85
 2.08 Intr -  35326  35183  144  1  0  104   78    95 0.956  10.58
 2.07 Intr -  36388  36254  135  1  0   49  109   177 0.999  16.66
 2.06 Intr -  45025  44906  120  1  0  114  110    40 0.976   9.49
 2.05 Intr -  46564  46442  123  1  0  121   83    -1 0.888   3.48
 2.04 Intr -  47074  46961  114  1  0  124   68   221 0.948  24.34
 2.03 Intr -  51143  50931  213  2  0   69   98   339 0.609  31.91
 2.02 Intr -  62798  62716   83  0  2   81  103    73 0.905   7.46
 2.01 Init -  69153  69075   79  0  1   70   68    81 0.380   3.54
 2.00 Prom -  78366  78327   40                              -1.56

 3.02 PlyA -  78754  78749    6                               1.05
 3.01 Sngl -  81494  80652  843  2  0   49   43   211 0.775   8.46
 3.00 Prom -  81587  81548   40                              -7.56

 4.02 PlyA -  81756  81751    6                               1.05
 4.01 Sngl -  83000  82611  390  2  0   88   54   341 0.948  26.92
 4.00 Prom -  97930  97891   40                              -4.86

 5.11 PlyA -  99367  99362    6                               1.05
 5.10 Term - 100144  99998  147  1  0   99   47   214 0.974  16.30
 5.09 Intr - 112452 112336  117  0  0   89   86    85 0.978   9.06
 5.08 Intr - 113460 113383   78  0  0   98  103    55 0.970   7.75
 5.07 Intr - 121926 121816  111  0  0   37   75   212 0.968  15.38
 5.06 Intr - 123393 123256  138  0  0   60  105    23 0.768   1.86
 5.05 Intr - 126331 126216  116  2  2  103  101    82 0.971  11.17
 5.04 Intr - 131498 131352  147  0  0   39   21   125 0.372   1.11
 5.03 Intr - 134024 133868  157  2  1  112   63    96 0.954   9.08
 5.02 Intr - 139744 139596  149  2  2  103   35    62 0.160   2.35
 5.01 Init - 157056 156990   67  0  1   84   75    74 0.659   6.95
 5.00 Prom - 164034 163995   40                              -4.16

 6.10 PlyA - 164136 164131    6                              -0.45
 6.09 Term - 164475 164328  148  2  1   95   43    51 0.106  -1.33
 6.08 Intr - 166346 166297   50  2  2  101   89    32 0.159   2.08
 6.07 Intr - 171433 171413   21  1  0   95  113    -9 0.065   0.04
 6.06 Intr - 175287 175165  123  0  0   48  107    23 0.181   0.98
 6.05 Intr - 176909 176768  142  1  1   90  109    56 0.628   8.26
 6.04 Intr - 177083 176970  114  1  0   64   79    40 0.322   0.26
 6.03 Intr - 181704 181482  223  1  1   85   96   217 0.970  19.39
 6.02 Intr - 210954 210867   88  0  1  101   86    60 0.051   6.64
 6.01 Init - 220874 220494  381  2  0   69   40   513 0.027  41.47
 6.00 Prom - 221414 221375   40                              -5.76

 7.00 Prom + 223586 223625   40                              -7.86
 7.01 Sngl + 223647 223976  330  2  0   58   55   210 0.458  10.72
 7.02 PlyA + 224015 224020    6                               1.05

 8.15 PlyA - 224102 224097    6                               1.05
 8.14 Term - 226563 226495   69  0  0  115   54    54 0.701   2.64
 8.13 Intr - 228698 228594  105  2  0   37   35   120 0.491   2.01
 8.12 Intr - 229358 229312   47  0  2   69  111    29 0.635   1.43
 8.11 Intr - 233081 233059   23  1  2  106  111    13 0.047   2.59
 8.10 Intr - 254730 254634   97  1  1  105   53    29 0.016   0.17
 8.09 Intr - 263102 262933  170  1  2   51  115   141 0.904  12.69
 8.08 Intr - 275837 275588  250  0  1   41  115   149 0.166   9.40
 8.07 Intr - 277062 276966   97  0  1   94   76    14 0.080   0.38
 8.06 Intr - 277686 277578  109  2  1   73   60    56 0.079   1.59
 8.05 Intr - 280365 280266  100  1  1  133   76   -18 0.054   0.67
 8.04 Intr - 286208 286181   28  2  1  105   99    21 0.019   2.59
 8.03 Intr - 289643 289470  174  2  0   81   78   128 0.013  11.24
 8.02 Intr - 289885 289770  116  2  2   95  -15    74 0.007  -2.03
 8.01 Init - 307082 307040   43  2  1   89  107    44 0.340   7.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  91626  91744  119  1  2   25   44   165 0.836   4.50
S.002 Init + 105724 105793   70  0  1   90   73    65 0.852   6.41
S.003 Init + 211837 211903   67  0  1  104  113    70 0.972  10.34
S.004 Term + 211997 212073   77  0  2   57   41    98 0.846   0.00
S.005 Init - 266969 266872   98  2  2   68   92    92 0.907   5.39
S.006 Intr + 290081 290371  291  1  0  141   28   180 0.932  14.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_1|35_aa
XIEDGLRAHENPSSGQPISPALLKGGSRASFVEGI

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_1|108_bp
ngcattgaggacggcctcagggcgcacgagaacccctcctccggacagcccatcagccct
gccctgctcaagggtggttctagggcttcctttgtggaaggcatctga

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_2|571_aa
MLLKVRGWMALLGAFLALDCATSPGCGSNSSPDKVLMKIKYENQAKSLTWFLAHSPDTMN
ASEFRRRGKEMVDYMANYMEGIEGRQVYPDVEPGYLRPLIPAAAPQEPDTFEDIINDVEK
IIMPGVTHWHSPYFFAYFPTASSYPAMLADMLCGAIGCIGFSWGCCLGRGSNQPIRCVQS
LGAGGSLEKGLAPACKPPVPPYLEAASPACTELETVMMDWLGKMLELPKAFLNEKAGEGG
GVIQGSASEATLVALLAARTKVIHRLQAASPELTQAAIMEKLVAYSSDQAHSSVERAGLI
GGVKLKAIPSDGNFAMRASALQEALERDKAAGLIPFFNVRGGWECSDRSAVSGAAQVGPS
DHIPVACARSHNSYQGIGFLLSLHKPSTVLTSPNPRVTHSPYSEEWTLLSWALASLLGVS
SWGRPVLKVQMDAAVSQEEHKALGFAEFLPEESLSNHGMSMPVAQISGNKEDIWLHVDAA
YAGSAFICPEFRHLLNGVEGRKSLLPKNSALNAIEGPERQSLAVFPREGEGNEPGSPEQW
KVVIHQPNASQDAPQVPQASAASSVHIPETK

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_2|1716_bp
atgctgcttaaagtcagaggatggatggctctccttggggcatttctagctctggactgt
gccacaagccctggatgtgggagtaatagtagtcctgacaaggttcttatgaagattaaa
tacgagaaccaggcaaagagcttaacgtggttcttggcacatagcccagacaccatgaac
gcaagtgaattccgaaggagagggaaggagatggtggattacatggccaactacatggaa
ggcattgagggacgccaggtctaccctgacgtggagcccgggtacctgcggccgctgatc
cctgccgctgcccctcaggagccagacacgtttgaggacatcatcaacgacgttgagaag
ataatcatgcctggggtgacgcactggcacagcccctacttcttcgcctacttccccact
gccagctcgtacccggccatgcttgcggacatgctgtgcggggccattggctgcatcggc
ttctcctggggctgctgcctggggagaggctccaaccagcctatcagatgtgtccagagc
cttggagcaggggggtctctggaaaagggcctagctcctgcctgcaaaccaccagtccca
ccctatctggaggcggcaagcccagcatgcacagagctggagactgtgatgatggactgg
ctcgggaagatgctggaactaccaaaggcatttttgaatgagaaagctggagaaggggga
ggagtgatccagggaagtgccagtgaagccaccctggtggccctgctggccgctcggacc
aaagtgatccatcggctgcaggcagcgtccccagagctcacacaggccgctatcatggag
aagctggtggcttactcatccgatcaggcacactcctcagtggaaagagctgggttaatt
ggtggagtgaaattaaaagccatcccctcagatggcaacttcgccatgcgtgcgtctgcc
ctgcaggaagccctggagagagacaaagcggctggcctgattcctttctttaatgtgcgg
ggaggatgggaatgttctgaccgctcggctgtgagtggagctgcccaagtggggccttca
gatcacatacctgtggcctgtgccagatcccacaacagctaccagggaattggcttcctc
cttagtctgcacaagccaagcacagttctcacctctcctaacccaagagtcacccatagt
ccatactcagaggagtggaccctcctctcctgggcgttggcctcccttcttggggtcagc
agctggggaaggccggtgctgaaggtgcagatggatgcagcagtgagccaggaagagcac
aaggccttaggatttgcagagtttctccctgaggagtcactcagtaatcatgggatgtcc
atgcctgtagctcagatcagcggcaacaaggaagacatatggctgcacgttgatgcagcc
tacgcaggcagtgcattcatctgccctgagttccggcaccttctgaatggagtggagggt
aggaagtctcttctccccaagaactcagctttgaatgccatagagggtcctgagaggcag
tctttggcagtgtttccaagagaaggagaaggaaatgaacctgggtctccagagcagtgg
aaagttgtcatccaccaaccaaatgcctcccaggacgccccccaggtccctcaagcttct
gctgcctcctcggtgcacatccctgaaaccaagtag

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_3|280_aa
MGEFNTPLSTLDRSTRQKVNKDIQELNSALHQVELIDIYRTLHPKSTEYTFFSAPQCTYS
KIDHIVGSKALLSKRKRTEIITNCLSDHSAIKLELRIKKLTQNCSITWKLNNLLLNDYWV
HNEMKAEIKMFFESNENKDTAYQNLWNTFKAVCRGKLIALNAHKRKQERFKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKISESRSWFFEKINKLDRLLARLIK
EKREKNQIDAIKNDKGVSPPIPQKYKLPSENTINTSMQIN

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_3|843_bp
atgggagaatttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac
aaggatatccaggaattgaactcagctctgcaccaagtggaactaatagacatctacaga
actcttcatcccaaatcaacagaatatacattcttctcagcaccacagtgcacttattcc
aaaattgaccacatagttggaagtaaagcactcctcagcaaacgtaaaagaacagaaatt
ataacaaactgtctctcggaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaactgctcaattacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaagcaacgagaacaaagacaca
gcataccagaatctctggaacacattcaaagcagtgtgtagagggaaattaatagcacta
aatgcccacaagagaaagcaggaaagatttaaaattgacaccctaacatcacaattaaaa
gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcagtgaatcc
aggagctggttttttgaaaagatcaacaaacttgatagactgctagcaagactaataaag
gagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggtatcaccaccg
atcccacagaaatacaaactaccatcagagaatactataaacacctctatgcaaataaac
tag

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_4|129_aa
MGKKQSRKTGNSKNQSASPPPKERSSSPAMEQSWTENDFDELREEGFRRSNYSKLKEEVR
THGKEVKNLEEKLDKWLTRITNAEKSLKDLMELKTMARELHDECTSLSSQFDQPEERVSV
MEDQMNEMK

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_4|390_bp
atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct
ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgac
gagttgagagaagaaggcttcagacgatcaaactactccaagctaaaggaggaagttcga
acccatggaaaagaagttaaaaaccttgaagaaaaattagacaaatggctaactagaata
accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatggcacgagaacta
catgatgaatgcacaagcctaagtagccaatttgatcaaccggaagaaagggtatcagtg
atggaagatcaaatgaatgaaatgaaatga

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_5|408_aa
MPSDAISLLMTQIYNQDKSPLKASSPSNWPNQLGFSQVIRTALSSWYSGPQDSLQRKPRL
SGHTLLGLVSLRDVKVFSEDGTSKVVEILADMTARDLCQLLVYKSHCVDDNSWTLVEHHP
HLGLAFNYQKLISEKMQGCRVPLREPVEYLSFIPVGKGRKGRKANRGFFFIMDERCLEDH
ELVVQVESTMASESKFLFRKNYAKYEFFKNPMNFLNSSSCPEIQGFLHVKELGKKSWKKL
YVCLRRSGLYCSTKGTSKEPRHLQLLADLEDSNIFSLIAGRKQYNAPTDHGLCIKYGMLL
YQNYRIPQQRKALLSPFSTPVRSVSENSLVAMDFSGQTGRVIENPAEAQSAALEEGHAWR
CEDDGQTFFSLDDGNTKFSDLIQLVDFYQLNKGVLPCKLKHHCIRVAL

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_5|1227_bp
atgccgtcagatgccatctctctgctcatgacccagatctataaccaggacaagtcacca
ctgaaagcatccagccctagcaactggcccaaccagctaggcttctcccaagtcatccgc
actgccctgagctcctggtactcaggacctcaggactccctgcaacggaaaccaaggcta
tcaggccacacccttctgggtctggtctcactaagggatgttaaagtctttagtgaagat
gggacaagcaaagtggtggagattctagcagacatgacagccagagacctgtgccaattg
ctggtttacaaaagtcactgtgtggatgacaacagctggacactagtggagcaccacccg
cacctaggattagcattcaactatcagaagctcattagtgagaaaatgcagggctgcagg
gtgcctctcagagagcctgttgagtatctcagctttatccccgttggaaagggacgcaag
ggccgcaaggccaacaggggcttcttctttatcatggatgagaggtgcttggaagaccat
gagctggtggtccaggtggagagtaccatggccagtgagagtaaatttctattcaggaag
aattacgcaaaatacgagttctttaaaaatcccatgaattttctgaactccagtagttgt
cctgaaattcaagggtttttgcatgtgaaagagctgggaaagaaatcatggaaaaagctg
tatgtgtgtttgcggagatctggcctttattgctccaccaagggaacttcaaaggaaccc
agacacctgcagctgctggccgacctggaggacagcaacatcttctccctgatcgctggc
aggaagcagtacaacgcccctacagaccacgggctctgcataaagtatggaatgctcctt
taccagaattaccgaatccctcagcagaggaaggccttgctgtccccgttctcgacgcca
gtgcgcagtgtctccgagaactccctcgtggcaatggatttttctgggcaaacaggacgc
gtgatagagaatccggcagaggcccagagcgcagccctggaggagggccacgcctggagg
tgcgaggacgacgggcagacgttcttcagcctagatgacgggaacaccaaattctctgac
ctgatccagctggttgacttttaccagctgaacaaaggagtcctgccttgcaaactcaag
caccactgcatccgagtggccttatga

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_6|429_aa
MVEMVEVMVEVVEEMEEVEVRDRDGGGGGGDVIVEEVEVRMEEVEVTMEEVEVVVEEVEV
MGEVEVMVEEVEVMVEMVEEVEVRMEMVEEVEVIEVAEEVEVMVEIVEEMVEMVEELVEV
LEVMMEVDKVEQTPRSQQDPAGPGLPAQSDRLANHQEDDVDLEALVNDMNASLESLYSAC
SMQSDTVPLLQNGQHARSQPRASGPPRSIQPQVSPRQRVQRSQPVHILAVRPLGCLYFLA
FVKDATVNVGMPVTPRSPVSCFLGVSPQWRLQEEDQQFRTSSLPAIPNPFPELCGPGSPP
VLTPGSLPPSQAAAKQVCPLERETFFQYPLAPKCGSRSDLRPPEQAALWHYQNLEGEVLG
EERKNASYLSCPEGEKAMQVQILEGNEKQCCHARSRGNITDTRGSGKQHCFLLHLTLEPK
CDCPFDSDH

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_6|1290_bp
atggtggagatggtggaagtgatggtggaggtggtagaggagatggaggaggtggaggtg
agggatagagatggtggaggaggtggaggtgatgtgatagtagaggaggtggaggtgagg
atggaggaggtggaggtgacaatggaggaggtggaggtggtggtggaggaggtggaggtg
atgggggaggtggaggtgatggtggaggaggtggaggtgatggtggagatggtagaggag
gtggaggtgaggatggagatggtggaggaggtggaggtgatagaggtagcagaggaggtg
gaggtgatggtggagatagtagaggagatggtggagatggtagaggagttggttgaagtg
ctggaggtgatgatggaggtggacaaggtggagcagacacctcgcagtcaacaagacccg
gcaggaccaggactccccgcacagtctgaccgacttgcgaatcaccaggaggatgatgtg
gacctggaagccctggtgaacgatatgaatgcatccctggagagcctgtactcggcctgc
agcatgcagtcagacacggtgcccctcctgcagaatggccagcatgcccgcagccagcct
cgggcttcaggccctcctcggtccatccagccacaggtgtccccgaggcagagggtgcag
cgctcccagcctgtgcacatcctcgctgtcagacccttgggatgcctctactttttggct
tttgtgaaggatgccactgtgaacgtgggcatgccagtgactcctagaagccctgtgtcc
tgctttctgggtgtatcaccacagtggcgccttcaggaggaagaccagcagtttagaacc
tcatctctgccggccatccccaatccttttcctgaactctgtggccctgggagcccccct
gtgctcacgccgggttctttacctccgagccaggccgccgcaaagcaggtctgtcctctt
gagagagagacctttttccaatacccgctggctcccaagtgtgggtcccgttctgacctc
aggcccccggagcaggctgcattatggcactatcaaaacttagaaggtgaggtcctggga
gaggagagaaagaatgccagttacctgagttgtcctgagggggaaaaagccatgcaggtt
caaatcctggaagggaatgagaagcagtgttgccatgccaggagccgcgggaacatcaca
gacactcgaggaagtgggaagcaacattgctttcttcttcatttaaccttggagcctaag
tgtgactgcccttttgactctgaccattga

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_7|109_aa
MSDKERSGRYKMTKSNPTDEKCNTGDEKYVGRINQTLDTTLLSPPSIPSDQSPKTHCEAP
SPNGWRGLERSRVGAPYEDIVIETIQNGTQREKEDGKDYDKLPPNDEAR

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_7|330_bp
atgagtgataaggagagaagtggaagatataaaatgaccaaatccaaccctacagatgaa
aaatgcaacacaggagatgaaaaatatgtgggcaggattaaccagacattggacaccacg
cttctctcccctccctccatccccagtgaccaatctccaaagacacactgtgaggctccc
tcgcccaacggctggcgtggtctggagaggtctagggtaggagcaccctatgaagacata
gtaatagaaactatccagaatggcacacagagagaaaaggaagatggaaaagattacgat
aaactacctccaaatgatgaggccagatga

>gi568815591r:50492955_50803910|GENSCAN_predicted_peptide_8|475_aa
MALRMTWTEILDKAGAPSPSEAEYCSRAAAGSAEGPPGTCVVMALGAPGHARAAPGRGGA
VEDQPGPAPSAVHAERCPRAGHCGGRAPKPYPGRRRTRSDPVRPVLPPALQKFSEEDTLP
AVLIGVTSFGCVPNLILNCSSHNFHMLWEGPGGRKLRCAFATECKGQGHIVSYYLEDNFG
FTTPGGLAIGCLLSQCDVSCVGCPGIYDRFYPQVTFTGGKTEDTCEQAPPESLHAAFAEL
IKAMVGHACHKLLLLRSSFYDLDQWVKFGCTFGILHSQTYRNQIREHVGGGSVSDNAKIE
NIVMTCFGADHNAEQEAAAAGPVTGSSVTSSPVTGSQVLTWVLSVKPGWPPQQTVLQSQL
LTGGLFSAEKLGSHGAALLHACMPFLSALEQSPQQKPGLADIGSRFGSWNDPCRKSLNVR
TPNSQAAEPYEGQRQELTDRGLGQQSKKVVAGGAPVYDVPLPVSTCSHCSTAAYE

>gi568815591r:50492955_50803910|GENSCAN_predicted_CDS_8|1428_bp
atggccctgagaatgacgtggacagaaatcctggacaaggctggagcgcccagtccctcg
gaggctgagtattgcagccgggcggcagccggctccgcggaggggcccccgggcacctgc
gtggtgatggcgctgggagcccccgggcacgctcgggcggcccctggccggggcggagcc
gtggaggaccagcccggcccggctccgagcgctgtccatgcggagcgctgtccacgcgcc
gggcactgcgggggccgggccccgaagccctacccgggccggcggcgcacacgcagcgac
cccgtgcggccagtgctgccgcccgctctccagaaattctctgaagaggacactttacct
gctgtcctaataggtgttacatcgtttggctgtgtcccaaatctcatcttgaattgtagc
tcccataatttccacatgctatgggagggacctggtgggagaaaactccgctgtgcattc
gcgacggaatgcaagggacaaggacacatcgtatcttactatcttgaagataattttggc
tttaccactcctggaggtcttgccatagggtgtttgttgagccagtgtgatgtgtcctgt
gttggctgcccaggtatatacgaccgtttctatcctcaggtcacctttactggggggaag
actgaagacacttgtgaacaagctcctccagaatccttgcatgcagcttttgcagaactg
ataaaagcaatggtcggtcatgcttgccacaagctgcttctgttgagatcatcattctac
gatcttgaccaatgggtgaagtttggttgcacttttggcatcttgcattcacagacctac
agaaaccagattagggagcatgttggaggtggctctgtaagtgacaatgctaagattgag
aacatagtgatgacatgctttggcgctgaccacaatgctgagcaggaagcagcagctgca
ggcccagtgactggtagctcagtgaccagcagcccagtgaccggcagccaggtcctcacc
tgggtcctctcagtgaagccagggtggccgccccagcagacagtgctacagagccaactc
ctgacaggcggcctgttttcagcagagaagctggggtcccatggagctgccctgctccat
gcttgcatgccatttctctcagcgttggagcagagccctcaacagaaaccagggcttgca
gatattggaagccgctttggatcttggaatgacccttgcaggaagtccttgaacgtcaga
acacctaattctcaagctgcagagccctacgaggggcagagacaagagctaacagaccgg
ggcttgggccagcagagcaagaaggtggtagcaggtggggccccagtgtatgatgtgccc
ctccctgtgtccacgtgttctcattgttcaactgccgcttatgagtga