GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:00:48 Sequence gi568815588f:69080292_69309032 : 228741 bp : 43.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7870 7945 76 0 1 80 116 130 0.801 14.25 1.02 Intr + 16793 16940 148 0 1 53 94 60 0.302 2.39 1.03 Intr + 23580 23753 174 0 0 72 -12 162 0.024 3.65 1.04 Intr + 30436 30580 145 1 1 -3 74 114 0.013 1.18 1.05 Intr + 32150 32257 108 1 0 75 32 90 0.415 2.58 1.06 Intr + 34373 34502 130 1 1 74 39 84 0.593 2.37 1.07 Intr + 34541 34678 138 0 0 -18 18 220 0.474 4.84 1.08 Term + 35063 35268 206 0 2 31 39 169 0.555 3.83 1.09 PlyA + 35391 35396 6 1.05 2.00 Prom + 45702 45741 40 -1.96 2.01 Init + 52679 52756 78 1 0 69 94 71 0.297 6.86 2.02 Intr + 75521 75596 76 1 1 73 97 5 0.317 -1.01 2.03 Intr + 76716 76872 157 1 1 54 64 177 0.996 10.97 2.04 Intr + 77756 77920 165 2 0 117 106 105 0.893 14.28 2.05 Intr + 82115 82221 107 2 2 84 111 16 0.544 3.36 2.06 Intr + 85751 85819 69 0 0 55 87 84 0.528 4.15 2.07 Intr + 88198 88340 143 2 2 78 80 34 0.195 1.67 2.08 Term + 90865 90978 114 0 0 86 43 22 0.166 -3.93 2.09 PlyA + 91411 91416 6 1.05 3.00 Prom + 97308 97347 40 -3.96 3.01 Init + 100001 100271 271 1 1 57 70 218 0.221 12.03 3.02 Intr + 105696 105773 78 1 0 77 91 75 0.261 6.22 3.03 Intr + 118813 118906 94 2 1 90 78 78 0.954 6.02 3.04 Intr + 119989 120208 220 1 1 76 90 40 0.723 1.30 3.05 Intr + 122148 122228 81 2 0 61 87 63 0.906 3.33 3.06 Intr + 122576 122752 177 1 0 84 115 24 0.855 4.82 3.07 Intr + 127502 127650 149 1 2 109 76 119 0.997 11.83 3.08 Term + 128309 128744 436 2 1 87 29 242 0.999 12.96 3.09 PlyA + 128784 128789 6 1.05 4.00 Prom + 136946 136985 40 -6.66 4.01 Init + 140145 140207 63 2 0 52 116 113 0.934 11.65 4.02 Intr + 146916 147078 163 2 1 99 117 222 0.994 25.75 4.03 Intr + 152473 152621 149 2 2 60 67 208 0.969 15.85 4.04 Intr + 152723 152842 120 1 0 105 94 3 0.812 3.29 4.05 Intr + 158751 158846 96 2 0 52 100 25 0.263 0.31 4.06 Intr + 160361 160460 100 1 1 98 18 274 0.817 21.08 4.07 Intr + 162891 163074 184 1 1 80 74 301 0.856 26.75 4.08 Intr + 165788 165943 156 2 0 57 74 158 0.976 10.43 4.09 Intr + 167069 167302 234 2 0 67 70 344 0.983 27.30 4.10 Intr + 168133 168437 305 1 2 78 80 567 0.990 51.13 4.11 Intr + 169999 170144 146 2 2 88 107 235 0.913 25.40 4.12 Intr + 170242 170361 120 0 0 108 116 107 0.997 16.19 4.13 Intr + 176745 176840 96 2 0 115 24 98 0.987 6.31 4.14 Intr + 177036 177135 100 2 1 111 96 152 0.999 17.98 4.15 Intr + 178485 178668 184 1 1 61 57 251 0.966 18.15 4.16 Intr + 178772 178836 65 2 2 44 103 84 0.976 3.86 4.17 Intr + 180814 181003 190 2 1 109 55 286 0.890 26.04 4.18 Intr + 185294 185527 234 2 0 98 70 239 0.895 19.90 4.19 Intr + 186319 186438 120 1 0 98 16 107 0.020 4.11 4.20 Intr + 198109 198232 124 1 1 101 71 31 0.052 3.29 4.21 Intr + 208380 208479 100 2 1 100 111 46 0.447 7.78 4.22 Term + 215026 215132 107 2 2 83 42 52 0.072 -1.33 4.23 PlyA + 216170 216175 6 1.05 5.02 PlyA - 217264 217259 6 1.05 5.01 Term - 220525 220218 308 2 2 70 48 201 0.675 9.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 23580 23829 250 0 1 72 42 170 0.959 5.98 S.002 Init + 30515 30580 66 1 0 63 74 67 0.810 3.87 S.003 Term + 186319 186466 148 1 1 98 49 141 0.877 8.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:69080292_69309032|GENSCAN_predicted_peptide_1|374_aa MQKLLKCSRLVLALALILVLESSVQGYPTRRARYQWVRCNPDSNSANCLEEKGPMFELLP GESNKIPRLRTDLFPKTRIQDLNRIFPLSEDYSGSGFGSGSGSGSGSGSGFLTEMEQDYQ LVDESDAFHDNLRKIGRLGTKQLRVKACGWTNERRQSVKMYVSNVNAHQKTSITEEALNN QERLADGCKFLDAESQFIQGHAPFSWTAHSQRLIYVGTTEIMLRQKQIQAIFLFEFKTGL KAAETTRNINNPFGAGIANKHESLEDEKRSGRPWEFDNDQLRAIIEADPLTTTQEVAEKL NIDHSTAGIGPQKGPVLLHKNAQVQVVEPMLQKLNELGYEVLPHPPYLPDFLPTDYHSFK HLGNFLQGKRFHNQ >gi568815588f:69080292_69309032|GENSCAN_predicted_CDS_1|1125_bp atgcagaagctactcaaatgcagtcggcttgtcctggctcttgccctcatcctggttctg gaatcctcagttcaaggttatcctacgcggagagccaggtaccaatgggtgcgctgcaat ccagacagtaattctgcaaactgccttgaagaaaaaggaccaatgttcgaactacttcca ggtgaatccaacaagatcccccgtctgaggactgacctttttccaaagacgagaatccag gacttgaatcgtatcttcccactttctgaggactactctggatcaggcttcggctccggc tccggctctggatcaggatctgggagtggcttcctaacggaaatggaacaggattaccaa ctagtagacgaaagtgatgctttccatgacaaccttagaaagattggaagattgggcacg aagcagctcagggtaaaggcatgtggatggaccaatgagagaagacaaagtgtgaagatg tacgtatcaaatgtgaatgcccaccagaaaacatccatcacagaagaggcactaaacaac caggaaaggttggctgacggctgcaagttcttagatgcagagagccagttcatccaaggt catgctcccttctcatggacagctcacagtcaaagactcatctatgtggggactacggaa ataatgttaagacaaaagcaaattcaagctattttcttattcgagttcaaaacaggtctt aaagcagcagagacaactcgcaacatcaacaacccatttggcgcaggaattgctaacaaa catgagagccttgaagatgagaagcgtagtggccggccatgggaatttgacaatgaccaa ttgagagcaatcattgaagctgatcctcttacaactacacaagaagttgctgaaaaactc aacatcgaccattctacagccggcataggtccacagaaaggcccagttctgctccataag aacgcccaagtgcaggtcgtagaaccaatgcttcaaaagttgaatgaattgggctacgaa gttttgcctcatccgccatatttacctgacttcttgccaactgactaccactccttcaag catctcggcaactttttgcagggaaaacgcttccacaaccagtag >gi568815588f:69080292_69309032|GENSCAN_predicted_peptide_2|302_aa MAEMKTEDGKVEKHYLFYDGESVSGKVNLAFKQPGKRLEHQGIRIEFVGQIELFNDKSNT HEFVNLVKELALPGELTQSRSYDFEFMQVEKPYESYIGANVRLRYFLKVTIVRRLTDLVK EYDLIVHQLATYPDVNNSIKMEVGIEDCLHIEFEYNKSKYHLKDVIVGKIYFLLVRIKIQ HMELQLIKKEITGIGPSTTTETETIAKYEIMDGAPVKGESIPIRLFLAGYDPTPTMRDVN KKFSVRYFLNLVLVDEEDRRYFKQQEIILWRKAPEKLRKQRTNFHQRFESPESQASAEQP EM >gi568815588f:69080292_69309032|GENSCAN_predicted_CDS_2|909_bp atggcagaaatgaaaactgaagatggcaaagtagaaaaacactatctcttctatgacgga gaatccgtttcaggaaaggtaaacctagcctttaagcaacctggaaagaggctagaacac caaggaattagaattgaatttgtaggtcaaattgaacttttcaatgacaagagtaatact catgaatttgtaaacctagtgaaagaactagccttacctggagaactgactcagagcaga agttatgattttgaatttatgcaagttgaaaagccatatgaatcttacatcggtgccaat gtccgcttgaggtattttcttaaagtgacaatagtgagaagactgacagatttggtaaaa gagtatgatcttattgttcaccagcttgccacctatcctgatgttaacaactctattaag atggaagtgggcattgaagattgtctacatatagaatttgaatataataaatcaaagtat catttaaaggatgtgattgttggaaaaatttacttcttattagtaagaataaaaatacaa catatggagttacagctgatcaaaaaagagatcacaggaattggacccagtaccacaaca gaaacagaaacaatcgccaaatatgaaataatggatggtgcaccagtaaaaggtgaatca attccaataaggctatttttagcaggatatgacccaactccaacaatgagagatgtgaac aaaaaattttcagtaaggtactttttgaatttagtgcttgttgatgaggaagaccggagg tacttcaaacagcaggagataattttatggagaaaagctcctgaaaaactgaggaaacag agaacaaactttcaccagcgatttgaatctccagaatcacaggcatctgccgaacagcct gaaatgtga >gi568815588f:69080292_69309032|GENSCAN_predicted_peptide_3|501_aa MSFSRALLWARLPAGRQAGHRAAICSALRPHFGPFPGVLGQVSVLATASSSASGGSKIPN TSLFVPLTVKPQGPSADGDVGAELTRPLDKNEVKKVLDKFYKRKEIQKLGADYGLDGTKL AQAKKFNDPNDPCKILVATDAIGMGLNLSIRRIIFYSLIKPSINEKGERELEPITTSQAL QIAGRAGRFSSRFKEGEVTTMNHEDLSLLKEILKRPVDPIRAAGLHPTAEQIEMFAYHLP DATLSNLIDIFVDFSQVDGQYFVCNMDDFKFSAELIQHIPLSLRVRYVFCTAPINKKQPF VCSSLLQFARQYSRNEPLTFAWLRRYIKWPLLPPKNIKDLMDLEAVHDVLDLYLWLSYRF MDMFPDASLIRDLQKELDGIIQDGVHNITKLIKMSETHKLLNLEGFPSGSQSRLSGTLKS QARRTRGTKALGSKATEPPSPDAGELSLASRLVQQGLLTPDMLKQLEKEWMTQQTEHNKE KTESGTHPKGTRRKKKEPDSD >gi568815588f:69080292_69309032|GENSCAN_predicted_CDS_3|1506_bp atgtccttctcccgtgccctattgtgggctcggctcccggcggggcgccaggctggccac cgggcagccatctgctctgcccttcgtccccactttgggccctttcccggggttctgggg caagtttctgtccttgccaccgcctcctcctctgcctccggtggctccaaaataccaaac acgtccttgttcgtgcccctgactgtgaaacctcagggccccagcgccgacggcgacgtc ggggccgagctaacccggcctctggacaagaatgaagtaaagaaggtcttagacaaattt tacaagaggaaagaaattcagaaactgggtgctgattatggacttgatgggaccaaactt gctcaagcaaaaaagtttaatgatcccaatgacccatgcaaaatcttggttgctacagat gcaattggcatgggacttaatttgagcataaggagaattattttttactcccttataaag cccagtatcaatgaaaagggagagagagaactagaaccaatcacaacctctcaagccctg cagattgctggcagagctggcagattcagctcacggtttaaagaaggagaggttacaaca atgaatcatgaagatctcagtttattaaaggaaattttgaagaggcctgtggatcctata agggcagctggtcttcatccaactgctgagcagattgaaatgtttgcctaccatctccct gatgcaacactgtccaatctcattgatatttttgtagacttttcacaagttgatgggcag tattttgtctgcaatatggatgattttaaattttctgcagagttgatccagcatattcca ctaagtctgcgagtgaggtatgttttctgcacagctcctatcaacaagaagcagcctttt gtgtgttcttcactgttacagtttgccaggcagtatagcaggaatgagcccctgaccttt gcatggttacgccgatacatcaaatggcctttacttccacctaagaatattaaagacctc atggatcttgaagctgtccacgatgtcttggatctttacttgtggctaagctaccgattt atggatatgtttccagatgccagccttattcgagatctccagaaagaactagatggtatt atccaagatggtgtgcacaatatcactaaattgattaaaatgtctgagacgcataagctg ttgaatttggagggctttccatcagggagccagtcacgattgtcaggaaccttaaagagc caagctagaaggacacgcggcaccaaagctctagggagtaaagctactgagccacccagc cccgatgcaggagagctgtcccttgcttccagattggtgcagcaaggactcctcactcca gacatgctgaaacagctagaaaaagagtggatgacacaacaaactgaacacaacaaagaa aaaacagagtctgggactcatccaaaagggacgagaagaaagaagaaggaacctgattcg gactag >gi568815588f:69080292_69309032|GENSCAN_predicted_peptide_4|1051_aa MFAVHLMAFYFSKLKEDQIKKVDRFLYHMRLSDDTLLDIMRRFRAEMEKGLAKDTNPTAA VKMLPTFVRAIPDGSENGEFLSLDLGGSKFRVLKVQVAEEGKRHVQMESQFYPTPNEIIR GNGTELFEYVADCLADFMKTKDLKHKKLPLGLTFSFPCRQTKLEEGVLLSWTKKFKARGV QDTDVVSRLTKAMRRHKDMDVDILALVNDTVGTMMTCAYDDPYCEVGVIIGTGTNACYME DMSNIDLVEGDEGRMCINTEWGAFGDDGALEDIRTEFDRELDLGSLNPGKQLFEKMISGL YLGELVRLILLKMAKAGLLFGGEKSSALHTKGKIETRHVAAMEKYKEGLANTREILVDLG LEPSEADCIAVQHVCTIVSFRSANLCAAALAAILTRLRENKKVERLRTTVGMDGTLYKIH PQYPKRLHKVVRKLVPSCDVRFLLSESGSTKGAAMVTAVASRVQAQRKQIDRVLALFQLT REQLVDVQAKMRAELEYGLKKKSHGLATVRMLPTYVCGLPDGTEKGKFLALDLGGTNFRV LLVKIRSGRRSVRMYNKIFAIPLEIMQGTGEELFDHIVQCIADFLDYMGLKGASLPLGFT FSFPCRQMSIDKGTLIGWTKGFKATDCEGEDVVDMLREAIKRRNEFDLDIVAVVNDTVGT MMTCGYEDPNCEIGLIAGTGSNMCYMEDMRNIEMVEGGEGKMCINTEWGGFGDNGCIDDI WTRYDTEVDEGSLNPGKQSCRMSVKYPIGTKEKYDDKFATGLPQPYPSSPNRYEKMTSGM YLGEIVRQILIDLTKQGLLFRGQISERLRTRGIFETKFLSQIESDRLALLQVRRILQQLG LDSTCEDSIVVKEVCGAVSRRAAQLCGAGLAAIVEKRREDQGLEHLRITVGVDGTLYKLH PHFSRILQETVKELAPRCDVTFMLSEDGSGKGAALITAVAKSCYMESTMDQIGNPSLLSK AAKAMWSGSGPQIPVLISESAFQAFKTQLLRVEKQKKGPEVSKCPPHNGADLPARIVDCG LLEGRTTVVFHLGLSSFQHSAWYICKNLVFV >gi568815588f:69080292_69309032|GENSCAN_predicted_CDS_4|3156_bp atgtttgcggtccacttgatggcattttacttcagcaagctgaaggaggaccagatcaag aaggtggacaggttcctgtatcacatgcggctctccgatgacacccttttggacatcatg aggcggttccgggctgagatggagaagggcctggcaaaggacaccaaccccacggctgca gtgaagatgttgcccaccttcgtcagggccattcccgatggttccgaaaatggggagttc ctttccctggatctcggagggtccaagttccgagtgctgaaggtgcaagtcgctgaagag gggaagcgacacgtgcagatggagagtcagttctacccaacgcccaatgaaatcatccgc gggaacggcacagagctgtttgaatatgtagctgactgtctggcagatttcatgaagacc aaagatttaaagcataagaaattgccccttggcctaactttttctttcccctgtcgacag actaaactggaagagggtgtcctactttcgtggacaaaaaagtttaaggcacgaggagtt caggacacggatgtggtgagccgtctgaccaaagccatgagaagacacaaggacatggac gtggacatcctggccctggtcaatgacaccgtggggaccatgatgacctgtgcctatgac gacccctactgcgaagttggtgtcatcatcggaactggcaccaatgcgtgttacatggag gacatgagcaacattgacctggtggagggcgacgagggcaggatgtgcatcaacacagag tggggggccttcggggacgacggggccctggaggacattcgcactgagttcgacagggag ctggacctcggctctctcaacccaggaaagcaactgttcgagaagatgatcagtggcctg tacctgggggagcttgtcaggcttatcttgctgaagatggccaaggctggcctcctgttt ggtggtgagaaatcttctgctctccacactaagggcaagatcgaaacacggcacgtggct gccatggagaagtataaagaaggccttgctaatacaagagagatcctggtggacctgggt ctggaaccgtctgaggctgactgcattgccgtccagcatgtctgtaccatcgtctccttc cgctcggccaatctctgtgcagcagctctggcggccatcctgacacgcctccgggagaac aagaaggtggaacggctccggaccacagtgggcatggacggcaccctctacaagatacac cctcagtacccaaaacgcctgcacaaggtggtgaggaaactggtcccaagctgtgatgtc cgcttcctcctgtcagagagtggcagcaccaagggggccgccatggtgaccgcggtggcc tcccgcgtgcaggcccagcggaagcagatcgacagggtgctggctttgttccagctgacc cgagagcagctcgtggacgtgcaggccaagatgcgggctgagctggagtatgggctgaag aagaagagccacgggctggccacggtcaggatgctgcccacctacgtctgcgggctgccg gacggcacagagaaaggaaagtttctcgccctggatcttgggggaaccaacttccgggtc ctcctggtgaagatcagaagtggacggaggtcagtgcgaatgtacaacaagatcttcgcc atccccctggagatcatgcagggcactggtgaggagctctttgatcacattgtgcagtgc atcgccgacttcctggactacatgggcctcaagggagcctccctacctttgggcttcaca ttctcatttccctgcaggcagatgagcattgacaagggaacactcatagggtggaccaaa ggtttcaaggccactgactgtgaaggggaggacgtggtggacatgctcagggaagccatc aagaggagaaacgagtttgacctggacattgttgcagtcgtgaatgatacagtggggacc atgatgacctgtggctatgaagatcctaattgtgagattggcctgattgcaggaacaggc agcaacatgtgctacatggaggacatgaggaacatcgagatggtggaggggggtgaaggg aagatgtgcatcaatacagagtggggaggatttggagacaatggctgcatagatgacatc tggacccgatacgacacggaggtggatgaggggtccttgaatcctggcaagcagagctgt cgaatgtcagtgaaatatcctattggaacaaaagaaaaatatgatgacaagtttgcaaca ggtctgccccaaccttatccttcttctccaaacagatacgagaaaatgaccagtgggatg tacttgggggagattgtgcggcagatcctgatcgacctgaccaagcagggtctcctcttc cgagggcagatttcagagcgtctccggaccaggggcatcttcgaaaccaagttcctgtcc cagatcgaaagcgatcggctggcccttctccaggtcaggaggattctgcagcagctgggc ctggacagcacgtgtgaggacagcatcgtggtgaaggaggtgtgcggagccgtgtcccgg cgggcggcccagctctgcggtgctggcctggccgctatagtggaaaaaaggagagaagac caggggctagagcacctgaggatcactgtgggtgtggacggcaccctgtacaagctgcac cctcacttttctagaatattgcaggaaactgtgaaggaactagcccctcgatgtgatgtg acattcatgctgtcagaagatggcagtggaaaaggggcagcactgatcactgctgtggcc aagagctgctacatggaaagtactatggatcaaattggaaatccatctctcctctcaaag gcagctaaagctatgtggtcaggctctgggccccagattccagtcctgatttctgagtct gctttccaggcgttcaagacccagctgttgagagtagaaaagcagaagaaaggacccgag gtcagcaagtgccctccccacaatggggcagatctgccagcgagaatcgtggattgtggg ctccttgagggccgaaccacggttgtctttcatcttggactctccagcttccaacacagt gcctggtacatttgtaagaacctagtgtttgtgtga >gi568815588f:69080292_69309032|GENSCAN_predicted_peptide_5|102_aa XVHINVLADALKSINNAEKRDKHQVLRPCSKVIVWFLTVMMKHGYIGEFEIMDDHRARQI VVNLTGSLNKCGMISPRFDVQLKDLEKCRIICFHLASLISLY >gi568815588f:69080292_69309032|GENSCAN_predicted_CDS_5|309_bp ntggtgcacattaatgtcctggctgatgctctcaagagcatcaacaatgctgaaaagaga gacaaacaccaggttcttaggccatgctccaaagtcatcgtctggtttctcactgtgatg atgaagcatggttatattggcgaatttgaaatcatggatgatcacagagctaggcaaatt gttgtgaacctcacaggcagtctaaacaagtgtggcatgatcagccccagatttgatgtg caactcaaagatctagagaaatgcagaataatctgcttccatctggccagtttgatttca ttgtactga