GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:04:25 Sequence gi568815596r:112675070_112883770 : 208701 bp : 43.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8939 9071 133 2 1 94 81 57 0.643 6.25 1.02 Term + 13803 14006 204 2 0 20 48 183 0.954 4.87 1.03 PlyA + 14848 14853 6 1.05 2.04 PlyA - 16855 16850 6 1.05 2.03 Term - 18652 18560 93 1 0 98 43 73 0.946 1.63 2.02 Intr - 18824 18729 96 2 0 44 85 96 0.453 5.11 2.01 Init - 20377 20345 33 1 0 48 93 79 0.889 2.29 2.00 Prom - 24629 24590 40 -4.66 3.03 PlyA - 26274 26269 6 1.05 3.02 Term - 33013 32902 112 0 1 89 53 69 0.822 1.53 3.01 Init - 36347 36244 104 2 2 65 91 88 0.876 6.51 3.00 Prom - 41762 41723 40 -2.46 4.00 Prom + 42057 42096 40 -6.46 4.01 Init + 42831 42922 92 2 2 84 119 25 0.869 5.17 4.02 Intr + 46749 46822 74 0 2 137 40 139 0.994 13.05 4.03 Intr + 46917 47034 118 1 1 68 89 282 0.905 25.82 4.04 Intr + 47114 47209 96 2 0 83 55 227 0.841 18.02 4.05 Intr + 47429 47520 92 2 2 83 68 101 0.998 7.24 4.06 Intr + 47645 47702 58 0 1 111 40 99 0.779 5.34 4.07 Intr + 48012 48105 94 0 1 86 109 163 0.998 18.17 4.08 Intr + 48349 48399 51 0 0 82 76 86 0.981 5.90 4.09 Intr + 48650 48733 84 1 0 61 82 95 0.977 6.22 4.10 Intr + 49025 49057 33 1 0 86 105 48 0.893 4.72 4.11 Intr + 49712 49837 126 1 0 44 78 67 0.658 2.18 4.12 Intr + 50105 50171 67 1 1 94 66 51 0.915 1.98 4.13 Intr + 50313 50483 171 1 0 39 84 230 0.995 17.61 4.14 Intr + 51169 51220 52 2 1 82 44 60 0.884 -1.03 4.15 Intr + 51609 51669 61 0 1 70 75 72 0.813 2.84 4.16 Term + 52730 52942 213 1 0 45 47 334 0.837 22.23 4.17 PlyA + 55718 55723 6 1.05 5.15 PlyA - 56337 56332 6 1.05 5.14 Term - 63979 63754 226 0 1 102 28 120 0.879 3.65 5.13 Intr - 65938 65749 190 2 1 53 87 98 0.885 4.84 5.12 Intr - 67700 67637 64 2 1 97 74 30 0.984 0.79 5.11 Intr - 71506 71351 156 1 0 84 79 108 0.987 9.71 5.10 Intr - 77405 77198 208 1 1 67 98 119 0.982 9.88 5.09 Intr - 82145 80908 1238 2 2 103 75 433 0.967 30.80 5.08 Intr - 85695 85644 52 2 1 95 84 27 0.841 1.91 5.07 Intr - 89613 89493 121 1 1 65 109 136 0.837 12.95 5.06 Intr - 100198 100168 31 1 1 94 72 33 0.335 0.00 5.05 Intr - 103042 102918 125 2 2 75 86 93 0.998 8.10 5.04 Intr - 104597 104427 171 0 0 59 88 213 0.989 18.31 5.03 Intr - 106757 106535 223 2 1 123 116 286 0.999 32.60 5.02 Intr - 107758 107647 112 0 1 25 115 70 0.750 3.78 5.01 Init - 108701 108655 47 2 2 101 110 56 0.966 9.25 5.00 Prom - 117509 117470 40 -4.06 6.00 Prom + 121845 121884 40 1.34 6.01 Sngl + 123166 123453 288 0 0 87 34 159 0.760 6.05 6.02 PlyA + 123834 123839 6 1.05 7.04 PlyA - 123869 123864 6 1.05 7.03 Term - 126237 126112 126 0 0 -12 42 157 0.125 -0.62 7.02 Intr - 128102 127955 148 1 1 34 15 109 0.043 -1.56 7.01 Init - 139501 139356 146 1 2 55 81 108 0.654 6.39 7.00 Prom - 142568 142529 40 -4.56 8.08 PlyA - 142832 142827 6 1.05 8.07 Term - 155504 155292 213 2 0 106 43 206 0.996 15.13 8.06 Intr - 156353 156223 131 0 2 105 107 166 0.999 20.61 8.05 Intr - 157757 157593 165 0 0 61 94 173 0.999 15.13 8.04 Intr - 158506 158305 202 1 1 125 63 298 0.966 29.86 8.03 Intr - 160548 160497 52 2 1 83 116 67 0.999 7.91 8.02 Intr - 161175 161114 62 0 2 93 82 88 0.220 6.23 8.01 Init - 167497 167417 81 1 0 80 78 30 0.070 2.17 8.00 Prom - 191116 191077 40 -3.46 9.00 Prom + 191714 191753 40 -2.76 9.01 Init + 202716 202804 89 2 2 78 73 98 0.435 7.42 9.02 Term + 206689 206707 19 1 1 123 44 -3 0.129 -3.41 9.03 PlyA + 207432 207437 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_1|112_aa XSFPPGLPDDPGDRLPGPIRRPFSYQPQAAMRLLVPPNVQLPDSKWGALAGGIVWRWKVD LPTAIVGVPSLSLLFMMLLICLPVGMNIKFRKQNEASGQPTPLCPPDEDALS >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_1|339_bp nnctccttcccgccaggtttaccagatgaccctggagataggcttccaggtcctatcagg cgccctttctcctatcagcctcaggcagctatgcgcctgctggtcccacccaatgtgcaa ctccctgactcgaagtggggggcactcgccggaggcatcgtctggaggtggaaagtggat ctgccgacagccatcgtgggtgttccgtccttgagtctgcttttcatgatgttactcatc tgtttgcctgtggggatgaacatcaaattccgaaaacaaaatgaagccagtggccagccc acaccgctgtgccctcctgatgaagatgccctgagctag >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_2|73_aa MTAPLHSAWATQLQMYSNGHLCACSAFLPRFHEDMAGGFGDREPHFRVIPSLAHFDPFSR QQERPDVDERIGV >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_2|222_bp atgacggctccgctgcactcggcctgggccacgcaactacagatgtacagcaacgggcac ctctgcgcctgctccgctttcctgcctcgtttccatgaagacatggctgggggctttggg gacagggagcctcacttccgggtcattccttccctcgcccactttgacccattctccaga cagcaggagagacctgatgtggatgagagaattggtgtttag >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_3|71_aa MRDVASASMAAKDRAPGPLERDRKRGKNGAPGTTSAVITAQLLYQRTADPDQTEDTASKE PDPQSRLVLCD >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_3|216_bp atgagagatgttgcatcggcatctatggcagctaaagatcgagcaccaggacccttggag agggacagaaagagaggcaagaacggagccccgggaacaaccagtgctgtgattacagct cagcttctttatcaacggacagctgatccagatcaaacagaggacactgcgagcaaggaa ccagatcctcagagccgtctggtgctttgtgactga >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_4|493_aa MGRGGFAYRLRIRMLAPSCLGSVPLCHLSAWIFVNRSLALGKIRCFGFDMDYTLAAYKSP AYEALTFELLLERLVCIGYPHEILRYTYDPTFPTRRLVFDELYGNLLKVDAHGNVLLGAY GFTFLSEAEIWSFYPSKFIQRDDLQCFYILNMLFNLPETYLYACLVDFFSGCSRYTNCDT GYQHGNLFMSFRSLFQDVTDAMNNIHQSGCLKKTLEDLEKYVKKDPRLPILLGKMKEVGK VFLATNSSYNYTNAIMTYLFSISEAEASGRPWRSYFDLIVVDTQKPHFFAEGLVLRQVNT VMAGAEDSGKLHVGTYTGPHQHCAVYSGGSSDMVCELLGVRGMDILYIGDHIFGDILKSK KRQGWRTCLVVPELSWELDIWAQEKERLEELKRLDTHLADIYQHMDGSSCELQVINFTKR EIQRVTQELDLCYSTMGSLFRCGFRQTLFSSQLMRYADLYTAACLDLLYFRVSSALSGGP GVGGSLRNLHGCL >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_4|1482_bp atgggcagaggcggttttgcctaccgattaagaatacggatgctggcacccagctgcctg ggctcagtcccactctgccatttatcagcctggatttttgtcaaccgcagcctggcgctg gggaagattcgttgctttggcttcgacatggactacactctggctgcctacaagtcccca gcttatgaggccctgaccttcgagctgctgctggagcgcctggtgtgcattgggtacccg catgagatcctgcgctacacctacgaccccaccttccccaccaggcggctggtgttcgat gaactctatgggaacctgctgaaggtggacgcccacgggaatgtgctgctgggtgcctat ggcttcaccttcctctcggaggcagagatctggagcttctaccccagcaagttcattcag agggacgacctgcagtgtttctacatactcaacatgctcttcaacctgcctgaaacctac ctctatgcctgcttggtggacttcttctctggctgctcccgttacactaattgtgacacc ggctatcagcatgggaacctcttcatgtccttccgaagcctcttccaggatgtgactgat gccatgaataacatccaccagtcgggctgtctcaagaagaccctggaggacttggagaaa tatgtgaagaaggatccacgcctccccatcctgctggggaagatgaaggaggttgggaaa gtgtttctggccaccaacagcagctacaactacaccaatgccatcatgacctacctgttc agcatcagtgaggctgaagcctcgggcaggccctggaggtcctactttgacctgatcgtg gtggacacgcagaagccccacttctttgcagaggggttggtcctgaggcaggtcaacacg gtaatggcaggtgcagaggactcaggaaagctccacgtgggcacctacacagggccccac cagcactgtgctgtctactctggaggctcttcggacatggtgtgcgagctgcttggggtt cgggggatggacatcctgtacattggggaccacatttttggggacattctcaagtccaag aagcgtcagggctggcggacttgcctggtggttcctgagctgtcctgggagctggacatc tgggcccaggagaaggagcggttggaggagctgaagagactggacacgcacctggcagac atataccagcacatggatgggagcagttgtgagctgcaagtcatcaacttcaccaagaga gagatccagagggtcacccaggagctggacctgtgctacagcaccatgggcagcttgttc cgctgcggtttccgccagacactcttctccagccagctgatgcgctatgccgacctctac actgccgcctgcctcgacctcctgtacttccgagtgagctcagctctctcgggcggcccc ggagttggtgggtccctgcggaatctccacggctgtttgtga >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_5|987_aa MAKVPDMFEDLKNCYSTLARDHTEITTITFSSSYFDSENEEDSSSIDHLSLNQKSFYHVS YGPLHEGCMDQSVSLSISETSKTSKLTFKESMVVVATNGKVLKKRRLSLSQSITDDDLEA IANDSEEEIIKPRSAPFSFLSNVKYNFMRIIKYEFILNDALNQSIIRANDQYLTAAALHN LDEAVKFDMGAYKSSKDDAKITVILRISKTQLYVTAQDEDQPVLLKEMPEIPKTITGANE DAWTRGTPVLATGAAFEEKLSLKSHGGARAYRCCRCRPYLKSKNNCQNQPPSKSTIRPKN DVTNHVVLPVKPKRSISIKLQPRPPNTAGSQKPKLEPPKLLGKRLTSECVSSNPYSKPSS KSFQQCEAGSSTTGELSRKPVGSLNIEQLKTTKQQLTDQGNGKCIDFMNNIHVENESLDN FLKETNKENLLDILTEPERKPDPKLYTRSKPKTDSYNQTKNSLVPKQALGKSSVNSAVLK DRVNKQFVGETQSRTFPVKSQQLSRGADLARPGVKPSRTVPSHFIRTLSKVQSSKKPVVK NIKDIKVNRSQYERPNETKIRSYPVTEQRVKHTKPRTYPSLLQGEYNNRHPNIKQDQKSS QVCIPQTSCVLQKSKAISQRPNLTVGRFNSAIPSTPSIRPNGTSGNKHNNNGFQQKAQTL DSKLKKAVPQNHFLNKTAPKTQADVTTVNGTQTNPNIKKKATAEDRRKQLEEWQKSKGKT YKRPPMELKTKRKVIKEMNISFWKSIEKEEEEKKAQLELSSKINNTLTECLNLIEGGVPS NEILNILSSIPEAEKFAKFWICKAKLLASKGTFDVIGLYEEAIKNGATPIQELRKVVLNI LQDSNRTTEGITSDSLVAETSITSVEELAKKMESVKSCLSPKEREQVTATPRIAKAEQHN YPGIKLQIGPIPRINGMPEVQDMKFITPVRRSSRIERAVSRYPEMLQEHDLVVASLDELL EVEETKCFIFRRNEALPVTLGFQTPES >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_5|2964_bp atggccaaagttccagacatgtttgaagacctgaagaactgttacagtacattggccaga gaccacactgaaataacaacaattacattctcatcatcttattttgacagtgaaaatgaa gaagacagttcctccattgatcatctgtctctgaatcagaaatccttctatcatgtaagc tatggcccactccatgaaggctgcatggatcaatctgtgtctctgagtatctctgaaacc tctaaaacatccaagcttaccttcaaggagagcatggtggtagtagcaaccaacgggaag gttctgaagaagagacggttgagtttaagccaatccatcactgatgatgacctggaggcc atcgccaatgactcagaggaagaaatcatcaagcctaggtcagcaccttttagcttcctg agcaatgtgaaatacaactttatgaggatcatcaaatacgaattcatcctgaatgacgcc ctcaatcaaagtataattcgagccaatgatcagtacctcacggctgctgcattacataat ctggatgaagcagtgaaatttgacatgggtgcttataagtcatcaaaggatgatgctaaa attaccgtgattctaagaatctcaaaaactcaattgtatgtgactgcccaagatgaagac caaccagtgctgctgaaggagatgcctgagatacccaaaaccatcacaggagccaatgag gacgcgtggacgcgcggcacgccggtcctggctacaggcgcggcgtttgaagaaaaactg tcactgaagagtcatggtggggcccgggcctaccgctgctgccgctgtcggccttatcta aaatccaagaataattgccagaatcaaccaccttctaaatctactattagacccaaaaat gatgttaccaaccatgttgttttgcctgtcaaacctaaaaggtccatcagcattaaactc cagcccagaccacctaatactgcagggtcccagaagccgaagttggagccaccaaaactt ctgggcaaaaggctgacttcagaatgtgtttcttctaacccatactctaagccttctagc aagagttttcaacagtgtgaagctggatcgtccacaacaggagaactgtcaagaaaacct gtggggtcacttaatatagagcaattgaaaactacaaagcagcagttaacagatcaagga aatggtaaatgtatagactttatgaataatatccatgttgaaaacgaatctttggataac tttctaaaagaaacaaacaaagagaacttgctcgatatcttaacagaacctgagaggaag ccagatcctaaattatataccagaagtaagccaaagactgactcttataatcaaaccaag aacagtttagttcctaaacaagccttgggcaaaagttcagttaatagtgctgttctgaaa gatagggttaataaacaatttgttggagaaacacaaagcaggactttcccagtaaaatca cagcaactctctagaggagcagatcttgcaagaccaggagtaaaaccctcaaggacggtt ccctctcactttattcggacccttagtaaagttcagtcatcaaagaaaccagtagtcaag aacatcaaagatataaaggttaataggagtcaatatgaaagaccaaatgaaactaagata cggtcataccctgttactgaacagagagtgaagcacaccaaacccagaacataccccagt ttgcttcagggtgaatataacaacagacatccaaacatcaagcaagatcagaagtccagc caagtttgtatacctcagacatcatgtgtactgcaaaagtcaaaagccataagccagagg cctaatttgacagttggcagatttaattcagccattccaagcacccctagcataagacca aatggaaccagtggtaataaacataacaataatggctttcagcaaaaagcacagactttg gactccaagttgaaaaaggctgttccccagaaccattttctgaacaagacagctcccaaa actcaagctgatgtcacaaccgtaaatgggacccaaacaaacccaaatattaaaaagaag gcaacagcagaggatcgaaggaaacaactagaagaatggcagaaatctaagggaaaaacc tataaacggcctcctatggaacttaaaacaaaaagaaaagtaataaaggaaatgaatatt tcattctggaagagcattgaaaaagaagaggaagaaaagaaagcacaactcgaactgtcc agtaaaattaacaacactctgacagaatgtctgaacctcatcgaagggggtgtaccttct aatgaaatacttaacatattgtccagcattcctgaagctgaaaaatttgctaaattctgg atctgcaaagcaaagttgttggcaagtaaaggcacctttgatgttattgggctatatgaa gaggccattaaaaatggggcaacaccaatacaagagttgcggaaagttgttcttaatatc ttgcaagactcaaacagaaccacagaagggattacttctgactctttagttgctgaaact agtataacatcagtggaagagctggccaagaagatggaatctgtgaagtcttgtctttct ccaaaagagagggaacaagtcacggcgacaccccgaatagccaaggcagaacagcataat tatcctggtatcaaattacagattggtccaatccctagaataaatgggatgccggaagtg caagacatgaaatttatcactcctgtacggcgttcgtcgaggattgagcgagcagtgtcc cgctacccagaaatgctgcaggaacacgatttagtagtggcttctcttgatgaactgtta gaagtggaagaaacaaaatgttttatattccgtagaaatgaggcgctgcctgtaacattg gggtttcaaacccctgaatcataa >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_6|95_aa MGQMRAPGSRSRKMGGNVESVMSWRQGSRQLWFIHVEIKSVWYTPPCEAKIITGSSLLLV FLIALSSNLWPDFSAFSSVIQCHILPPALSQLRGK >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_6|288_bp atggggcagatgagggccccgggaagcagaagccggaagatgggagggaatgtggagagt gtgatgtcctggagacaagggagccgacagctgtggttcatccacgtagagataaaatcg gtgtggtacacacctccctgtgaagccaaaatcatcaccggctcctcattgctgctggtg ttcctcattgctctttccagcaacctctggcctgatttcagtgccttctcttcagtcatt cagtgccatattctgcccccagctctttctcaacttagaggaaaatag >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_7|139_aa MYELDPELMDCAMREIPKTAGEALEEGRGPASASPGKSLHSIDAHLSGWITEADDPPSDI SSGQYQPKHCITMPTSVTSLHLIKEAMDHLTSSQEEEWKGPDEKDFSKGTYYQKAAKASN GAQREYANNKKLNTLNKKG >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_7|420_bp atgtatgaacttgaccctgagctcatggactgtgccatgagggaaattcctaaaacagca ggagaggccctggaggaaggcagaggccctgcatcagcaagtccaggcaaaagcctgcat tccatagatgctcatctctctggctggatcaccgaagcagatgaccctccttctgacata tcatcaggccaatatcagcctaaacactgcatcactatgcccacatcagtcacctcactt catctcatcaaggaggcaatggatcacctcacatcatcacaagaagaagagtggaaaggg ccagacgagaaggacttttctaaaggcacctactaccaaaaagctgccaaggcgtccaat ggagcccagagagaatatgctaacaataaaaagttgaacaccctcaataaaaaagggtaa >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_8|301_aa MQQQKWREITDPRHSGSAQNEPFKELMVSEAAMAEVPELASEMMAYYSGNEDDLFFEADG PKQMKCSFQDLDLCPLDGGIQLRISDHHYSKGFRQAASVVVAMDKLRKMLVPCPQTFQEN DLSTFFPFIFEEEPIFFDTWDNEAYVHDAPVRSLNCTLRDSQQKSLVMSGPYELKALHLQ GQDMEQQVVFSMSFVQGEESNDKIPVALGLKEKNLYLSCVLKDDKPTLQLESVDPKNYPK KKMEKRFVFNKIEINNKLEFESAQFPNWYISTSQAENMPVFLGGTKGGQDITDFTMQFVS S >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_8|906_bp atgcagcagcagaaatggagagaaataacagatcccaggcactcaggaagcgctcagaat gagcccttcaaagaacttatggtgtctgaagcagccatggcagaagtacctgagctcgcc agtgaaatgatggcttattacagtggcaatgaggatgacttgttctttgaagctgatggc cctaaacagatgaagtgctccttccaggacctggacctctgccctctggatggcggcatc cagctacgaatctccgaccaccactacagcaagggcttcaggcaggccgcgtcagttgtt gtggccatggacaagctgaggaagatgctggttccctgcccacagaccttccaggagaat gacctgagcaccttctttcccttcatctttgaagaagaacctatcttcttcgacacatgg gataacgaggcttatgtgcacgatgcacctgtacgatcactgaactgcacgctccgggac tcacagcaaaaaagcttggtgatgtctggtccatatgaactgaaagctctccacctccag ggacaggatatggagcaacaagtggtgttctccatgtcctttgtacaaggagaagaaagt aatgacaaaatacctgtggccttgggcctcaaggaaaagaatctgtacctgtcctgcgtg ttgaaagatgataagcccactctacagctggagagtgtagatcccaaaaattacccaaag aagaagatggaaaagcgatttgtcttcaacaagatagaaatcaataacaagctggaattt gagtctgcccagttccccaactggtacatcagcacctctcaagcagaaaacatgcccgtc ttcctgggagggaccaaaggcggccaggatataactgacttcaccatgcaatttgtgtct tcctaa >gi568815596r:112675070_112883770|GENSCAN_predicted_peptide_9|35_aa MILHDIKEHGPVTLQAMDVVVKFDLMMERRDINQA >gi568815596r:112675070_112883770|GENSCAN_predicted_CDS_9|108_bp atgatcctacatgatattaaggaacacgggccagtaaccctccaagcaatggatgtggtg gtgaagtttgacctcatgatggagcggagggatataaaccaggcttag